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Preface 


This book is based on lectures given in quantum theory over the years at 
various levels culminating into graduate level courses given to the students 
in physics. It is a modern self-contained textbook and covers most aspects of 
the theory and important recent developments with fairly detailed presenta- 
tions. In addition to traditional or so-called standard topics, it emphasizes on 
modern ones and on theoretical techniques which have become indispensable 
in the theory. I have included topics which I believe every serious graduate 
student in physics should know. This volume is also a useful source of infor- 
mation and provides background for research in this discipline and related 
ones as well. As such, the book should be valuable to the graduate student, 
the instructor, the researcher and to all those concerned with the intricacies 
of this subject. To make this work accessible to a wider audience, some of 
the technical details occurring in the presentations have been relegated to 
appendices. A glance at the Contents will reveal that although the book is 
fairly advanced, it develops the entire formalism afresh. As for prerequisites, 
a familiarity with general concepts and methods of quantum physics as well 
as with basic mathematical techniques which most students entering grad- 
uate school seem to have is, however, required. The evident interest of my 
students in my quantum theory courses has led me quite often to expand and 
refine my notes that eventually became the book. I often witness many of my 
earlier students, who have already taken my courses, coming back to sit in 
my lectures and continue to do so. Some of these learners are A-students. In 
developing the formalism, at the very early stages, and of the rules for com- 
putations, I have followed a method based on Schwinger’s (1970, 1991, 2001) 
elegant and incisive approach of direct analyses of selective measurements, 
rather than of the historical one, as well as in the introduction of quantum 
generators and the development of transformation theory. The selective mea- 
surement approach has its roots in Dirac’s abstract presentation in terms of 
projection operators and provides tremendous insight into the physics behind 
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the formalism. Other authors who have also shown interest in this approach 
include Kaempffer (1965), Gottfried (1989) and Sakurai (1994). 

Some of the highlights of the book are: 1) Selective measurements. Direct 
analyses of such measurements and extensions thereof lead to the develop- 
ment of the underlying rules of the theory in the most natural and elegant 
way. 2) Wigner’s Theorem on symmetry transformations. This theorem is of 
central importance in quantum theory and provides the nature of symmetry 
transformations and is the starting point on how to implement them. 3) Con- 
tinuous transformations as well as supersymmetry and discrete transforma- 
tions. 4) Hilbert space concepts and self-adjoint operators. 5) General study 
of the spectra of Hamiltonians. 6) Localizability, uncertainties and stability 
of quantum systems, such as of the H-atom, and their relations to bound- 
edness of the corresponding Hamiltonians from below. 7) Decay of quantum 
systems and the Paley-Wiener Theorem. 8) Harmonic oscillators at finite tem- 
peratures, with external sources and coherent states. Bose-Fermi oscillators. 
9) Hyperfine splitting of the H-atom for arbitrary angular momentum states. 
10) The non-relativistic Lamb shift. 11) The anomalous magnetic moment of 
the electron. 12) Measurement, interference and the role of the environment. 
13) Schrédinger’s cat and quantum decoherence. 14) Bell’s test. 15) Quan- 
tum teleportation and quantum cryptography. 16) Geometric phases under 
non-adiabatic and non-cyclic conditions. The AB effect. Rotation of a spinor 
by 27 radians. Neutron interferometry. 17) Analytical quantum dynamical 
treatment of the Stern-Gerlach effect. 18) Ramsey oscillatory fields method 
and applications. 19) Green functions, and how they provide information on 
different aspects of the theory in a unified manner. 20) Path integrals and 
constrained dynamics. 21) The quantum dynamical principle as a powerful, 
simple and most elegant way of doing quantum physics. This approach has 
not yet been sufficiently stressed in the literature and it is expected to play 
a very special role in the near future not only as a practical way for compu- 
tations but also as a technically rigorous method. 22) The stability of matter 
in this monumental theory. This problem is undoubtedly one of the most im- 
portant and serious problems that quantum physics has resolved. The Pauli 
exclusion principle is not only sufficient for stability but it is also necessary. 
23) The intriguing problem of “bosonic matter” and the collapse of matter if 
the Pauli exclusion principle were abolished with the energy released upon 
the collapse of two such macroscopic objects in contact being comparable to 
that of an atomic bomb. 24) Systematics of quantum scattering including a 
detailed treatment of the Coulomb problem. Emphasis is also put on the con- 
nection between configuration and momentum spaces in a scattering process. 
25) Spinors, quantum description of relativistic particles, helicity and rela- 
tivistic equations for any mass and any spin. As the energy and momentum 
of a particle become large enough, the Schrédinger equation, with a non- 
relativistic kinetic energy, becomes inapplicable. One is then confronted with 
the development of a formalism to describe quantum particles in the relativis- 
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tic regime. The chapter in question emerging from this endeavor provides the 
precursor of relativistic quantum theory of fields. 26) Spin & Statistics, as 
probably one of the most important results not only in physics but in all of 
the sciences, in general. The spin and statistics connection is responsible for 
the stability of matter, without it the universe would collapse. 27) Detailed 
mathematical appendices, with proofs, tailored to our needs which may be 
otherwise not easy to read in the mathematics literature. 

The above are some of the topics covered in addition to the more standard 
ones. I have made much effort in providing a pedagogical approach to some 
of the more difficult ones just mentioned. These relatively involved topics are 
treated in a more simplified manner than that in a technical journal with- 
out, however, sacrificing rigor, thus making them more accessible to a wider 
audience and not only to the mathematically inclined reader. The problems 
given at the end of each chapter form an integral part of the book and should 
be attempted by every serious reader. Some of these problems are research 
oriented. With the rapid progress in quantum physics, I hope that this work 
will fill a gap, which I feel does exist, and will be useful, and also provides a 
challenge, to all those concerned with our quantum world. 


July, 2005 E. B. M. 


1 


Fundamentals 


This chapter begins with the development of the formalism of quantum 
theory being sought. The first three sections deal with the early stages of the 
formalism, with setting up the language and the preliminary rules of compu- 
tations. The procedure follows a method based on Schwinger’s elegant and 
incisive manner of direct analyses of selective measurements and extensions 
thereof, and has its roots in Dirac’s abstract presentation in terms of pro- 
jection operators. Several examples of selective measurements will be given. 
The method developed provides tremendous insight into the physics behind 
the formalism and it leads naturally to the notion of probability associated 
with observations, to the generation of states, of wavefunctions in different 
descriptions and to various basic operations occurring in quantum mechanics, 
as well as to the emergence of Hermitian operators and inner-product spaces 
(§1.4). Preparation of pure ensembles of systems and mixtures is the subject 
matter of §1.5. In §1.6, the transmission of photons with given polarizations 
through polarizers is used to provide an illustration of rules developed earlier. 
The physical spaces in which computations are carried out are, in general, 
the Hilbert space, as an extension of the (finite) inner-product spaces en- 
countered before, and the Rigged Hilbert space which are introduced in §1.7. 
Self-adjoint operators, representing observables, and their associated spectra 
are studied in §1.8. We will see that symmetry operations are implemented 
by either so-called unitary or anti-unitary operators and is the content of a 
famous theorem due E. P. Wigner (Wigner’s Theorem on symmetry trans- 
formations) which is proved in §1.9. This theorem is of central importance 
and a key one in quantum physics and deserves the special attention given 
here. The concept of probability and measurement with detailed illustrations 
are given in $1.10, emphasizing, in the process, the physical significance of 
a conditional probability associated with a measurement. This section also 
deals with non-ideal apparatuses that may disturb the physical system under 
consideration. Additional pertinent material related to this section will be 
given in §8.7—§8.9. 


2 1 Fundamentals 
1.1 Selective Measurements 


From the possible values that a physical quantity, under consideration, 
may take on, one may select, through a filtering process, a special range of 
its values or select some of its particular values for further investigations 
by a process referred to as a selective measurement. Some examples of se- 
lective measurements are given in Figure 1.1. Such selective measurements 
may be considered, for example, as a preparatory stage for a system before 
undergoing a subsequent analysis. By a selective measurement, for example, 
one may prepare the momentum of a particle within a given range before 
it participates in a collision process with other particles. As a final selective 
measurement, in a typical experiment, one may be interested in counting 
the number of particles with spin emerging, in turn, from a given physical 
process, with the component of spin, along some, specified direction. 

We consider first physical quantities which may take on only a finite 
number of discrete values. An example of such a physical quantity is the 
component of spin of a particle of spin s, along a given axis, which may take 
on (2s +1) values. Generalizations to physical quantities which may take on 
an infinite number of possible discrete values and/or may take on values from 
a continuous set of values will be dealt with later. 

Suppose that the measurement of a physical quantity A (also called an 
observable), as a physical attribute of a system, can lead to a certain fi- 
nite set of discrete real values {a,a’,a”,...}. In general, the measurement of 
another physical quantity B may destroy the assigned value in a previous 
measurement of the physical quantity A, and both quantities cannot be mea- 
sured simultaneously. In such a case A and B are said to be incompatible. 
Otherwise they are said to be compatible observables. 

To obtain the optimum information about a system one needs to intro- 
duce a complete set of compatible observables, say, {A1,...,A,}. By this 
it is meant that any observable not belonging to this set and which is not 
a function of these observables is incompatible with at least one of them. 
To simplify the notation, we will denote such a complete set {A1,..., Ax} 
of compatible observables simply by A. Each of the values in {a,a’,a”,...} 
given above will then, in general, stand for k-tuplet of real numbers. 

Through a filtering process, as in a Stern-Gerlach experiment (see Fig- 
ure 1.1 (d)), an ensemble of identical systems each having a definite value, 
say a, for A may be prepared. Each one of such prepared systems is said to 
be in the state a. If these prepared systems are fed, in turn, into another 
filtering machine which selects and transmits systems having only the value 
a’ for A, then 100% of these systems will be transmitted if a’ = a and none 
will be transmitted if a’ £ a. 

With a filtering process, we introduce the symbol 


A(a) = |a)(a| (1.1.1) 
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Fig. 1.1. Some examples of idealized selective measurements. (a) Selection 
of a frequency range Av for light. (b) Selection of linearly polarized light. 
(c) Selection of a momentum range Ap = gArB/c for a charged particle of 
charge q initially in a uniform magnetic field B. (d) Selection of a particular 
component of spin by blocking systems with other orientations through a 
filtered beam by a Stern-Gerlach apparatus . A particle of magnetic moment 
ul experiences, classically, a force F = V(u- B) in a non-uniform magnetic 
field B. 


to denote the operation which selects and transmits only those systems in 
state a. From the description given in the previous paragraph, we may con- 
sider the successive operations to be defined through (see Figure 1.2 for an 
example): 


A(a‘)A(a) = A(a’) (a’|a) (1.1.2) 
where (a’|a) = 0(a’,a) is the numerical factor 
, \  fl,for a =a 
d(a’,a) = ‘i foe ata (1.1.3) 


with A(a)0 = O standing for the operation which accepts no system what- 
soever. For a’ = a, the second selective measurement, symbolized by A(a’), 
simply accepts and transmits 100% (6(a’,a) = 1) of the systems prepared by 
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(b) 


Fig. 1.2. (a) Schematic representation of the Stern-Gerlach apparatus of 
Figure 1.1 (d) with 0B/0z 4 0, showing the splitting of a beam of spin s 
particles into (2s + 1) components. (b) Spin 1/2 particles initially prepared 
with component of spin in the +z direction. In an obvious notation here, 
6(+z,+z) = 1 and 6(—z,+z) = 0 for the corresponding numerical factors. 


the first selective measurement, symbolized by A(a). One is naturally led to 
introduce the identity operation 


1= 50 |a)(al (1.1.4) 


which simply accepts and transmits all systems with no discrimination in any 
of the states a corresponding to all the values taken by the physical quality 
A (i.e., by the complete set of compatible observables {A,..., Ax}.) 

If A and B are incompatible, we may still consider the selective mea- 
surement A(a) = |a)(a| followed by the selective measurement A(b) = |b) (b|: 
A(b)A(a). This is a |b)(a|-type of an operation which initially prepares sys- 
tems in state a and then, through another filtering process, transmits a sub- 
ensemble of systems in state b. Since only a fraction of the systems in state 
a are expected to be finally transmitted through the B-filter, the operation 
A(b)A(a), in analogy to (1.1.2), may be defined (see Figure 1.3, for an exam- 


ple) through: 
A(b)A(a) = |b) (a| ((b| a) ) (1.1.5) 


reflecting the fact that it is a |b)(a|-type selective operation and also providing 
a numerical factor (b|a) as a measure of the fraction of the systems initially 
prepared, by the A-filter, in state a, to be finally transmitted through the 
B-filter and found in state b. 
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t 


FRACTION GOES 
THROUGH 


Fig. 1.3. Spin 1/2 particles initially prepared with the component of spin in 
the +z axis direction. This experimental set up is represented, in an obvious 
notation, by |+1/2,Z) (41/2, z| ((+1/2,Z|+1/2,z)) with (4+1/2,2|+1/2, z) 
being a numerical factor providing a measure of the fraction of particles go- 
ing through the middle apparatus with the —Z component of spin blocked 
(as shown by the appearance of the hand). The rules for the computation 
of numerical factors such as (+1/2,2|+1/2,z) will emerge naturally later. 
Here for the numericals ™, m in (7, Z|m, z) we have, ™ = +1/2, m = +1/2, 
for example, corresponding to spin components along the +Z, +z directions, 
respectively. 


Clearly more elaborate successive selective measurements (see, for exam- 
ple, Figure 1.4) may be considered and one may establish, in the process, the 
following associative law of the measurement symbols: 


A(c)[A(b)A(a)] = A(c) |b) (al ((b|@) ) 
= |e) (al ( (c|b) (bla) ) 
= [A()A(b)] A(a). (1.1.6) 


At this stage, it is worth re-examining the role of incompatible observables. 
Corresponding to an observable A, we note first from (1.1.2) that 


A(a')A(a) =0, a’ Aa. (1.1.7) 


We recall that what the latter means is that the first filtering operation, 
via A(a), has prepared systems in the state a, and for a 4 a’, the second 
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f FRACTION GOES 
FRACTION REDUCED THROUGH 
HERE 


Fig. 1.4. Spin 1 (massive) particles prepared with the component of spin in 
the +z direction. This experimental set up is represented, in an obvious no- 
tation, by successive measurement symbols (|+1,2) (+1,2|) (|-1,2) (-1, 2]) 
x (\41, 2) (41, 2|) = (|+1, 2) (41, 2|) ((41,2|-1, 2) (-1,2]41, z)). The nu- 
merical factor (+1, Z| —1,2) (—1,Z|+1, z) is a measure of the fraction of the 
number of particles going through the five apparatuses. [The 0 in |0,Z) cor- 
responds to a spin component perpendicular to the Z-axis.] 


operation, via A(a’), rejects all such systems. That is, no systems appear in 
the final stage after the application of the two filtering processes. What is 
quite remarkable, is that if we insert a B-filter, via the application of A(b), 
between the two successive operations in (1.1.7) we obtain 


A(a’)A(b)A(a) = Ja’) (al ((a’|b) (b|a) ) (1.1.8) 


and for two incompatible observables A and B, (a’|6) (b| a) is not necessarily 
equal to zero. In such cases 


A(a’)A(b)A(a) # 0. (1.1.9) 


That is, by making the selective B-measurement, via A(b), after the selective 
A-measurement, via A(a), followed finally by a selective A-measurement, via 
A(a’), some systems may emerge in the state a’ even if a’ # a (!), although 
this would not happen if the B-filter were absent, thus increasing the fraction 
of systems finally transmitted from zero to a possible non-zero value. 
Re-iterating the above remarks, is that although the first selective mea- 
surement via A(a) makes sure that no systems are transmitted through it in 
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NO PARTICLES 
EMERGE HERE 


PARTICLES MAY 
EMERGE HERE ! 


(c) 


Fig. 1.5. (a) Experimental set up involving spin 1/2 particles showing 
that no particles finally emerge with component of spin along the +z di- 
rection. (b) Orientation of the Z-axis relative to the z-axis. (c) The inser- 
tion of the middle filtering (Stern-Gerlach) apparatus may allow some par- 
ticles with component of spin along the +z direction to appear in the fi- 
nal stage knowing that the first filter has rejected particles in such a state! 
This is because (+1/2, z|+1/2, 2) (+1/2,Z|—1/2,z) # 0. This provides an 
illustration of the fact that the observables associated with measuring of 
components of spin along different orientations, as shown in (b), are in- 
compatible. The rules for the computation of numerical factors such as 
(+1/2, z|+1/2,Z) (41/2, Z| —1/2, z) will be worked out later. 


a state a’ # a, the B-filter allows such systems in a state a’ # a be finally 
transmitted after the selective measurement, via A(a’), is carried out. 

An example of the operations in (1.1.7) and (1.1.9) is given in Figure 1.5 
illustrating the above remarks. Another example worked out in some details 
dealing with polarization of light will be given in §1.6. 

The filtering Stern-Gerlach devices considered above are ideal and allow 
simple deductions to be made without going into the subtleties of their per- 
formance. A fairly detailed account of the Stern-Gerlach effect will be given 
in §8.14. 
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1.2 A, B, C to Probabilities 


We obtain a useful identity involving the numerical factors such as (c|b). 
To this end we insert the identity operation in (1.1.4) as shown below 


A(c)A(b) = A(c) 1. A(b) 
= |e)(e (Sieve ({b) (Ol ) 
= |e) (b| (ela) (a|8) + (cla’) (a! |b) +...) 


=|c)(b bs (c\a) i) (1.2.1) 


a 


I 


and infer from (1.1.5), in a corresponding notation, that 


(c|b) = S— (ela) (ald). (1:9:9) 


a 


In particular, if C' is chosen to be the observable B, with c replaced by, 
say, 0’, we have from (1.1.3) and (1.2.2) 


S- (b'|a) (a|b) = 6(0, b) (1.2.3) 
and for 0! = 
S © (bla) (a|b) = 1. (1.2.4) 


We note that under the arbitrary scale transformations: 


|a)(b| — Ja) (b] (A(b)/A(a)) (1.2.5) 


and 
(a|b) — (ab) (A(a)/A(0)) (1.2.6) 


all the equations involving the selective measurements and successive selective 
measurements, (1.1.1)—(1.1.9), together with the identities (1.2.2)—(1.2.4) for 
the numerical factors, remain invariant. Because of this arbitrariness under 
such scale transformations, a numerical factor (a|b), although of physical 
interest as discussed in §1.1, cannot have a physical meaning by itself. The 
combination (a|b) (b|a), however, from (1.2.6) remains invariant. For the 
subsequent analysis, we introduce the notation 


(a|b) (b| a) = pa(d). (1.2.7) 


At this stage the following basic points should be noted which are relevant 
to the numerical factor p,(b): 
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(i) 
Pala) = 1. (1.2.8) 

(ii) As mentioned above, p_(b) is invariant under the scale transformations 
in (1.2.6). 

(iii) As already noted in §1.1, the factor (b|a) in it, for example, is a measure 
of the fraction of systems all in state a that will be found in state 6 after 
the corresponding selective measurement. 

(iv) pa(b) satisfies the normalization condition 


S/ pa(b) =1 (1.2.9) 
b 


as follows from (1.2.4). 


Accordingly, pa(b) is qualified to represent the probability of observing a 
system in state b knowing that it was in state a prior to the B-measurement. 
A probability, however, has to be a real and non-negative number p,(b) > 0. 
One may satisfy both of these conditions if one requires that 


(ab) = (bla)*. (1.2.10) 


Where * denotes complex conjugation. The probability p,(b) will be then 
given from (1.2.7) to be 


pa(b) = |(b|a)|” > 0, (1.2.11) 


where all the initial systems prior to a B-measurement were in state a. The 
numerical factor (b|a) which, in general, is a complex number is referred to 
as the amplitude of obtaining the value b for a B-measurement on a system 
initially known to be in a state a. 

The scale factors A(a), A(b), in (1.2.5), (1.2.6) must then obey the rule 


(\(a))" = 1/X(a). (1.2.12) 
That is, they are necessarily phase factors: 
d(a) = e'?() (1.2.13) 


i.e., with ¢(a) denoting a real number. 

As an application, consider feeding systems, all prepared in the state a, 
into a B-filtering apparatus via the application of the A(b)-operation, and 
then into a C-filtering apparatus, via the application of the A(c)-operation. 
This sequence of measurements, including the preparatory one, is represented 
by 

A(c)A(b)A(a) = |e) (al ((e|b) (b] a) ) (1.2.14) 
(see Figure 1.4, for an explicit situation). The probability of obtaining the 


value b for a B-measurement, then the value c for a C-measurement, on a 
system initially in the state a is given by 
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Prob, {b then c] = |(c|b) (b|a)|” 
= |(c|b)|? |(b]a)|? 


= po() Pal). (1.2.15) 


It is convenient to introduce the concept of the trace operation. For any 
number a, we define 


Tr [a |b) (a| | = a (ald). (1.2.16) 
Hence pa(b) may be rewritten as 
pa(b) = Tr [A(b)A(a)]. (1.2.17) 


We observe from (1.2.11), that the probability of obtaining the value 6 of 
a measurement of the observable B on a system in the state a is the same as 
the probability of obtaining the value a of a measurement of the observable A 
on a system in the state b. This suggests to consider the reversal of a sequence 
of selective measurements, called the adjoint, defined as follows 


({b) (al)" = |a)(b| (1.2.18) 
(( Jc) (d] ) |b) {a i = (|a)(b|) (|d)(c}). (1.2.19) 


Upon expanding the left-hand and right-hand sides of (1.2.19) we may infer 
from (1.2.10) that 
(d|b)' = (d|b)*. (1.2.20) 


That is, the adjoint transformation introduces the complex conjugation of 
numerical factors. 


1.3 Expectation Values and Matrix Representations 


1.3.1 Probabilities and Expectation Values 

Consider systems all in the state a being fed into a B-filtering machine 
(see, e.g., Figure 1.6) transmitting systems in states b, b’, .... The probability 
that a transmitted system is found in state b is given by p,q(b) in (1.2.11). 
Accordingly, the expectation value of the observable B of systems in state a 


is 
S/ bpa(b) = (B), (1.3.1) 

b 
which we have conveniently denoted by (B),. The latter may be rewritten as 


(B), = 7b Tr [AW)A(a)] 
b 
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=) 0 Tr [bA(b)A(a)] (3:3) 
b 
(see (1.2.16), (1.2.17). This suggests to introduce the object 
B=)_bA(b) = ¥_ b/d) (| (1.3.3) 
b b 


as a linear combination of B-selective measurement symbols which has far 
reaching consequences. For simplicity of the notation, we have used the same 
symbol B for this object as the physical quantity it represents. In particular 
we may write 


(B), = Tr [BA(a)]. (1.3.4) 


Fig. 1.6. A beam of spin 1/2 particles all with component of spin along 
the +z direction is fed into a Stern-Gerlach apparatus and is split into two 
beams with components of spin prepared along the +Z and —zZ directions, 
respectively. The initial beam is referred to as being completely polarized. 


The expression in (1.3.3) in turn suggests to introduce more general ob- 
jects like 
M = 5) (b|M|b’) |b) (b" (1.3.5) 


b,b! 


where (b|M/|b’) are numerical factors. Immediate consequences of this defin- 
ition are 


MyM, =~ (= (b|M, |b") (B" ii) lb) (0 (1.3.6) 


b,b’ b’” 
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= S/ (b|M|b) (1.3.7) 
b 
Mt = > (b|M|o’)* ([b)(o"|)" 
b,b’ 
Mt = 5° (b|M|b’)* |b’) (0. (1.3.8) 
b,b! 


From (1.3.6)—(1.3.8), we may infer that the numerical factors (b|M|b’) 
may be interpreted as the matrix elements of a matrix labelled by the possible 
values of the observable B. In particular, we note, according to the definition 
(1.3.5) that 


My Mz = S~ (b| Mi Mz|0") |b) (8 (1.3.9) 
b,b/ 


and upon comparison with (1.3.6) we obtain the expected result of matrix 
multiplication definition that 


> (b|.Mi|b"") (8 |Mz|b") = (b|Mi Mob!) (1.3.10) 
b” 


One may rewrite the expression in (1.3.5) directly for the object M? and 
compare it with (1.3.8) to conclude that 


(b| Mt |b’) = (b'|M|b)". (1.3.11) 
Upon multiplying M in (1.3.5) by |b’) (| and taking the trace gives 
Tr [M |b’) (b|] = (b|M|b”) . (i342) 
For the identity operation the latter gives 
(b|1|b”) = Tr [1 |b”) (6) ] (1.3.13) 
and from (1.1.4) and (1.2.3) that 
(b|1|b”) = 5(b, b”) (1.3.14) 


which from (1.3.5) leads finally to 


1=5_ [d)(o| (1.3.15) 


b 


emphasizing the fact that the identity operation accepts all systems, with- 
out discrimination, whether it is written in the A-description or the B- 
description. 

M in (1.3.5) may be also rewritten in a mixed-description as 
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M = 5° (b|M|a) |b) (al. (1.3.16) 


a,b 
To show the equivalence of (1.3.16) and (1.3.5) we note from the former that 
(b|M|a) = Tr [M |a) (d| J. (i307) 


On the other hand, upon multiplying M in (1.3.5) by |a)(b| and taking the 
trace yields 
Tr [M |a)(d|] = ys (b|M |b’) (b' |a). (1.3.18) 
b/ 
Finally upon multiplying M in (1.3.5) by the identity in the A-description 


we have 
M= S- ( (b|.M|b') (b'|a) ) |b) (al . (1.3.19) 
a,b,b! 
Upon comparison of (1.3.18)/(1.3.19) with (1.3.16)/(1.3.17) establishes the 
equivalence. Another equivalent expression for M is 


M = 5° (a|M|b) |a)(b]. (1.3.20) 
a,b 


The following rules then easily follow: 


M, Mz = > (b|My|a) (a|Mole) |b) (el (1.3.21) 
a,b,c 
(a|M"|b) = (b|M|a)* (1.3.22) 


The equivalence of the descriptions of M given in (1.3.5), (1.3.16) and 
(1.3.20) may be then summarized by multiplying M from the right and left by 
the identity written in any description and noting, in particular, the identity 


S~ (a|M|b) (b|¢) = (a|M|c). (1.3.23) 


b 


1.3.2 Representations of Simple Machines 


For a concrete example of an object of the form in (1.3.5), consider the 
following machine (apparatus). It consists of two parts. The first part is 
a Stern-Gerlach apparatus (a filter) which transmits a beam of spin 1/2 
particles with their spin components prepared in the +z direction of the z- 
axis, while the beam of particles with spin components in the —z direction is 
blocked. The resulting beam is then fed into a Ramsey apparatus,! consisting 
of the second part of the machine. In its simplest description, this apparatus 
consists of an oscillatory time-dependent magnetic field B(¢) switched on for 
some time 7, followed by a uniform time-independent magnetic field Bo for 
some time T, and then finally the oscillatory magnetic field B(¢) is switched 
on again for an additional time rT. 


' Ramsey (1990) based on the 1989 Nobel Prize in Physics Lectures. The under- 
lying theory will be discussed in some detail in §8.8. 
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RAMSEY APPARATUS 
M 


Fig. 1.7. A machine (apparatus) M consisting of a Stern-Gerlach apparatus 
(a filter) transmitting a beam of spin 1/2 particles with spin components 
along the +z direction, with the beam then fed into a Ramsey apparatus as 
described in the text. The machine M may be conveniently represented by 
the expression in (1.3.24), (1.3.25) and is of the form in (1.3.5). 


The above machine is depicted in Figure 1.7, and may be represented in 
the form 


M= So |m,2)(m,z|M|m', 2) (m’, 2| (1.3.24) 
m,m/=+1/2 
with 
ba, 2 Mia! = oak £170) Gee (1.3.25) 


where the Ramsey apparatus with an initial beam of particles with spin 
component in the +z direction fed into it is represented by the quantity 
(m, z|M|+1/2, z). 

Other examples of objects of the form in (1.3.5), (1.3.16), with increasing 
complexity, are given in Figure 1.8. 

From (1.3.5), (1.3.16), the machines 4, Mo, M3, M4 in Figure 1.8 may 
be then represented in the simple forms: 


M, = A(a) (1.3.26) 


Mz = 5 — |b) (bla) (al (1.3.27) 
b 


Ms = 5~ |b) (b| a) (al (1.3.28) 


beA 


Mz = S~ |b) (b| Mala’) (a'| (1.3.29) 


b,a’ 


with 
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Fig. 1.8. Examples of four machines which may be represented in the form 
(1.3.5), (1.3.16). These machines select only systems for which a measurement 
of a physical quantity A, characteristic of the systems, yields a given fixed 


value a and reject all other systems. Machine M, is the simplest one repre- 
sented by the measurement symbol A(a) in (1.1.1). Mg is a filtering machine 
which transmits only systems with given values of a physical quantity B, 
characteristic of the systems, obtained through a filtering process, as in a S-G 
apparatus, within some specified range A of b values. M2 is identical to M3 
except that it transmits all the systems with any b values without discrimi- 
nation. M, is a generalization of the machine M in Figure 1.7, where after 
the selection of systems with a given a value, the systems may, in general, 
go through complicated processes, such as the collisions of the underlying 
particles, absorptions, and so on, and the machine, then through a filtering 
process, transmits the emerging systems having b values of a physical quantity 
B characteristic of the systems. 


(b| Myla’) = 6(a’, a) (b| Mala) (1.3.30) 


and the (b|M,|a) are some complex quantities. [See also the representation 
of the machine M of Figure 1.7 in (1.3.24), (1.3.25).] 

The adjoint operation in (1.3.22) for a machine introduces, formally, to a 
machine operating in reverse, and from (1.3.16), 


Mt = 5° Ja) (b| (b|M|a)*. (1.3.31) 
a,b 


The successive operations of two machines M2 followed by M;, is given in 
(1.3.21). The significance of the trace operation in (1.3.18) will be considered 
in (1.4.18). 


1.4 Generation of States, Inner-Product Spaces, 
Hermitian Operators and the Eigenvalue Problem 


One may regard the significance of a selective measurement |b)(a|, after 
the selection of all systems in a state specified by a value a of a physical 
quantity A, characteristic of the systems, as a two-stage process. The first 
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being the annihilation of the selected systems in state specified by a and 
subsequently, as the second stage, the production of systems in a final state 
specified by a value b of a physical quantity B, characteristic of the systems, 
with the (a| and |b) symbols being associated with the two stages of the 
process just discussed. 


1.4.1 Generation of States and Vector Spaces 


The symbols |b), for example, acquire a significance mathematically as 
they may be represented as vectors which generate a vector space of di- 
mensionality directly obtained from the associated observable B (see (1.3.3)) 


representing a complete set of compatible observables, say, B = {Bi,..., By} 
with b = {bi,...,b,}. The dimensionality of the generated vector space coin- 
cides with the number of different vectors that one may define as bj,..., by 


take on consistently their allowed real physical values. It is often convenient, 
but not always so, to use the notation |b) as well for the corresponding vector 
representation. The 0 vector, in this vector space, is associated with the mea- 
surement of producing no systems at all. The state of a system characterized 
by a given fixed values taken by the k-tuplet {b),...,b,} corresponding to 
the complete set of observables {B,,...,B;,} is also often denotes by |b). 

To see how such a vector space, as mentioned above, arises, consider, for 
example, the function of machine My, in Figure 1.8 which is represented in 
the form (see (1.3.29), (1.3.30)) 


Mz = S~ |b) (6|Mala) (al (1.4.1) 
b 


with the (b|M,4|a) denoting some complex quantities. This machine may be 
considered to operate, effectively, in two general stages. In the first stage, it 
annihilates all the systems selected in state specified by a, and finally creates 
systems in some new state |W) given by 


[b) = 9718) (61 Mala) (1.4.2) 
b 


for a priori given fixed value a of a physical quantity A characteristic of the 
systems. From (1.4.1), the machine M4 may be then also represented in the 
compact form 


Ma = |) (al (1.4.3) 


for some fixed value a. 

That is, starting from a system initially prepared in a state |a) and fed 
into the machine My, the latter produces a system in some final state which 
may be also denoted by |w). From (1.1.3), this operation procedure of the 
machine M, may be then defined by 


M,|a) = |¥). (1.4.4) 
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With (b|M,4|a) as some complex quantities, the state |) in (1.4.2) is 
written as a linear combination of the states |b). This is very much as having 
a vector space with the |b), corresponding to a complete set of observables 
characteristic of the system into consideration, providing a basis for such 
a vector space. To define a vector space, however, we have to consider the 
addition of states such as |) and also define a O vector in it. This is done 
below. 

To the above end, for each given fixed value c of some physical quan- 
tity C, characteristic of the systems in consideration, consider the state |®,) 
produced by a B-filtering machine 


Mo = Y_ |b) (0| (1.4.5) 
beA 
with a given fixed range A of b values, from an initially prepared state |c): 
= 5° |b) le) (1.4.6) 
beA 


written as a linear combination of |b) states. On the other hand, for any given 
fixed value a, we may produce a state |v) from the successive operations of a 
machine Mj, followed by that of machine Mo from an initially prepared state 
|a), where 


Mi = Se) Celia) (a (1.4.7) 


which is of the same form as My, in Figure 1.8 with observable B replaced by 
an observable C’. That is, 


=> (= 1) (b|) lan 


beA @ 
=i (c|Mila) 55 |b) (bl) 
beA 
= S~ |b) > (le) (e| Mila) (1.4.8) 
beA c 


again written as a linear combination of |b) states. 
Upon comparison of the second equality in (1.4.8) with (1.4.6), we obtain 


api |®.) (c| Mia) (1.4.9) 


with, in general, (c|M{|a) denoting complex quantities, the vectors |x), |®<) 
are written as linear combination of |b) states. Equation (1.4.9) provides a 


18 1 Fundamentals 


linear superposition of vectors |®,) in the underlying vector space leading to 
a vector |) also in the same vector space. 

The O vector in the underlying vector space may be simply defined by 
carrying out a selective measurement via the symbol A(b’) with b’ ¢ A, on 
the state |®,) in (1.4.6) 

A(v’) |®.) = 0. (1.4.10) 

In analogy to (1.2.17), (1.1.5), we consider the successive measurement 
symbols 

A(b) |W) eb] = |b) (| ((6] 2) ) (1.4.11) 
with (b|) as a measure of the fraction of systems found in the state |b), and 
from (1.2.17), 

pus() = |b ¥)/? (1.4.12) 
is interpreted as the probability that the physical quantity B, characteristic 
of the system, takes the value 0 if the system is in the state |q), provided 


S-|(old)P? =1 (1.4.13) 
b 


giving a normalization condition for the generally complex quantities (b|~). 

Now we use the expression for |v) in (1.4.2), and consider the application 
of the selective measurement provided by A(b) of a system in the state |~) 
giving 


|b) (b| Mala) = |b) (b| ) (1.4.14) 
(see also (1.3.23)) leading to the identification 
(b| Mala) = (b| b) = o(0) (1.4.15) 


where with (b|w), in general, a complex quantity, we have denoted it by w(b). 
Thus we may rewrite (1.4.2) as 


1w) = S— |b) W(0). (1.4.16) 
b 


Conversely, with a a priori given and fixed, and (b|M,|a) denoting, in 
general, some complex quantity which may be denoted, say, by 7)(b) in (1.4.1), 
equations (1.4.2), (1.4.14) lead to the identification (b|~) = ~(b). Finally note 
that the application of the selective measurement, denoted by A(b), on the 
state |7)) in (1.4.16) confirms this notation. 


1.4.2 Transformation Functions and Wavefunctions in Different 
Descriptions 


Equation (1.4.16) emphasizes again the expansion of the state |y) in terms 
of the |b) states, with the b values corresponding to a complete set of compat- 
ible observables characteristic of the system into consideration. |q)) is referred 
to as a state vector, and 1)(b) as the wavefunction in the B-description. 
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Upon the application of a selective measurement A(a), of a physical quan- 
tity A characteristic of the system in state |w), one obtains 


(a) = D7 (ad) W(b) (1.4.17) 


b 


providing the transformation law of a wavefunction from the B-description to 

the A-description, with the amplitude (a|b6), as is referred to below (1.2.11), 

is also called the transformation function from the B- to A-descriptions. 
The trace operation 


Tr [|a)(b| Ma] = (b|Mala) = (6) (1.4.18) 


corresponding to the machine My, in (1.4.1), as also introduced in (1.3.29), 
gives the wavefunction ~(b). 


Fig. 1.9. A machine (apparatus) M consisting of two parts, a Stern-Gerlach 
apparatus (a filter) transmitting a beam of spin 1/2 particles with spin com- 
ponents along the +z direction, with the beam then fed into a region of 
constant magnetic field. As far as a particle is concerned the machine M 
may be represented as in (1.4.24). If the initial state of a particle is different 
from |+1/2,z), such as being in a state |+1/2, x), the interesting situation of 
absorption by the machine arises. 


1.4.3 An Illustration 


As an illustration, consider the simple machine (apparatus) M in Fig- 
ure 1.9. It consists of two parts. The fist part is a filter which transmits par- 
ticles of spin 1/2 with spin components along the +z direction while those 
with components along the —z direction are not transmitted. If we represent 
the state |+1/2, z) by 


}+1/2,2) = (4) (1.4.19) 


and its adjoint by 
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7 
1 
(z,+1/2| = (5) — (1 0) ; (1.4.20) 
Then the filter may be represented by the selective measurement symbol 
1 10 
A,(+1/2) = (5) (1 0) = ( : : (1.4.21) 


The second part consists of a region of constant magnetic field 
B = (B,0,0). (1.4.22) 


Let p denote the magnetic dipole moment of a particle. If to is the time 
spent by a particle in the magnetic field B, then as we will see later when 
studying the physics of spin 1/2 in Chapter 8 (see (8.1.28) later), that as far 
as a particle is concerned, the second part of the machine may be represented 
by the 2 x 2 (non-Hermitian) matrix 


Bt Bt 
cos ae isin is Ps a 
M(B) = : (1.4.23) 
_. UBto —MBto 
isin i cos - 


From (1.4.21) and (1.4.23), the combined machine M in Figure 1.9 may 
be then represented simply by 
UBto 0 
M= (1.4.24) 
UBto 
a 0 


cos 


isin 


Hence, if the particles are initially prepared in the state |+1/2,z), so 
that 100% of them are transmitted through the first stage, via the filtering 
machine in (1.4.21), the machine M in Figure 1.9, from (1.4.4), produces 
particles each in the state 


|b) = M|+1/2, z) 


cos note 0 ; 
is 6 (1.4.25) 
cp 4 
A 
or 
|b) = @ cos to +i (7) sin HA to (1.4.26) 


We may rewrite (1.4.26) as 
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I) = V+1/2, 2}) |+1/2, 2) + VY -1/2, z}) |-1/2, 2) (1.4.27) 


with 


w({+1/2, z}) = cos Ho to, w({-1/2, z}) = isin Ht (1.4.28) 


By comparing (1.4.28)/(1.4.27) with (1.4.15) /(1.4.16), we may infer that 
B B 
(+1/2,2| 0) = cos to, (-1/2,z|b) =isin “to (1.4.29) 
and from (1.4.12), (1.4.13) that 


B B 
I(+1/2, 21 ¥)? + (1/2, 2)? = cos® to + sin? to 


I 


(1.4.30) 


as expected. 
As will be seen later (§8.1), that particles in the states |£1/2,x) may be 
represented as 


|41/2,2) = 5 @ , (ie a= 5 e (1.4.31) 


For the selective measurement symbol A,(+1/2), we then have the represen- 
tation 


Ae(41/2) = 5 c y . (1.4.32) 


Upon making a selective measurement via A,(+1/2) of a system in the 
state |¢)) in (1.4.26), we obtain, in analogy to (1.4.17), 


1 1 
v2 v2 


where we have used the normalizability of the state |+1/2,2) as given in 
(1.4.31), and the identifications in (1.4.28). 
Upon the comparison of (1.4.33) with (1.4.17) we obtain 


P{+1/2,2}) = Fev({+1/2, 2}) + Fev({-1/2, 2}) (1.4.33) 


(41/2, 2|-+41/2, 2) = = ey Rea ee (1.4.34) 


Similarly, one derives that 


ie pole ee (1.4.35) 


(—1/2,2|-1/2,z) = —- : 


a (1.4.36) 
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thus obtaining the transformation function (m’,x|m,z) for, m’,m = +1/2, 
and we have developed the transformation from a description of spin along 
the z-axis to one along the z-axis, with wavefunctions w({m, z}), w({m’, x}) 
in these descriptions, respectively. 

With particles in an initial state |+1/2,z) fed into the machine M in 
Figure 1.9, the presence of the filter as part of the machine seems redundant 
since 100% of the particles in this initial state will be transmitted through 
the filter. The interesting situation then arises if the particles are initially in 
a different state, say, in the state |+1/2, 2), which we now consider. 

According to (1.4.34), only 50% of the particles will go through the filter, 
i.e., we will have absorption, and the machine M will produce a particular 
state |®) ps from |+1/2, x) reflecting this fact. In detail 


|®) apg = M|+1/2, 2) 


UBto 


cos 0 


cb tty GON Fa BB 16 
— V2 cos Fy 10 (5) + on? pe iD (1.4.37) 


and as expected 


2 
tol + (1.4.38) 


| 1 UB 
— cos — 
v2 oh 
showing 50% absorption. 

In general, consider an initial state |c), with c a priori fixed value taken by 
a different physical quantity C’ characteristic of the system into consideration, 


then from (1.4.3), machine M, in Figure 1.8, will produce, from the initial 
state |c) fed into it, a state 


I®) aps = M4 |c) 
= |#) (ale) (1.4.39) 
or from (1.4.16), (1.4.15) 


IP) aps = > 1b) (bY) (ale). (1.4.40) 


b 


From (1.4.12), (1.4.40) we then have 
[1 - Kall? | x 100% 


absorption by the machine M4. 
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1.4.4 Generation of Inner Product Spaces 


Finally, we are led to consider the measurement symbol |wW)(¢|, with |v), 
|b) two states, written as linear combinations of |b) states. The trace operation 
(see (1.2.16)) in the following 


Tr [|v al] = (eld) (1.4.41) 


leads to the definition of an inner product, which from (1.4.16) may be written 
as 


(6|¥) meee) (1.4.42) 


where we have used the property that the adjoint transformation takes the 
complex conjugation of numericals, 


=O W = I (1.4.43) 
b 


denoting the right-hand side of the above by (¢| as an expansion in terms of 
the adjoints (b|. The vector space generated by the adjoints (b| is called the 
dual vector space to the vector space generated the |b) vectors having similar 
properties as the initial vector space itself. A vector |b) and its adjoint (b| are 
often referred to as a ket and as a bra, respectively. 

From (1.2.17), we also have, 


Pull) = (dll? (1.4.44) 


representing the probability that the system is found in the state |@) if it 
is initially in the state |W), and the trace operation in (1.4.41) gives the 
corresponding amplitude (| w). 

A vector space on which an inner product is defined is called an inner 
product space. Thus with the inner product given in (1.4.42), we have thus 
introduced such an inner product space from the vector space generated by 
the |b) vectors. 

It is easily seen that (1.4.42) actually provides a definition of an inner 
product by explicitly verifying the following properties, 


(b|b) = dW) (1.4.45) 


(g|¥)" = (14) (1.4.46) 


(ag|o) =a" (oly), — (Pla) = a(o|) (1.4.47) 
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(iv) 
(o/d1 + b2) = (9| (141) + |v) ) 
= (o| 41) + (o| ¥2) (1.4.48) 


for any complex number a. In regard to | + w2) see (1.4.9). 
The length or the norm of a vector |) is defined by 


Ill = Vole) (1.4.49) 


and hence the normalization condition (1.4.13) reads 
I? = Sov)? = 1. (1.4.50) 
b 
From (1.4.17), we note that 


S7 *(b)v(b) = $5 6*(a)(a) (1.4.51) 
b a 


establishing the description independence of an inner product corresponding 
to any two physical quantities B and A, characteristics of the systems in 
question. 

The following basic inequalities follow from the definitions of the inner 
product and the norm of vectors given above 


(el e)| < llell lvl (1.4.52) 
referred to as the Cauchy-Schwarz inequality, and 


I+ vl < Hell + [lvl (1.4.53) 


referred to as the triangular inequality. 


1.4.5 Hermitian Operators and the Eigenvalue Problem 


The definition in (1.3.3), together with (1.1.2) and (1.1.3), lead to the 
eigenvalue problem of a Hermitian operator 


B |b) = b|b) (1.4.54) 


or 


B; |b) = 6; |b), a re (1.4.55) 
where B = {B,,..., By}, |b) = {b1,..., by}. The Hermiticity condition reads 


Bi=B (1.4.56) 
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and follows from the reality of the b values and the definition of the adjoint 
through (1.2.18)-(1.2.20). 

Clearly, for any given value 6; in (1.4.55), there will be, in general, more 
than one vector |b),...,0:,..., bx), satisfying (1.4.55), as the other b; (j # 
i) take on, consistently, their allowed values. The number of such distinct 
vectors |b1,..., 6%) defines the degree of degeneracy of the eigenvalue b; of B;. 
Once all the eigenvalues |b,,...,b,) are specified, then there will be only one 
eigenvector |b;,...,b,) corresponding to the eigenvalue b in question. That 
is, by definition, the eigenvalue b of a complete set of compatible observables 
{B,,..., By} = B is non-degenerate. The set of values {b} of B is called its 
spectrum, and the set of vectors |b) provides a basis for the generated vector 
space, with 


(b|b') = 6(b,0') = 116 (b;, b,) (1.4.57) 


providing the orthonormality condition of the vectors |b), as follows from 
(1.1.3). 

In the case when the eigenvalues b take on an infinite number of discrete 
values and/or continuous values, one is faced with convergence problems. 
These problems are dealt with by defining the concept of a Hilbert space 
(81.7), as an underlying vector space, and introduce in turn the concept of 
self-adjoint operators (§1.7), representing observables, operating on vectors 
in such a vector space. 


1.5 Pure Ensembles and Mixtures 


In the elementary selective measurements experiments discussed in §1.1— 
§1.3, all the initial systems were prepared in some definite state a. Such 
a collection of systems all prepared in the same state is referred to as an 
ensemble or more appropriately as a pure ensemble. More generally, one may 
design an apparatus which may prepare an ensemble of systems, for further 
experimentation, such that every system in the ensemble may be represented 
by a creation symbol |) in the form 


= S$" |b) ¥(b) (1.5.1) 
b 


satisfying the normalization in (1.4.50). 
The expectation value of an observable B for systems in the state |w) is 
then 


|B) y = Tr [b)(b| B] 
= = 2 HoG) = (W|Ble) (1.5.2) 


26 1 Fundamentals 


(see (1.3.1)—-(1.3.4), (1.3.17), (1.4.18)). 
The operator 


p= |v) (1.5.3) 


is called the density or statistical operator for the ensemble of systems all in 
the state |). Immediate properties of p, as defined in (1.5.3), are 


Teo) = (1.5.4) 


OLE ioe | aah (125.5) 


Fig. 1.10. A figure displaying n machines preparing independently n ensemble 
of systems in states |q1), |W2), ..., |Wn), respectively, constituting wi%, w2%, 
..+, Wn% of the total number of prepared systems. 


One may also consider more general preparatory procedures such as de- 
scribed in Figure 1.10, where we have, say, n machines which produce (pre- 
pare) independently n ensemble of systems, respectively, in states |w1), |wWe), 
..., Un) with w1% of systems in state |y), we% of systems in state |22), 
..+; Wn% of systems in state |w,,), respectively, with 


a =1. (1.5.6) 
t=1 


A collection of such ensembles is called a mixture. The expectation values 
of an observable B in these different states are (W1|Blv1), (W2|Bluv2), ..., 
(Un |BlUn) and hence the average over all the systems produced is given by 


(B) => wi (bi | Bhi). (1.5.7) 


i=l 


In this case the statistical operator associated with the mixture is given 
by 


p= So wi i (Wal (1.5.8) 


i=1 
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and it is easily checked that the latter may not be rewritten in the form 
|) (| for any state |w). In terms of p, (1.5.7) reduces to 


(By-= Te |pB).. (1.5.9) 
In particular, 
Teg) =e pa: (1.5.10) 
t=1 


Unlike the pure ensemble case in (1.5.5), for a mixture we have 


Tr [p?| = So? < a (1.5.11) 
i=1 i=1 


that is, 
Tr |p| <1 (1.5.12) 
since for at least two of the w1, we, ..., Wn non-zero, say, w; and w;, w? < w; 
(and wi <wj). 
In an extreme case when all the states |v1), ..., |tn) occur equally likely, 


w; = 1/n and we have, what is called, a completely random mixture. 


KK 


Fig. 1.11. A machine which because of its very nature (such as malfunc- 
tioning or for any other reason) produces systems in a statistically fluctu- 
ating manner in states |®1), |®2), ..., |®,), described by an overall state 


n 
|°) = S- a; |®;) with statistically fluctuating coefficients (a1,..., Qn). 
i=1 


The density operator in the form given in (1.5.8) also holds if we have 
a machine which because of its very nature (such as malfunctioning or for 
any other reason) produces systems in a statistically fluctuating manner in 
states |®;), |®2), ..., |®,) as summarized in Figure 1.11. This situation and 
the one spelled out in Figure 1.10 may be described by the same underlying 
theory. To this end we may define a state 
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|b) = S> a; |B) (1.5.13) 
i=1 


where because of the experimental set up as given in Figure 1.10 or because 
of the fluctuating nature of a machine (Figure 1.11) or both, the coefficients 
(Q1,...,Q@n) = @ may statistically fluctuate. Since the coefficients a; are, in 
general, complex numbers, we may write 


a; = 1; e° (1.5.14) 


where 7;, 6; are real. 
For a fixed value of a, the expectation value of an observable B in the 
state (1.5.13) is given by 


n 


(B)| = So rinje*4 (6; |BIB,) (1.5.15) 
ee Ga 
where 
Vij = 91 — 4; (1.5.16) 
and 2 
Sg (1.5.17) 
t=1 
Let 


Fiti, sey Tn, V125 Y135 ++ +5 Y235--- »Yn—1,n) (1.5.18) 


denote the probability density describing the statistical distribution of ri, ..., 
Tny Y12; +++; Yn—1,n, Whose explicit knowledge is not essential. To obtain the 
expectation value of the observable B over all fluctuations of the coefficients 
7,7; exp(—ijzj) in (1.5.15), one has to average the latter over the density in 
(1.5.18), subject to the constraint (1.5.17). Suppose w;; denotes the latter 
average. Accordingly, the overall expectation of the observable B is given by 


(B) = S> wiz (®;|BI®)). (1.5.19) 
i,j=l 
We note, in particular, that 


That is, w;; denotes the matrix elements of a Hermitian n x n matrix w. 
From elementary matrix algebra, we know that we may then find a unitary 
matrix U, ie., U' = U~! such that 


UtwU = diag [wi,..., wn] (1.5.21) 


where, from (1.5.17), w; > 0 and satisfy (1.5.10). Accordingly, let 
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UZO; = vj (1.5.22) 

where 
USU ky = bik (1.5.23) 
U;, mig Uj = Wmomk (1.5.24) 


and a summation over repeated indices in (1.5.22)—(1.5.24) is understood. 
Hence 


n 
B) = Sw; (Wi |Blyi) (1.5.25) 
i=1 
coinciding with the formula in (1.5.7). 

For a completely randomized system where the probability density f in 
(1.5.18) has a constant value over its domain of definition, the unitary op- 
erator U above becomes the identity operator with w;; = 1/n (no sum over 
i here). To see this, note that for the phase averages (¢ < 7) in this case we 


have 
1 Qn 


ee dyig eo 4 = 6:4; (1.5.26) 


and by symmetry or by explicit calculation (see Problem 1.4) 


(GS =ar = (1.5.27) 


under the constraint in (1.5.17). Accordingly, 


= B=" (wh; | B|bi) (1.5.28) 
i=l 


for a completely random mixture. 


1.6 Polarization of Light: An Interlude 


An illustration of most of the developments provided so far is readily given 
by examining polarization aspects of light. The relative simplicity of dealing, 
at this stage, with a photon, as opposed to, say, a spin 1/2 particle, is that the 
vector character of light, being of spin 1, allows one readily to decompose the 
polarization states along different directions by using the same elementary 
geometry as one uses in decomposing the three dimensional position vector of 
a particle along different directions in Euclidean space. The situation dealing 
with the decomposition of spin 1/2 states along different axes is different and 
will be dealt with later. One should be careful, however, that for a photon, 
like any other massless particle, the direction of polarization is perpendicular 
to the direction of its propagation. 
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It is convenient to denote the polarization state of light polarized along a 
unit vector making an angle ¢ with the z-axis simply by |¢). An x-polarized 
state may be then written as |0) and a y-polarized state as |7/2). 

From the vector nature of light we may decompose |¢) as (see Figure 1.12) 


|p) = |0) cos d + |7/2) sin ¢. (1.6.1) 
The states |0), |7/2) may be represented by the column vectors: 
1 
|0) = o) ((0] = (100) ) (1.6.2) 
0 
|r /2) = ; ( (r/2| = (010)) (1.6.3) 
and hence 
cos @ 
|\o) = | sing | , ( (d| = (cos¢ sing 0) ). (1.6.4) 


Fig. 1.12. Light polarized along a unit vector in the xy-plane making an 
angle @ with the x-axis on its way to an x-polarizer. 


The selective measurement symbols may be then explicitly represented as 
follows: 


100 
A(O) = [000 (1.6.5) 
000 
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000 
A(n/2) = | 010 (1.6.6) 
000 
cos?@ singdcos¢ 0 
A(¢) = |sindcosd sin? d 0]. (1.6.7) 
0 0 0 


In particular for successive selective measurements, 


cos? ¢ sind cos ¢ 0 


A(O)A(d) = {0 0 0 (1.6.8) 
0 0 0 
cos?¢ 00 
A(¢)A(0) = | sindcos¢ 00 (1.6.9) 
0 00 


and for 0< ¢< 7/2 
A(O)A(O)  A(P)A(O) (1.6.10) 
establishing the non-commutativity of measurements of polarization along 


the two different orientations. 
To make contact with formula (1.1.5), (1.6.8) may be written as 


A(0)A(4) = |0)(4| cos (1.6.11) 
with 

(0| ¢) = cos ¢. (1.6.12) 
The latter is also directly obtained from (1.6.1). By referring to Figure 1.12 
we may infer that the probability that a photon goes through the x-polarizer 


1S 


|(0|.4)° = cos? (1.6.13) 


which is the famous Malus formula. 
The density operator corresponding to the state |) in Figure 1.12 is 


cos? ¢@ sindcos¢0 
p= |¢)(o|=|sindcosd sin? d 0 (1.6.14) 
0 0 0 


as given in (1.6.7). The probability that a photon goes through the x-polarizer 
shown in Figure 1.12 is equivalently given by 


Tr [A(0)p] = cos? 4. (1.6.15) 


For an initial mixture, the density operator p may be written as 
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p = wi A(0) + w2A(z/2) (1.6.16) 


for light propagating along the z-axis. For such a mixture going to a U- 
polarizer, that is a polarizer with polarization axis making an angle J with 
the x-axis, the probability of transmission is given by 


Tr [A(@)p] = wi cos? 9 + wy sin? 0 (1.6.17) 


as is easily worked out. 
For unpolarized light, that is light with a complete random polarization, 
W = W2 = 1/2, (1.6.17) gives 


Tr [A(Y) punpai] = ; (1.6.18) 


independently of the orientation of the polarization axis, specified by the 


angle J, of the polarizer. 
x 
A 
y 
Ye 
| 


(b) 


Fig. 1.13. (a) An 2z-polarizer followed by a y-polarizer corresponding to 
successive selective measurement symbols. No light may go through the com- 
bined polarization system. (b) A J-polarizer is inserted between the x- and 
y-polarizers in (a). The corresponding successive selective measurements is, 
for 0 < 0 < 2/2, for example, not the O-operation. Depending on the ini- 
tial state for light, a photon may go through the arrangement with the three 
polarizers. 


Finally we consider the successive selective measurements provided by the 
polarizers in Figure 1.13. For the successive measurements corresponding to 
Figure 1.13 (a) 

A(a/2)A(0) =O (1.6.19) 


and no photon may go through the two polarizers. On the other hand if we 
insert a J-polarizer between these two polarizers as shown in Figure 1.13 (b), 
we obtain 
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0 00 
A(m/2)A(V)A(O) = | sindcosv 00] 40 (1.6.20) 
0 00 


with 0 < J < 7/2, for example, and a photon may go through the new 
arrangement with the three polarizers. 

What is remarkable about the arrangement in Figure 1.13 (b) is that 
although the «-polarizer makes sure to eliminate light polarized along the 
y-axis to go through it, the insertion of the J-polarizer, depending on the 
initial state, may allow y-polarized light to be finally transmitted through the 
y-polarizer. The naive impression that the insertion of a polarizer between 
two polarizers will reduce the final intensity of light transmitted through the 
system is not necessarily true. To this end, suppose that light is initially 
polarized along an axis making an angle ¢, with the x-axis given in (1.6.4). 
Consider, using in the process (1.6.20), the successive selective measurements 


A(m/2)A(0)A(0)A(@) = |7/2) (| sin ¥ cos V cos ¢. (1.6.21) 


Thus the insertion of the v-polarizer, as in Figure 1.13 (b), increases the 
intensity of light transmitted from 0 to 100 x (sin? J cos? J cos? 6) %. 

The arrangements given in Figure 1.13 together with the quantitative 
details given above provide an illustration of the theory developed with in- 
compatible observables at the end of §1.1. The system of the three polarizers 
in Figure 1.13 (b) is represented by the matrix 


0 00 
M = A(n/2)A(V)A(O) = | sin’ cos 0 00 (1.6.22) 
0 00 


in the notation of §1.3, §1.4, as introduced in (1.6.20). 

Although aspects of the polarization of light are easily treated, as shown 
above, and many authors use light to illustrate concepts in quantum mechan- 
ics, a detailed quantum description of light is far from straightforward. 
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In quantum physics, one generally deals not only with finite dimensional 
inner-product spaces but also with infinite dimensional ones. The Hilbert 
space concept is such a generalization which applies to both cases. In view 
of applications, we consider only separable Hilbert spaces, the property of 
which is spelled out in point (iii) below. 


Definition of a Hilbert Space 


A set of vectors is called a Hilbert space, denoted by H, if 
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(i) |f1), |.f2) are in H, then so is ay |f1) +@2|f2) for all complex numbers ay, 
Q. For the 0 vector in H, 0+ |f) = |f) for all |f) in H. This provides 
just the definition of a vector space. 

(ii) It is equipped with an inner product (f; | fo) for all |f1), |f2) in H, which 
is in general a complex number such that 


(fil fa + fa) = (fil fo) + (Al fs) 
(afi | f2) = o* (fil fe) 
(fil fo)” = (fal ft) 
(f\f) 20 


and (f|f) = 0 if and only if |f) is the zero vector. Property (ii) provides 
the definition of an inner product space. The norm of a vector |f), denoted 


by |||, is defined by 
fll = v«F IP). 


(iii) In H there exists a sequence of orthonormal vectors {| fi) ,|fo),..-}, 
called a basis, ie., (fi| fj) = 6i;, and for any f in 1H, we may find con- 


stants ol) such that 


N 
ras —0 as N—-oo. 


k=1 


Property (iii) constitutes of what is called its separability condition of H. 
(iv) A sequence of vectors {| f),|f?),...} in 1 is called a Cauchy sequence, 
if given any € > 0, we may find a positive integer N such that 


Fo = f™|| <e 


whenever n > N and m > N. Then every Cauchy sequence in H con- 
verges to a vector |f) in H. That is, 


jee" 8 


for n — co. This is called the completeness property of 1. 


Immediate consequences of the above definitions are the following inequal- 
ities. For any |f), |g) in H 


(flo) < Ifll Ilgll (1.7.1) 


referred to as the Cauchy-Schwarz inequality (see also (1.4.52)), 


If + all < Ifll + Ila (1.7.2) 
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referred to as the triangular inequality (see also (1.4.53)). Finally let 
{|f1),|fo),--.} be an orthonormal (ie., (fi | fj) = 6;;) basis in H. For any 
|f) in H define 


fo) = So fe) el PD- (1.7.3) 
k=1 


Then 
FO] < IF (1.7.4) 


and is referred to as Bessel’s inequality. 

The proofs of these inequalities are elementary and follow. If at least one 
of |f), |g) is the zero vector then (1.7.1) is obvious. If two non-zero vectors 
|f), |g) are orthogonal, that is (f|g) = 0, then (1.7.1) is again obvious. If 


(flg) 40, define 


_— al(al 
=a (Fla) (1.7.5) 

where qa is a real number. Then 
O< If +agll? = lf? + 20 |(g] f)| + 07 |Igll? (1.7.6) 


which upon minimizing over a gives (1.7.1). 
For the triangular inequality, we have 


If + ll? = IIfI? + 2Re(f]g) + Ilgll?- (1.7.7) 


But by making use of (1.7.1), 


Re(flg) <M(F191 < IlFll lal (1.7.8) 


that is 
If +gll? < (Fil + Ilgll) (1.7.9) 


which is equivalent to (1.7.2). 
For Bessel’s inequality we note that 


(ff 


fr) =0. (1.7.10) 


Hence 


lh 


AP = p= 00+ OP = FFP EOP 


> FOI? (1.7.11) 


giving (1.7.4). 

A typical example of a Hilbert space is the set, denoted by ¢?(0o), of all 
vectors |a) = (a1,@2,...), involving components as complex numbers such 
that 
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llall? = S— Jax? < 00 (1.7.12) 
k=1 
with the addition law defined by 
ala) + 8 |b) = (aay + Bb1, waz + Bbo,...) (1.7.13) 


and the inner-product given by 
(ab) = So axbe. (1.7.14) 
k=1 


[The proof of the separability and completeness of this space is relegated to 
Problem 1.7.] 

A particularly important example of a Hilbert space is the space of square- 
integrable functions, denoted by L?(R*), associated with a particle of spin 0. 
For any f(x), g(x) in L?(R%), x in R°, 


Coe Jo f°) g(x) (1.7.15) 


\|f |? = [ex | f(x)|? < 00. (716) 

For a (massive) particle of spin s, having (2s + 1) components along a 
given direction, we may introduce the space L?(R°,C?**1) (where C stands 
for complex), and define 


a=) [ox f°(x,0) 9(x,0) (1.7.17) 


where the sum is over all o = —s,—s+1,...,s. 
For n distinguishable spin 0 particles, for example, one may introduce the 
space L?(R°”) with inner product 


(f\lg) = [ix ax f* (&1,---,Xn) 9(X1,---,Xn)- (1.7.18) 


Of particular interest is the case dealing with n indistinguishable particles 
of spin s, such as n electrons (s = 1/2), n neutral pions (s = 0), etc. In such 
cases we have to invoke the spin and statistics connection and restrict to 
subclasses of square-integrable functions w(x1,01;...;Xn,@n) with definite 
symmetries as defined below for the interchange of any two particles: 


Das TRO sa OST ks) teh Se ee ty a) (1.7.19) 


for half-odd integer spin s, with such particles referred to as fermions, and 
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Dla PRO 8-5 RO ee) = AWE oS ROG aj Ki G 2s) (1.7.20) 


for integer spin s, and such particles are referred to as bosons. 

The state of a system is described by a unit vector |q) in some Hilbert 
space H. Since, as we have already seen in $1.2, §1.3, §1.5 and we have ample 
opportunity to see this further later on, physical quantities (such as proba- 
bilities, expectation values, ...), to be compared with experiments, involve 
the combination |7) (|, and the states |q) and |w) e!®, defined up to arbi- 
trary phase factors, are equivalent. The set { |) elf} also denoted by |w) 
for convenience, with ¢ varying over all real numbers, is called a unit ray. 
Accordingly, the states of a physical system are represented by unit rays. 

Given a Hilbert space H, not every unit ray |) in H, however, is necessar- 
ily a physically realizable state. From the very definition of a Hilbert space, 
for example, a linear combination of two states |W1) and |w2) with different 
charges @1, Qe is also in 1. But such a vector is not physically realizable. 
Similarly the superposition of two vectors |), |w2) with integer and half- 
odd integer angular momentum states, respectively, resulting a vector |w), is 
not physically realizable, if the principle of rotational invariance is invoked. 
The reason is that under a rotation by an angle 27, for example, about the 
quantization z-axis, as we shall see later, |¢1) and |w2) transform in different 
ways and the projection operator |w)(w| does not remain invariant. In detail, 
if 


|b) = a |) + a2 |b2) (1.7.21) 

then under the above specified rotation with 
Ib) (bl — |") bl (1.7.22) 
|b") = ar \1) — 2 [2 - (1.7.23) 


Accordingly, not every unit ray in a Hilbert space is necessarily physi- 
cally realizable and any rule which singles out such rays is referred to as a 
superselection rule. 

Any given vector |f) in a Hilbert space H may be also defined as an anti- 
linear functional, through the inner product as follows. For |g) a vector in H, 
and with an inner product (g| f), one may consider |f) as a functional in |g), 
with the property 


(ard + ar2|f) = a7 il f) + 05 (He | f) (1.7.24) 


for any two complex numbers a1, a2. The “linearity” condition in (1.7.24) is 
implicit. “Anti” in anti-linear refers to the fact that the coefficients a1, a2 
in (1.7.24) are complex conjugated which follows from the definition of the 
inner product (g| f). 

The above definition is useful in studying properties of observables with 
corresponding operators having a continuous spectrum. Such operators are 
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studied in §1.8. But for the moment, consider, for example, the position 
operator in one dimension. This may be written as 


X= [ave |x) (a| (1.7.25) 


with the sharp selective measurement symbol A(x) = |2)(x| satisfying (com- 
pare with (1.1.2), (1.1.3)) 


A(a’)A(a) = A(a) (x’ | 2) (1.7.26) 
(x’ |x) = 6(2’ — x) (1.7.27) 


with the latter denoting the Dirac delta. The identity operator, which accepts 
and transmits all particles crossing anywhere the z-axis with no discrimina- 
tion, is given by the expression 


1= fiw |a) (a . (1.7.28) 


—co 


If the system is initially prepared to be in a state |), then one may write 


jw) <1) = fae |e) ely) (1.7.29) 
and the probability density that a particle crosses the z-axis at point 2 is 


Tr (A(x) |b) WI] = vl 2)I (1.7.30) 


Although |) is obviously not a vector in the underlying Hilbert space H, 
it is nevertheless rigorously defined as an anti-linear functional on vectors |~) 
in H through 


(ard + agv2| 2) = aj (1 |x) + a9 (2|2). (1.7.31) 


The set of all anti-linear functionals consists then not only of vectors 
|f), as discussed above, but also generalized ket vectors (such as |x)) and 
is obviously larger than the Hilbert space H itself. The triplet consisting of 
H, the set of all anti-linear functionals and the domain, depending on the 
problem in hand, consisting of those vectors in 7 on which the anti-linear 
functionals are defined, is referred to as a Rigged Hilbert space. 

It is interesting to dwell further on the anti-linear functional |x) as a 
functional of functions belonging to L?(R*). Given some function 7(-) in 
L?(R3), the anti-linear functional |x) may be represented by 63(x — x’), ina 
space larger than L*(IR*), as defined through 


(|x) = (W(-) | O° — -)) 
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= [ox w(x’) O°(x — x’) = o* (x). (1.7.32) 


A sharp momentum state |p) of a particle, as we shall see later, may be 
written as 


|p) = [ox eiPx/F |x) (1.7.33) 


where fi is the Planck constant h divided by 27. As an anti-linear functional 
on functions 7)(-) in L?(IR°), |p) may be represented by e!P**/” in, obviously, 
a larger space than L?(R*), due to its lack of square-integrability, defined 
through 


(lp) = (X) 


elP-(/n) 


= [ex w* (x) ep x/h (1.7.34) 


(a Fourier transform). Obviously, the domain of |p) consists of functions ¢(x) 
in L?(R°) for which the integral in (1.7.34) exists. 
Finally consider the anti-linear functional 


|) = [ox |x) &(x) (1.7.35) 


where ®(x) is not a square-integrable function, and hence necessarily belongs 
to a space larger than L?(R°). Its domain consists of those functions (x) in 
L?(R°) such that 


(w|®) = Jo w* (x) ®(x) (1.7.36) 


exists. 

The moral of the Rigged Hilbert space formalism is that physics dictates, 
in general, to introduce in addition to a Hilbert space H, a space, say, D*, 
larger than #1 of anti-linear functionals, including generalized ket vectors, 
with a domain of definition for the anti-linear functionals, say, D, contained in 
H. The Rigged Hilbert space is then written as the triplet DC H C D*. The 
Rigged Hilbert formalism has clarified some of the ambiguous manipulations 
carried out in earlier days of quantum physics. 


1.8 Self-Adjoint Operators and Their Spectra 


An observable B which may take on a finite number of real discrete values, 
{b, b’,...}, is represented (see §1.3) by a Hermitian operator 


B=YS_bj|b)(0| (1.8.1) 
b 
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(using the same notation for the latter as the observable it represents) in 
a finite dimensional inner-product space generated by the eigenvectors |b). 
That is, any vector |f) in the generated inner-product space may be written 


as (see (1.4.2), (1.4.16)) 
= 5° |b) f(b) (1.8.2) 
b 


where 

f(b) = (| f) (1.8.3) 
are complex numbers. The operator B maps any vector |f), in the generated 
inner product space, into another vector 


B|f) = S° 2) 9(b) =|9) (1.8.4) 
b 


where 
g(b) = bf (b) (1.8.5) 


in the same inner-product space. In particular, the Hermiticity of B implies 
that 


(f2|Bl fi) = 2580) 


= (Bfa| fi) (1.8.6) 


for any two vectors in the generated inner-product space. 
The selective-measurement symbol 


A(b) = |b) (b| (1.8.7) 


A(b)A(b) = A(b) (1.8.8) 


(see (1.1.2)). One may also define the following projection operator 


= 5° A() (1.8.9) 


b<b! 


by summing over the projection operators over all the eigenvalues of B, from 
the lowest, up to any given eigenvalue 0’ as indicated in the summation sign in 
(1.8.9). Properties of the projection operators in (1.8.9) are easily established 
and will be spelled out for the more general cases to be discussed below. 

In the general case when an observable, say, A may take on infinite num- 
ber of real discrete values and/or continuous real values, one is faced with 
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convergence problems and one carries out physical computations, more gen- 
erally, in a Hilbert space as defined in §1.7. The object of physical interest 
representing an observable, such as A, is a self-adjoint operator, as defined 
below, and coincides with the definition above, for a Hermitian operator, in 
the simpler case discussed above with a finite number of real discrete values, 
operating in a finite dimensional inner-product space. Hermitian operators 
may, however, be defined in the general case as well, but in quantum physics 
it is the more general concept of a self-adjoint operator that is relevant as 
representing a given observable. We do not wish to get too technical about 
the distinction between these two types of operators and we provide the bare 
minimum in this respect. 

An operator A maps, in general, a vector |f) in the underlying Hilbert 
space H into some other vector A|f) in H. The totality of all vectors {| f)} 
in H such that for each |f) in this set, A|f) is also in H, i-e., ||Af|| < 00, 
is called the domain of A. We are interested in operators with domains rich 
enough such that, as a generalization of (1.8.2), any vector in H may be 
well approximated by a vector in the domain of the operator in question. 
[Technically, this means that for any vector |h) in H, and any given « > 0, 
we may find a vector |h-) in the domain of A such that ||h — he|| < e.| For 
such an operator, the adjoint A’ may be defined through 


(g Alf) = (A'g | f) (1.8.10) 


for all |f) in the domain of A. 
If the domains of A and At coincide and 


(9 Alf) = (Aglf) (1.8.11) 


(see (1.8.6) for the special case) then A is said to be self-adjoint and one 
writes At = A. [In the case (1.8.11) holds for all |f) and |g) in the domain of 
A, but the domains of A and A! do not coincide, A is said to be Hermitian.| 

The remaining part of this section deals with the spectra of self-adjoint 
operators and the meaning of the eigenvalue problem. This topic is of central 
importance in quantum physics. We restrict, however, the presentation to 
those aspects which are only relevant to the rest of the volume. 

Given a self-adjoint operator A we may write 


A= faa d6(A — A) (1.8.12) 


which has obviously a meaning whenever a measurement of the observable 
with which the operator A is associated takes a given real value, say, Xo. 


Define 
d 


Pa(A) = @(A— A) = / dn 5(N — A) (1.8.13) 
(see (1.8.9) for comparison), where O(A) is the step function, O(A) = 1 for 
> 0 and O(A) = 0 otherwise. Then formally 
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d P(A) = 6(\ — A) dA. (1.8.14) 


We may then write from (1.8.12) 
A =| AdPa(A). (1.8.15) 


The latter is called the spectral decomposition of A. 
The Pa(A) are projection operators whose properties are readily estab- 
lished from (1.8.13). They are: 


(i) Since a step-function is bounded by 1, 
|PaQ) fl] < Ill (1.8.16) 


for all |f) in the underlying Hilbert space H, i.e., (1.8.16) is defined for 
all vectors | f) in H. In particular, P4(A) is self-adjoint. 


P4(—o0) = 0. (1.8.17) 


Pa(+oo) =1= [ara (1.8.18) 


giving the identity operator (compare with (1.1.4), see also (1.8.1), 
(1.8.9)). The equality on the right-hand side of (1.8.18) is called the res- 
olution of the identity operator. 


(iv) 
(f|PaQa)If) < (fF |PaQa) If) (1.8.19) 
for Ay < Ao. 
(v) 
POD) PiOoy = Palo) (1.8.20) 
where 
Xo = min(A1, 2). (1.8.21) 


(vi) From (1.8.18), 


wiP=(L) =f adsiPaa@in= falPaoysir (1.8.22) 


where in writing the last equality we have used (1.8.20). 


To study the spectrum of A and hence infer about the possible values 
that may be obtained by a measurement of the observable with which A is 
associated, we consider the operator 


Ao+e 
[Pa(\o + €) — Pa(Ao — €)] = dPa(A): (1.8.23) 


Gs. 
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Suppose Xo is an isolated point of the spectrum of A. That is, for some 
€ > 0, A takes no values in (Ag —€, Ap + €) except the value Xo. The integrand 
in (1.8.23) then makes a finite “jump” from zero when A takes on this value 
and the subspace 

[Pa(Ao + €) — Pa(Ao — €) JH (1.8.24) 


including the case with the limit « — +0, is not empty. Hence for ¢« — +0, 
the latter contains at least one vector, say, |f), and 


[Pa(Ao + 0) — Pa(Ao — 9)] |f) =f) - (1.8.25) 


Accordingly, from (1.8.15), property (iii) in (1.8.18) and property (iv) in 
(1.8.22) 


||(A — do) fll” = fe — do)? dl] Pa(d) fl]? 


do+0 ‘ 
=| (A — Xo)? d||Pa(A) f ||" = 0 (1.8.26) 
do—0 

where we have also used (1.8.23), (1.8.25). Hence (A — Xo) |f) is the zero 


vector, 1.e., 


A|f) = olf) (1.8.27) 


which is the familiar eigenvalue equation for a (discrete) eigenvalue Xo. This 
occurs whenever the integrand in (1.8.23) makes a “jump” from zero when 
A takes on the value 4 = Ao and the point Ao is isolated in (Ao — €, Ao + €) 
with no other values occurring in this interval for some ¢ > 0, and (1.8.24) is 
non-empty for « > +0. 

On the other hand, suppose that for some value Xo, that the observable 
in question may take, and for all ¢ > 0, no matter how small, one may find 
a point Ay # Xo in (Ap — €, Ao + €) in the spectrum of A, and 


[Pa(ro + €) — Pa(Ao — €)] (1.8.28) 
becomes the zero operator for ¢ — +0. For such a point 
[Pa(\o + 0) — Pa(Xo — 0)]H (1.8.29) 


is empty and hence contains no non-zero vectors. As we shall see, this refers to 
the fact that there are no eigenvectors in H corresponding to the continuous 
spectrum of A. 

For ¢ > 0, however, any non-zero vector in the subspace 


[Pa(ro +e) — Pa(Ao — ©) | (1.8.30) 


of H for which (1.8.23) is not the zero operator, necessarily depends on e. 
Let |f(e)) be a non-zero in this non-empty subspace of H. That is, 


44 1 Fundamentals 


[Pa(Ao + ©) - Pa(do —)] Lf) = IF). (1.8.31) 


For ¢ > 0, we may consider |f(<)) to be normalized, i-e., || f(e)|| = 1. Hence 


Aote 
2 2 
(4 Ao FOP = fA ro}PaliPa (1.8.32) 
is: 
where we have used (1.8.31), (1.8.23). 
The right-hand side of (1.8.32) is bounded above by 


AotE 2 
2 | dj[Pa(d) Fe) 


ms 


= ¢? [aleao) f(e||" =< (1.8.33) 


where we have used (1.8.22), and the normalizability of f(¢) for ¢ > 0. 
That is, for any « > 0, for which (1.8.30) is not empty (and is otherwise 
empty for « — +0, we may find a vector |f(e)), depending on ¢, in H such 
that 
||(4 — Ao) F(e)|| < e (1.8.34) 


and (A— Ao) f(e) can be made closer and closer to the zero vector by making 
€ smaller and smaller. Equation (1.8.34) covers the earlier case in (1.8.27) for 
the eigenvalue equation as well by taking the limit ¢ — +0. 

One encounters this latter case quite often in elementary quantum physics 
courses and a classic example of this is the one dealing with the position 
observable and is given below. 

Try to set up an eigenvalue equation for the position of a particle, via the 
equation 

X fxg (X) = Xo fxg (X). (1.8.35) 


We may rewrite the latter as 

(x — x0) fxo(x) = 0 (1.8.36) 
whose solution, up to a multiplicative constant, is 

fico () = 6°( — Xo). (1.8.37) 


[Note that any function of x multiplied by (1.8.37) is evaluated at x = x9 and 
hence reduces to a constant.| The solution (1.8.37) is obviously not square- 
integrable. What the above analysis, however, shows, and this is in conformity 
with experimental limitations, that is given any € > 0, as small as one wishes, 
one may find a square-integrable function 6-(x, x09), depending on ¢, such that 


1/2 
|| (x — Xo) de(x, xo) || = ( f°x jx — xol?|de0,0)|”) Ke. (1.8.38) 


Such an explicit function is given by 
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6-(x,X0) = Ce = exp | cca a (1.8.39) 


TT 


This is obviously normalized (< > 0), and the left-hand side of (1.8.38) is 
equal to ¢/2 <e. 

A similar analysis may be given for the momentum operator of a particle 
(see Problem 1.8). 

The above analysis leads to the definition of the spectrum of a self-adjoint 
operator A as consisting of all (real) Ao such that 


[Pa(\o +€) — Pa(Ao — €)] #0 (1.8.40) 


for all c > 0. 
For the eigenvalue equation, (A— Ao) annihilates an eigenvector, i.e., some 
vector in H, hence the inverse operator (A — A9)~! does not exist in all H. 
On the other hand for ¢ > 0 for which (1.8.40) is true, that is the subspace 
in (1.8.30) is not empty, and, is otherwise empty for « > + 0, 


(A — Ao) IF (€)) = lg(€)) (1.8.41) 


as well as |f(e)), are non-zero vectors in H. In such cases (A — \o)~! may 
be defined in H since there are no vectors |f) in H that make (A — Ao) |f) 
zero. Using the normalizability condition for |f(e)), we have from (1.8.34), 
(1.8.41) 

l|g(e)|| < € = e||(A — Ao)" g(e)|| (1.8.42) 


which leads to 
4-0) 1 


0) 


The latter says that although the inverse (A — A9)~' may be defined in such 
cases it is an unbounded operator as the right-hand side of (1.8.43) may be 
made larger and larger as € is chosen smaller and smaller. 

Consider any two distinct points 41 and 2 in the spectrum of A. For any 
€, > 0, €2 > 0, and any normalized vectors |fi(€1)), |fo(€2)) such that (see 
(1.8.34)) 


(1.8.43) 


||(A = Ar) far) || < €1 (1.8.44) 
||(A — Az) fa(€2)|| < €2 (1.8.45) 
the orthogonality relation 


lim lim, (fi (1) | falé2)) = a ii (fi(€1) | fo(e2)) = 0 (1.8.46) 


€1—0 £5. 


easily follows. To this end, 


(Ai — Az) (fi (1) | fo(€2)) = (fr (Er) (A — A2) I fo(€2)) 
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— (filer) (A — Ax) | fa(E2)) - (1.8.47) 
Hence from the Cauchy-Schwarz inequality (1.7.1), (1.8.44) and (1.8.45), 


| filer) | falen))| < SAF) (1.8.48) 
|A1 — Aa 

leading to the result in (1.8.46) for Ay 4 Az. [Note that (1.8.48) is an inequality 
and its left-hand side may indeed vanish for some €1, €2.] 

The results in (1.8.46) establish the familiar statement of orthogonality 
not only of two eigenvectors, with eigenvalue equations defined as in (1.8.27), 
but also in a limiting sense of the orthogonality of two eigenfunctions with 1 
and Ag in the continuous spectrum and also the orthogonality relation with 
Ai, an eigenvalue (i.e., belonging to the discrete spectrum) and \2 belonging 
to the continuous spectrum in a unified manner. In particular, for the very 
last situations with , in the discrete spectrum and Az in the continuous one 
we may write 


A|fi) = Ar |f1) (1.8.49) 
||(A — Az) fa(e2) |] < €2 (1.8.50) 

and 
Jim, (fi| fa(e2)) = 0 (1.8.51) 


whose importance cannot be overemphasized. 

Since the inverse (A — £)~1, where now €, in general, is taken to be some 
complex number is important in studying the spectrum of A, we consider 
further some of its properties. (A — €)~' is called the resolvent of A. To this 
end, we may formally write 


hn fr 
(A=¢) =f aoy-A 


=f means (1.8.52) 


where in the last equality we have used (1.8.14). 
If, as we will shortly investigate, (1.8.52) exists, we have 


4-9 =f age alPaaysh 889 


For Im € ¥ 0, this integral is obviously meaningful since |\—€|? > | Im €|? > 0, 


(A - 6-1? < sage | allPa Fl = jo (1.8.54) 
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for all |f). 
If for € real and some ¢ > 0 


[Pa(€ +) — Pa(é—«)] =0 (1.8.55) 


then 


Wa-erts = ft aeons? 


+f ree aaa’ < 


and (A — €)~! exists. All complex € (i.e., Im 4 0), and all real € such that 
(1.8.55) is true, constitute of what is called the resolvent set of A. 

Finally, for € = Xo real, with Xp an isolated point of the spectrum of A, 
i.e., in particular, 


gle 


(1.8.56) 


[Pa(Ao +0) — Pa(Ao — 0)] |f) #0 (1.8.57) 


obviously (1.8.53) does not exist as we already know. On the other hand for 
€ = Xo real such that 


[Pa(\o +€) — Pa(Ao — €)] #0 (1.8.58) 


for all « > 0, and is otherwise the zero operator for « > +0, 


AoE 
1p ||? : 1 Siu 
|a-o-uP> [pe allPaanel? > (1.8.59) 


(1.8.53) exists but (A—€)~? is an unbounded operator as the right-hand side 
of the above inequality may be made larger and larger by making € smaller 
and smaller, which we also already knew. These last two cases corresponding 
to values of € as given through (1.8.57)—(1.8.59) constitute the spectrum of 
A. 

Suppose that for a given self-adjoint operator A, a real number Xp belongs 
to its continuous spectrum or is an eigenvalue of infinite degeneracy, then, by 
definition 

dim ([P4Qo This Pah )|#) = 0 (1.8.60) 


for all « > 0. Because of this property, eigenvalues of infinite degeneracy 
and the continuous spectrum are grouped together and constitute of what is 
called the essential spectrum of a self-adjoint operator. 

On the other hand, eigenvalues of at most finite degeneracy (i.e., which are 
non-degenerate or of finite degree of degeneracy) constitute of what is called 
the discrete spectrum of a self-adjoint operator. In this case, suppose that 
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some real number Ao belongs to the discrete spectrum of a given self-adjoint 
operator A. Then we can always find some 6 > 0 such that 


04 dim ([PaQo A) = Pe 6)]1) < 00 (1.8.61) 
including the limiting case 6 > + 0, ie., 
04 dim ([PaQo 40) PiOg= 0)]7) <0. (1.8.62) 


Two powerful propositions establish results concerning the nature of es- 
sential and discrete spectra of self-adjoint operators. 


Proposition 1.8.1 

For a real number Xo to belong to the essential spectrum of a given self-adjoint 
operator A it is necessary and sufficient that there exists an infinite sequence 
{|fn)} of orthonormal vectors such that 


||(A = Ao) fal] +0 for noo. (1.8.63) 


To establish? this, suppose first that Ag belongs to the essential spectrum 
of A. Then let ¢9 be any positive number such that 


€0 > _ (1.8.64) 


Choose a number A; 4 Xo and a corresponding ¢1 such that 
€1 = |A1 — Aol < €o. (1.8.65) 
Similarly, choose a number Az 4 Ao and a corresponding €2 such that 
|A2 — Ao| = €2 < é1 (1.8.66) 


and so on (see Figure 1.14). 
We have thus generated a sequence {e,,} such that 


£9 > €1 > €Q >... > En >... (1.8.67) 


and (by definition of Xo as belonging to the essential spectrum) correspond 
to infinite dimensional spaces: 


[Pa(o + £0) _ Pa(Xo - eo) |H = [Pa(ro + E1) - Pa(Xo - e1) JH 
Dias 


) [Pa(Ao an re) = PaQro a En)|H 


? The proofs of Propositions 1.8.1 and 1.8.2 may be omitted at a first reading. 
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Ao — £0 A0 — £1 


moe 
op 


Fig. 1.14. The figure shows the process of generating the sequence {én} for 
forming intervals about Xo in the essential spectrum of a self-adjoint operator. 


=r (1.8.68) 


We may, therefore, select an infinite sequence {|f,,)} of orthonormal vec- 
tors such that 


[Pa(\o + €n) — Pa(Ao — én)] |fn) = |fn) - (1.8.69) 
Hence 
2 AGT St 2 
A= do)fal? = f= 20)?aliPaQyfal? <2. (2.8.70) 


Since in the process of construction we may arrange such that ¢2 — 0 for 
n — oo, it follows that 


I|(A - Xo) Fn| —0 for n— oo. (1.8.71) 


Conversely, suppose that there exists an infinite sequence {|f;,)} of ortho- 
normal vectors such that 


||(4 - do) fn| — 0 for n— oo. (1.8.72) 


We then have to show that 9 belongs to the essential spectrum of A. 
To the above end, for any ¢ > 0 
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AoE 
(A= rofal? > f= 20)allPa)sall? 


+f (=o)? PaQFull 
Ao+e 
On the other hand 
AoE 

i (A — do)? dl] Pad) fall? > 2] Palo — 6) fall? 
and 

if (A— Ao)? dl] Pa(d) fall? > e2]|[1 — Paro +] fall? 

ote 


where we have used (1.8.17), (1.8.18), (1.8.20). 
Hence 


||(A = Ao) fall” > €?|| Pao — ©)fall” + €?|| [1 — Paro + £)] fall” 


and from this and (1.8.72) we conclude that 


im, ||PaQo — Stall = 


aim [1 — PaQo +2) full? = 
for any e > 0. These results may be equivalently rewritten as 
‘lim (fn |Pa(do — €)Lfn) = 0 
Jim (fn|Pa(do+ €)lfn) = 1 
or, by combining the two, we may write 
Jim (fn [Pa(Ao + €) — Pa(Ao — €)] |fn) = 1 
for any € > 0. 


Therefore 
[Pa(Ao + €) — Pa(Ao — €)]H 


(1.8.73) 


(1.8.74) 


(1.8.75) 


(1.8.76) 


(1.8.77) 


(1.8.78) 


(1.8.79) 


(1.8.80) 


(1.8.81) 


(1.8.82) 


is not empty and it remains to show that the latter is infinite dimensional. 
Suppose that this is not true. That is, it is a finite dimensional space. Select 


an orthonormal set {g1,..., 9g}, k < co in it. Then 


k 
(fal [Pa(do + 6) — PaQo —2)] [fn) = 32 Mm fa)? < 
m=1 


(1.8.83) 
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for any € > 0 and all n. 
Since {|f,)} constitutes an orthonormal set, Bessel’s inequality (1.7.4) 
reads 


YS" lgm| fe)? < (1.8.84) 

l=1 
and the convergence of this series for n — oo in particular for allm=1,...,k, 
implies that 

lim |{9m| fn)|> = 0 (1.8.85) 
for m = 1,...,&. Due to the finite number of terms in the sum in (1.8.83), 


we may take the limit n — oo inside the summation sign in (1.8.83) to infer 
that 


(fn| [Paro + €) — Palo — €)] |fn) — 0 (1.8.86) 


for n — oo in contradiction with (1.8.81). That is, for any ¢ > 0, the space 
(1.8.82) must be infinite dimensional. This establishes the statement made in 
the proposition. 

Before stating and establishing the next important proposition regard- 
ing now discrete spectra of self-adjoint operators, we need some preliminary 
results. 

To the above end, we first recall the definitions of the infimum and supre- 
mum of a set of real numbers. A real number A is a lower bound of a set 
of real numbers if A is less than or equal to every element of the set. If in 
addition, no lower bound of the set is greater than A, then . is referred to as 
the infimum of the set. Similarly, \ is an upper bound of a set of real numbers 
if \ is greater than or equal to every element of the set. If in addition, no 
upper bound of the set is less than \ then ) is referred to as the supremum 
of the set. 

We define® the following real numbers associated with a self-adjoint op- 
erator A for which a lower bound for its spectrum (§3.2, §3.3) exists: 


a) chee a fn 1) lace jeri fn—1) (olan) eae 
where |f1),...,|fn—1), |W) are normalized vectors, |fi),..-,|fn—1) are not 
necessarily independent, and |w) € [|f1),---,|fn—1)]+ means that (| f;) = 0 
for i=0,...,n—1, ie., the |W) are orthogonal to |f1),...,|fn—1)- 

That is, we define by [|f1) ,...,|fn—1)]+ the space generated by the vectors 
(in the domain of A) orthogonal to |f1),...,|fn—1). 
Since, 
(| f1) ane) Eoare es =) [|f1) Peng beeen , | fn)}> (1.8.88) 


with the set on the left-hand side involving, in general, of more vectors than 
the set on the right-hand side, we may infer that 


3 This treatment follows that of Reed and Simon (1978). 
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inf Alv) < inf A 1.8.89 
weteinoe © ) 16) €[l fa) +s] fn—1) 5] fn) > ‘olAlé) ( ) 


and then considering the operation of taking the supremum over all 
fis--->fn—1, fn we obtain 


An(A) < An41(A). (1.8.90) 


We also need the following results. Let a and b be any two real numbers 
such that 


a@< nA) <b. (1.8.91) 
Then 
(i) 
dim ([Pa(a)]) <n (1.8.92) 
(ii) 
n< dim ([P4(0)]7). (1.8.93) 


To establish (i), suppose that dim ([Pa(a)]7) =k 2>n. That is, there 


exist k > n independent vectors in [Pa(a)]H. Given any k — 1 vectors 
\f1),---5|fe—-1), not necessarily independent then 


(lft) .---s1fe—1)]7 9 [Pa(@)]H (1.8.94) 


is not empty. Let ~ be a (normalized) vector (in the domain of A) belonging 
to the latter space. Then 


(laly) =f raljPa@yul? <a (1.8.95) 


ie., A,(A) < a. Since by hypothesis & > n, (1.8.90) implies that A,,(A) < a 
in contradiction with the fact that a < A,(A). This establishes (i) in (1.8.92). 

To establish (ii), suppose that dim ([Pa(0)] 1) = k < n-1, and 
that |gi),..-,|gx) are & independent vectors that generate this space. Let 
|b) be any (normalized) vector (in the domain of A) such that |) € 
[lg1) »---+|9e)}> = [1 — Pa()|H. Then 


(wll = [ralPavnell? > (1.8.96) 


ie., Ax4i(A) > b, and hence 4,,(A) > b, since by hypothesis n > k + 1. This 
contradicts the fact that ,,(A) < 6 and establishes (ii) in (1.8.93). 
Now we are ready to state and establish the following proposition. 
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Proposition 1.8.2 

Either there are n eigenvalues (counting degeneracy) \,(A) < ... < An(A) 
below the bottom of the essential spectrum of A, where An(A) is the n*” eigen- 
value, or 


\n(A) = inf {A, with X belonging to the essential spectrum} 


with Xn (A) = An41(A) = ..., and there may be at most (n — 1) eigenvalues 
(counting degeneracy) below (A). 


To establish the above statements, we first note that for all « > 0, we 
have shown earlier in (1.8.92), (1.8.93) that 


dim ([PaQn = <)]) <n (1.8.97) 


n< dim ([PaQn 4 |) (1.8.98) 


respectively, where we have simply written 2,, for A,,(A). 
We consider the two possibilities: 
(i) 
dim ([Pa(an +£0)]#) < 00 (1.8.99) 
for some €9 > 0, or the possibility that 
(ii) 
dim ([Pa(n + £0)] 74) = 00 (1.8.10) 
for all eo > 0. 
For the case (i), (1.8.97)—(1.8.99), imply that 


1< dim ([PaQn te Se Pays 0)|7) < 00 (1.8.101) 


for some €9 > 0. That is, ,, is an eigenvalue (of at most finite degeneracy). 
Hence (1.8.98), (1.8.101) imply that we may find a do > 0 such that 


n< dim ([PaQn)]#) = dim ([PaQn a: 0)]7). (1.8.102) 


This means that there are exactly (n — 1) eigenvalues strictly below »,,. This 
is because if there are, say, n + k eigenvalues less or equal to An, ie, Ay < 


-<,,, < An, then (1.8.97) implies that dim ([PaQnan)]7) Se, 
But N44 < An means that n+ k —1<n-—1, which is true only if k = 0, 
and Xi, = An. We also note from (1.8.97) that dim ([PaQu = )]#) = 0 for 


all ¢ > 0 and hence the spectrum set is empty below A. This settles the first 
part of the proposition. 
For the case (ii), given in (1.8.100), (1.8.97) then necessarily implies that 
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dim ([Pa(n Pies Pass 2)|H) = 00 (1.8.103) 
for all ¢ > 0. Hence Xp, in this case, belongs to the essential spectrum of A. 


Let a be any real number such that a < Ay, — €, i.e., a+¢e < An. Hence from 
(1.8.92), 


dim ([Pa(a+)]H) <n=1, (1.8.104) 

Also 
dim ([Pa(a = e)|#) < dim ([Pa(a = e)]), (1.8.105) 
= dim ([Pa(a+e) — Pa(a—e)]H) <0 (1.8.106) 


for all ¢ > 0. That is, a cannot belong to the essential spectrum and this is for 
all ¢ > 0. This in turn means that X,, is the bottom of the essential spectrum 
as stated in the second part of the proposition. Now suppose An+1 > An, and 
note that 


Xn-+1 os hs An 4 —— An) Vrs t An) (1.8.107) 


which from (1.8.92) and (1.8.93) imply the contradictory statements that 


dim ([Pa(A*)] x) (1.8.108) 


is <n+1 and = o, respectively. That is, we must have An,+41 = An. 
Also for a < An — €, we have from (1.8.104), (1.8.105) 


dim ([Pa(a = <)|H) 2He4 (1.8.109) 


for all ¢ > 0, hence there may be at most (n — 1) eigenvalues below X,,. This 
establishes the second part of the proposition. 

For future developments, we establish an order relationship between 
the eigenvalues of two self-adjacent operators A and B, whose spectra are 
bounded from below, such that for all vectors |7)) in their domains 


(bl Aly) > (p|Bly). (1.8.110) 


From the very definitions of the infimum and supremum of a set of real 
numbers it is not difficult to see, as shown below, that 


N(AY Se (BY. (1.8.11) 


To this end note that if, relative to a given space [|fi),..-,|fn—1)|+ 
(see (1.8.87)), ch, provides the infimum corresponding to the operator B, 
then from (1.8.110), cp also gives a lower bound to (w|Alw) for all |) € 
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(|f1),---|fn—1)]+- Since, by definition, no such a lower bound can be greater 
than the corresponding c’,, we conclude that c, < cy. The result then follows 
by considering the supremum of all such c}, and c’, as we consider all vectors 
louder: 
A special function of a self-adjoint operator A is the unitary operator 
defined by 
U(t) = et (1.8.112) 


depending on a real parameter t, which may be rewritten as 


U(t) = / dd e' 6(\ — A) (1.8.113) 
and from (1.8.14) as 
U(t) = / e dPa(X) (1.8.114) 
providing its spectral decomposition. It satisfies the basic property 
U-1(t) = UT(#), (1.8.115) 
Le., 
Ut(t)U(t) = 1 = U(t)UT(t) (1.8.116) 
and also the group property 
U(t1)U(t2) = U(t, + te). (1.8.117) 


Particular attention will be given later to the study of some basic prop- 
erties of an important self-adjoint operator — the Hamiltonian for various 
physical systems. 


1.9 Wigner’s Theorem on Symmetry Transformations 


Invariance of physical laws under some given transformations lead to con- 
servation laws and the underlying transformations are referred to as symmetry 
transformations. For example, invariance of a physical law under the rotation 
of one’s coordinate system in describing the underlying theory leads to the 
conservation of angular momentum, and invariance under time translation 
(by setting, for example, one’s clocks back by a certain amount) and under 
space translation (by shifting the origin of one’s coordinate system) lead, re- 
spectively, to energy and momentum conservations. Other transformations, 
for example, involve space reflection (also known as parity transformation), 
time reversal, and charge conjugation, where in the latter every particle in 
the physical process under consideration is replaced by its anti-particle. [It is 
remarkable that by combining relativity with quantum mechanics, leading to 
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what is called quantum field theory, one concurs that the simultaneous trans- 
formation of charge conjugation, parity transformation and time reversal is 
a symmetry transformation based on such a merger.| 

Invoking invariance properties in developing a dynamical theory, conve- 
niently, narrows down one’s choices in providing the final stages of the the- 
ory. Not all transformations are, obviously, symmetry transformations of a 
given physical system. But invoking the invariance of a system under such 
transformations may provide the starting point in describing the underlying 
dynamical theory, and then one may consistently modify the interaction in 
the theory to take into account any symmetry breaking. 

A celebrated theorem due to Wigner, in the thirties originating on sym- 
metry due to rotation in space, spells out the nature of the transformations 
implemented on elements of a Hilbert space under symmetry transformations. 

To see how these implemented transformations on elements of a Hilbert 
space occur, we reconsider the general physical question arising in quantum 
physics (§1.2-§1.6). One prepares a system in a state |W). The question then 
arises as to what is the probability of finding the system is a state |) if |) 
is what we initially have? The latter is given by 


Tr [16)(4] lo) WI] = ely? (1.9.1) 


More generally, one may have a mixture described by a density operator 
(§1.5) 
p= Yl wi ibid wil: (1.9.2) 


The probability of finding a system in state |) if p, in (1.9.2), is what we 
initially have is 


r [19)(4| p] = Dwi Kolar (1.9.3) 


That is, in both cases one is confronted with the problem of computing 
\(|w)|? for given vectors |1), |¢). 

If |~’), |¢’) denote the vectors |w), |b) resulting under a symmetry trans- 
formation, then the invariance of the corresponding probabilities may be 
stated by the equality 


eb" | OV? = [eb] @)?. (1.9.4) 


That is, under a symmetry transformation, {|w’) ,|¢’)} give an equivalent 
physical description as {|y) ,|¢)}. 

Each of the vectors |’), |¢’), |W), |¢) may be scaled by arbitrary phase 
factors without changing the physically relevant probabilities given in (1.9.4). 
Accordingly, one needs to consider only unit rays (see §1.7) generated by such 
vectors. Although such overall phase factors are not important, the relative 
phases occurring when adding two or more vectors are physically relevant 
with far reaching consequences. Such details will be dealt with later such as 
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in 81.10, §8.7, §8.8 and Chapter 8, in general. For the time being we note 
that in computing an expression like 


(ard + a2tb2 | 8161 + Bob2)|? (1.9.5) 


overall phase factors of the vectors (aq |w1) + a2 |w2)), (G1 |G1) + G2 |2)), for 
complex numbers aj, a2, (1, 32 are unimportant, and one is dealing with 
rays generated by the two vectors. There are, however, in general important 
relative phase factors which arise, when expanding the expression in (1.9.5), 
in terms of scalar products of |¢1), |we) with |¢1), |¢2). 

To see how rays |w), |W’), as occurring in (1.9.4), are related and hence, 
in the process, spell out Wigner’s Theorem of Symmetry Transformations, 
we provide and re-iterate the following definitions. 


Definition 1. An operator L is called linear, or else anti-linear, if given any 


vectors |), |b), (al) + 81¢)), 
L(a|p) + 816) = aL |) + BL|¢), (1.9.6) 


or else 


L(a|y) + 816)) = oF DL |b) + BL 16), (1.9.7) 


for the corresponding rays generated, respectively, by the vectors L|w), L|d), 
L(a |) + B\¢)). [Note that these equalities do not necessarily hold for the 
vectors themselves but only for the corresponding rays just described.] 


Definition 2. A linear or else anti-linear operator U, as given as given in 
Definition 1, is called unitary or else anti-unitary if 


(Up|U¢) = (14), (1.9.8) 


or else 
(Up|Ud) = WI)", (1.9.9) 
for the corresponding rays generated by vectors |W), |d). 


Wigner’s Theorem: 

Under a symmetry transformation, these exists a unitary, or else, an anti- 
unitary, operator U such that (1.9.6), (1.9.8) or else (1.9.7), (1.9.9), hold 
with 

Iv") =U |) (1.9.10) 
|¢’) =U |¢) (1.9.11) 


(a |) + 8|6))’ = U(alh) + 86) (1.9.12) 
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as applied to rays. The latter means that |’), |’), (@|W) + |d))’ are defined 
up to overall phase factors. 

The proof of the theorem is not difficult but long if one spells out all the 
details. 

To establish the validity of the theorem, we proceed through various pro- 
gressive steps. 


1. Let {|f1),|fo),...} be an orthonormal basis in H. Any vector |) in H 


may written as 


[b) = Do ae |fe) (1.9.13) 
k 
By definition, under a symmetry transformation, 


(fic fed] = (Fe | fe) | = dee, 


i.e., 

(fil fe) = One. (1.9.14) 
Also 

fed’) | = [fe | b)| = level. 

Hence let 

an = (fe |v') (1.9.15) 
then we have 

lax| = |ag|. (1.9.16) 


2. Consider the vector 


Io) = 10") — So ae | Fe) 
k 
with a, defined in (1.9.15). Then 


loll? = Ww"? — SO lax? 
k 


= ||? — >- low|? = 0. 
k 
That is, |¢) is the zero vector, and we may write 
lo") = So ax | fa) (1.9.17) 
k 


with the a, satisfying (1.9.16). Thus | f{), |), ... provide an orthonormal 
basis for expanding the transformed state |W’). 
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3. Consider the three vectors (j 4 k, 7 41, k £1) 


lois) = fr) + If) (1.9.18) 
bin) = | fi) + |Fe) (1.9.19) 
|bige) = |f1) + 15) + fa) - (1.9.20) 
From (1.9.18), (1.9.16), 
wig) = el! | ft) + el |F;) (1.9.21) 
which may be rewritten as 
wis) = 9 [ (1) +e [9] (1.9.22) 


where the subscripts jl in 7;, indicate that the corresponding phase 
factor may, in general, depend on |f;) as well as on |f11). 
Similarly, 


Iie) = el? [| ft) +e | f) ] (1.9.23) 
[isn) = 8" [1 f) + el [3) +e [A)]. (1.9.24) 
By definition 
[big | Miya) | = Mebry | Page) | (1.9.25) 
from which we obtain 
1 + elltgm—nin)| = 2, (1.9.26) 
That is, 
eVsk1 = @'Ms1 (1.9.27) 


and the phase factor multiplying | fi) within the square brackets in 
(1.9.24) is independent of the vector | f;,). 
Similarly, 

[bin | Mise) | = bral erga) | (1.9.28) 
implies that 

eltts — gima (1.9.29) 

and the phase factor multiplying |f/) within the square brackets in 
(1.9.24) is independent of the vector | f;). 
Accordingly, under a symmetry transformation, |q1;), |W1e), |Wijex) in 


(1.9.18), (1.9.19), (1.9.20), respectively, transform to |e 5)> |i.) as given 
in (1.9.22), (1.9.23), and 


[Wigey = [| f,) +e [F5) + e'™ | F) | (1.9.30) 
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and the phase factors multiplying | fi), and | fj) within these square 
brackets are the same as the corresponding ones within the square brack- 
ets in (1.9.22) and (1.9.23). 

Since the phase factors e'/?, e'”*1 are understood to be defined relative 
to, and may depend on, | f{), we will suppress in the sequel the subscript 
1 in jl, Mk1- 


. Suppose first that a, #4 0. Then by definition 


bin YD] = [bre] )| (1.9.31) 
which implies that 
h p enine Oh) + = (1.9.32) 
ay a1 


That is, 
Re (m4) = Re (<*) : (1.9.33) 
ay QA 


Finally from the constraints |a,| = |a,| in (1.9.16), this implies that 


im(om)f =n (ef ont 


. By definition, 


bie LY’) = daze | YI (1.9.35) 
which implies that 
pt erm S 4 orm Se = p+ See a . (1.9.36) 
ay ay ay a1 


This equality together (1.9.33) and the constraints in (1.9.16) give 
Er Ne ellos 
ay ay a1 a1 
(1.9.37) 
We use this equality in conjunction with (1.9.34). If from the latter 


Im (-™ <r) =+Im (*) (1.9.38) 
ay ay 


then (1.9.37) implies that simultaneously 


Im (™ =i =+Im (3) (1.9.39) 
ay a1 


and vice versa, occurring with the same signs for all 7 # k not equal to 
1. 
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Im (-™ “«) =-—Im (**) (1.9.40) 
ay ay 

Im (™ “) =-—Im (3) (1.9.41) 
ay Ay 


simultaneously, and vice versa, occurring with the same signs for all 7 4 k 
not equal to 1. 

From (1.9.33) and (1.9.38), (1.9.39), or (1.9.40), (1.9.41), we conclude 
that we have two alternatives 


Similarly, if 


then 


a, = aye (**) (1.9.42) 


1 


for all k £ 1 uniformly for the expansion coefficients of |W’) in (1.9.17) or 


else : 
a, = aye" (**) (1.9.43) 
ay 


again for all & 4 1 uniformly for the expansion coefficients of |~’). 
That is, under a symmetry transformation, |) transforms to 


gy= ot of Le ink | (1.9.44) 


with e = 1 and a* denotes either a or else a*. The same # rule for the 
transformation applies to every coefficient in |W) multiplying the vectors 
| fx). 

The case a; = 0 is easily treated. To this end, we introduce the vector 
Ix) 


Ix) = fi) + So an Ife) - (1.9.45) 
k>2 
Since 
(fel e’)| = fel d)| = (fel xdI (1.9.46) 
we obtain 
a, =e™athay, k>2 (1.9.47) 


where a; is some phase factor. 
That is, in all cases, we have either 


j=" ope™ |.) (1.9.48) 
k 


or else 


= Sl ape lf) (1.9.49) 
k 
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up to overall phase factors, thus defining the corresponding ray transfor- 
mations. The phases e'”* are independent of the expansion coefficients, 
and for each k, e'”* is defined in terms of the pair of vectors | fx), | f1) only. 
Here it is also worth recalling that the vectors | fi), | fo), ... are pairwise 
orthogonal and hence the relative phases expi(n; — 7x), for 7 4 k, never 
occur. 


. It remains to establish, that under a symmetry transformation, if one 


vector transforms under the # rule of complex conjugation (respectively 
unaltered) rule, then every other vector transforms under the same # 
rule.* To establish this, we assume otherwise and, in turn, run into a 
contradiction. 

Accordingly, suppose that for two given vectors |w), |), 


lv) = So ak |fe) (1.9.50) 
k 
16) = So Be | fe) (1.9.51) 
k 
we have 
1y’) = S¢ axel | fx) (1.9.52) 
k 
Io") = So Bpel™ | fh) - (1.9.53) 
k 
Then 
d'1 ')| = 19) (1.9.54) 
implies that 
S/ Im (aja%) Im (3; 8%) = 0. (1.9.55) 


ik 


We consider the various possible cases regarding (1.9.55). 


There is at least one pair (j,k) with 7 4 k, such that 
Im (ajaz) 40 (1.9.56) 
Im (8; 84) 4 0. (1.9.57) 


We may then introduce the following vector 


hd = glo) -ilfn)] = Df. (1.9.58) 


4“ The importance of considering this step was particularly and rightly, emphasized 


by Weinberg (1995). 


[1] 


[IIT] 
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If this vector transforms like the vector |¢) we obtain 
Im (aj;0%) Im (77%) = 0. (1.9.59) 


But since Im (7;7) = 1/2, the above equality contradicts (1.9.56). Sim- 
ilarly, if |x) transforms like |w) we obtain a contradiction with (1.9.57). 
That is, |) and |¢) must follow the same # rule. 
No such a pair as in [I] exists, but there exists a triplet (j, k, 2), all unequal, 
such that 

Im (ajax) 4 0, Im (aj;az) = 0 

(1.9.60) 
Im (8; 8;) = 0, Im (687) 4 0. 


Note that if either (or both) of Im (a;a%), Im (8; 6%) are not equal to 
zero we are back to case [I]. We introduce the vector 


4 
(VB 


where Im (157%) = In (W577) = 1/3, Im (eZ) = 0. Again we run into a 
contradiction with Im (aj;az) # 0, Im (3,67) # 0 if |€) transforms either 
as |b) or |W). That is, |v) and |¢) must follow the same # rule. 

No such a triplet may be found as in [II], but we may find a quadruplet 
(j,k, €,m), all unequal, such that 


Ie) 


[| fi) ilfe) -ilfe)] = Dov LA) (1.9.61) 


Im (ajaj) #0, Im (aj;a%,) =0 
(1.9.62) 
Im (2,07) = 0, Im (aax,) =0 


and 


Im (8; 6;) = 0, Im (G;G;,) = 0 
Im (8487) = 0, Im (G¢(;,) 4 0. 


We may then introduce the vector 
1 


and note that 


(1.9.63) 


[lfi) — il fe) + Lfe) —ilfm)] = So vf) (1.9.64) 


a 


(1.9.65) 


Im (7772) = Im (ve77,) = 0. (1.9.66) 
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We then run into a contradiction and conclude of the necessity that the 
same transformation rule holds for |) and |@). 

[IV] For all pairs (j,k), 7 4k, Im (aj;a;) = 0. This in turn leads to two pos- 
sibilities. Suppose Ima, = 0 for all k. Then a, = aj and |w) transforms 
the same way as |), up to an overall phase. Suppose at least for one j, 
Ima; # 0. Then Im (a;az) = 0 gives 


a, = at (2) (1.9.67) 


J 


* 


since a,/ aj is just a phase factor, we again reach the same conclusion 


regarding |) and |d). 
For all pairs (j,k), 7 4 k, Im (8; 6%) = 0. This case is treated in the same 
way as case [IV] by reversing the roles of |b) and |q). 


Iv 


Therefore we have reached the conclusion that we may introduce an op- 
erator U such that 
e™ | f,) = U | fe) (1.9.68) 


and 
U (=: Ok in] = So axl | fr) (1.9.69) 
k k 
or else 


v (Scotin) = Soa (1.9.70) 
k k 


and therefore 
|b’) =U |y). (1.9.71) 


It is worth recalling that any vector |) may be expanded in terms of the 

orthonormal basis {|f;,)}, and the transformed vector |’) is then expanded 

in terms of the orthonormal basis {|f;)}. Also due to the orthogonality of 

the vectors | f;,), the relative phases exp i(n; — 7x), for j A k, never occur. 
Hence for any two rays |w), |¢) one has 


(Up|Ud) = S° af Gn = (14) (1.9.72) 
k 


or else 


(Ud |Ud) = >- an Be = 1d)" 
k 


= (el). (1.9.73) 


Finally consider any linear combination such as 


1.10 Probability, Conditional Probability and Measurement 65 


a |b) + B|d) = 50 (aag + BBr) |fr) - (1.9.74) 


k 
Under a symmetry transformation, we have shown that 
(a) + 814))' = S> (aan + BBx)e™* | f,) (1.9.75) 
k 


or else 


(ab) + 614))' = > (at af + B* Bye | fi) (1.9.76) 


k 


up to overall phase factors for the combination (a |b) + 3B |d) is which do not 
necessarily coincide with those of the separate vectors |W’), |¢’). According 
to (1.9.68), the right-hand of (1.9.75), or else (1.9.76), is given by 


a |b) + BU |¢) (1.9.77) 


or else by 


a*U |b) + B*U |¢) (1.9.78) 


which establish the linearity, or else the anti-linearity, of U upon identifying 
(a |) + 8/9) )' with U(a |) + 3|¢) ), for the corresponding rays. 


1.10 Probability, Conditional Probability and 
Measurement 


As a physical attribute of a given system, consider the measurement of 
some quantity B that may take on values from a discrete set of real numbers 
{b,b’,...}. For simplicity of the notation, let B also denote the self-adjoint 
operator associated with this physical quantity satisfying the eigenvalue equa- 
tion 


B |b) = b|b) (1.10.1) 
(b! |b) = 6(b', b). (1.10.2) 


The states of the apparatus, which registers in which state it has found 
the physical system, will be denoted by |a’,v’), where a’ corresponds to a 
“needle” registering-value. The latter will be denoted by ay if the “needle” 
registers the value b for the physical system. We consider the situation in 
which the apparatus may, in general, disturb the physical system causing the 
latter to make a transition from a state specified by the value b, as registered 
by the apparatus to a state specified by some other value, say, b’ afterwards. 
In |a’,v’), v’ denotes the collection of all other quantum numbers needed to 
specify the state of the apparatus. After a value for B is registered by the 
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apparatus, v’ may also change and take on some other set of values, say, v’” 
specifying together with the “needle” registering value ap, the new state of the 
apparatus. In this section, the states |a’,v’) of the apparatus will be taken 
to satisfy the orthonormality condition® 


(a” uv" |a’,v') = 6(a",a’)6(v",’). (1.10.3) 


1.10.1 Correlation of a Physical System and an Apparatus 


Initially, the state of the apparatus before it is switched on, or before it 
interacts with the physical system, will be denoted by |a,v). The initial state 
of the combined system consisting of the physical system under consideration 
and the apparatus will be taken to be of the form 


lwo) = bs Cb ») la, v) ’ S- |co|? =1. (1.10.4) 
b b 


The situation with the physical system and the registering apparatus is 
depicted pictorially in Figure 1.15 for a given b value. 

During the registration process by the apparatus, the combined system 
will evolve and finally after the registration has been completed, will be de- 
scribed by a state having the structure 


|b) = S- cp |b’) jay, uv’) C(O’, ay, ’; b, a, V”) (1.10.5) 


b,b!,v’ 


showing a correlation has occurred between the apparatus and the physical 
system and also incorporating a general disturbing transition as a result of 
the interaction between these two sub-systems. 

We introduce the class of unitary operators which lead from the state |~)o) 
to the possible states |7)) in (1.10.5). [For an apparatus that is not switched 
on, the identity operator is the corresponding unitary operator.] In general, 
such a unitary operator has the structure 


U= 5s |b’) Ja’, v') U(8', a’, v';b" a", v") (b"| (a v"| (1.10.6) 
leading to 
U ldo) = SY) cold’) |a’,v’) U(0',a',v';b, a,v) (1.10.7) 
b,b’ ,a’,v’ 


which, in particular, requires from (1.10.5) that 


° For greater generality in applications, such orthogonality conditions will be re- 
laxed in Chapter 8 (cf. §8.7, §8.9) allowing less restrictive apparatuses. 
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BEFORE 


° 
b 


Fig. 1.15. Distinguishable states of the physical system in question are shown 
on the left-hand side of the figure. The registering apparatus may, in general, 
disturb the physical system to jump from a state |b) to a state, say, |b’) 
following the registration process by interacting with this system. The “nee- 
dle” registering-value a’ together with a collection of other possible quantum 
numbers v’ specify the state of the apparatus. 


U(0', a’, v;b,a,v) = 6(a’, ay)C(b', ap, 1’; b, a, v) (1.10.8) 


implying that the apparatus has registered the value “b” for the physical 
system under consideration, and that 


U |Wo) = |) (1.10.9) 
with |) defined in (1.10.5). 
Unitarity of U means that 
Uiu=1, Uut=1 (1.10.10) 


giving, respectively, the general constraints 
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A N i WW WW Wy 1" A i / / / 
U* (0 —,a' usb Fay )U(b 76a," 50 7,a,v’) 


b! alu" 
= 6(b",b')5(a",a")5(v"",v’) (1.10.11) 


U(Y, a, U'; b”, a”, v")U* (v"”", a Vy, b”, Gin vy") 


b” al’ vl!’ 
= 6(0',0")6(a' a” )6(v',v’"). (1.10.12) 
In particular, for b’” = 0’, a” =a’, v'” = v' we have the normalization 
conditions: 
|U(b",a",v";b',a’,v’)|? =1 (1.10.13) 
b” a" jv!’ 
|U(',a’,v';b",a",v")|? =1. (1.10.14) 
b! alu" 


For the sequel, it is convenient to rewrite (1.10.5) as 
py = Yolo) (1.10.15) 
b 


where 


| ) = S> [B!) a, v') CCW, an, v's b,a,) (1.10.16) 
b/ if 


and (1.10.8), (1.10.13) imply that 


S*|CW!,ay,v';b,a,v)|° = 1 (1.10.17) 
b/,v’ 

and the inequality 
S*|C(W', 3,58, a,v)|? <1. (1.10.18) 


Equations (1.10.2), (1.10.3), (1.10.17) give the normalization condition 


(oe | o)) = 5(b',b). (1.10.19) 


1.10.2 Probability and Conditional Probability 


We may define the density operator 


p= |v) (1.10.20) 


and ask pertinent physical questions. 
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After the experiment has been completed on the physical system in ques- 
tion, what is the probability that the “experimentalist” will find (read) the 
registered value a, on the apparatus regardless in which state the physical 
system has made a transition to? From (1.10.15), (1.10.19), (1.10.20) this 
probability is given by 


n| 


5 (| P| = |p|? (1.10.21) 


as expected. 

After the registration process of the apparatus has been completed, what 
is the probability that a B-filter will find the system in the state |b) and a 
reading on the registering apparatus to be aj? This may be obtained directly 
from (1.10.5) to be 


lool? S> |C(b, ab, U'; b, a, v)| < lep|? (1.10.22) 


where we have used the inequality (1.10.18), implying, in general, that the 
probability in question is reduced over the value |cy|? due to the possibility 
of the apparatus causing the physical system to make a transition. 

The importance of the expressions for the probability on the left-hand 
side of (1.10.22) and the one on the right-hand side of (1.10.21) cannot be 
overemphasized. They lead to the following, almost tautological, question. 
First we note that in the initial state |q9) in (1.10.4), there is a summation 
over all b. If a measurement is made on the system and the apparatus read- 
ing is apy, then one may ask the question: what is the probability that the 
system immediately afterwards is in the state |b)? Such a probability is what 
probabilists call a conditional probability: 


“Given that the apparatus yielded the value a,, what is the probabil- 
ity that the system is in the state |b)?” 


This probability, written as, 
Prob [system in state |b) / apparatus yielded value ap| (1.10.23) 
is given by the ratio: 


Prob [system in state |b) and apparatus yielded ap] 


Prob [apparatus yielded ap regardless of the state of the system] 
(1.10.24) 
From the left-hand side of (1.10.22), and (1.10.21), this probability works 
out to be 


ley]? S- |C(b, ap, UV’; b,a,v)|° 


|eo|? =S°|C(b, ap, 0’; b, a, v)|? (1.10.25) 
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and the |c,|? term (for |c)|* 4 0) cancels out in the final expression for the 
probability. This is “as if” the measurement carried out on the system, giving 
the read value ay, on the apparatus, has forced all the expansion coefficients 
in (1.10.4) to be zero with the exception of c, for the particular 6 value in 
question, and replaced this latter coefficient by one! 

For an idealistic, rather simplistic description of an apparatus, the final 
state |W) for the combined physical system and the apparatus may be taken 
to be of the simple form 


1) = So ew |b’) law) (1.10.26) 
5 


and the probability in (1.10.23) then reduces from (1.10.24) to 


Jeo? _ 
oe (1.10.27) 
Ico 
giving the idealistic confirmation of measurement that if (i.e., given that) 
the apparatus yields the value a», then the system is found in the state |b) 
with probability one, and as if all the coefficients cy in (1.10.26) have been 
replaced by zero with the exception of cy which has been effectively replaced 
by 1, as mentioned earlier. 


1.10.3 An Exactly Solvable Model 


As a further illustration, consider a two-level physical system such as for 
a particle of spin 1/2 which will be considered below. The states of such a 
system will be denoted by |+). If the system is in the state |+), then an 
apparatus, detecting the system, will be found in some corresponding state 
denoted, say, by |+). Similarly, if the system is in the state |—), the apparatus 
will be found in some corresponding state |—). 

The states of the apparatus are shown in Figure 1.16, where the initial 
state of the apparatus, corresponding to a neutral position, is taken to be 
|O) with the “needle” of the apparatus initially pointing at O. 

As we will see later, the state |O) may be expanded as follows 


1 
0) = ale n/a) 4) 4 elt/4|—) ), (1.10.28) 
The origin of the phase factors exp(+ia/4) in the combination of the states 
|=) in (1.10.28) will become clear later and have to do with the needle spin 
state along the y-axis and 1/\/2 is a normalization factor. 

The state of the physical system plus the apparatus before their mutual 
interaction will be taken to be 


Wo) = (c+ |+) +e |-)) |0) (1.10.29) 
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ce 0 


|0) +) ey 


Fig. 1.16. The figure denotes the states of the apparatus with the first one 
on the left-hand side denoting its neutral state before it interacts with the 
physical system. 


|c,|? + |c_|? = 1, and for the final state 


2) =| (cs + 5 le+(cos —1) +ic_ sin ‘]) |-+) 
or slice sin k + c_(cos« — 1)] |-) \ |+) 


i { (c- fe slics sin « + c_(cosk — 1)]) ee 


Nl rR 


+ =[c,(cos« — 1) +ic_ sin «] |+) \ |—) (1.10.30) 


showing that a correlation has occurred between the apparatus and the phys- 
ical system, and also incorporating a general disturbing transition specified 
by the “angle” «. For an almost ideal apparatus, « is arbitrarily small — see 
(1.10.40)—(1.10.43). For an ideal apparatus, with « — 0, (1.10.30) becomes 


I) dear = C+ +) |) + e [-) |—)- (1.10.31) 


This state defines the perfect correlation between the apparatus and the 
physical system with the “needle” pointing in the same direction as the spin of 
the particle, respectively, in each case. For such an ideal apparatus, (1.10.31) 
is of the form in (1.10.26), and, for example, (for c, 4 0), 

Prob [system in state |+) and apparatus in state I+) | = ileal? 
(1.10.32) 


Prob [apparatus in state I+) | = |c,|? (1.10.33) 
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and hence from (1.10.24) 


lex? 
= 5 =1 


(1.10.34) 


Prob [system in state |+) / if apparatus in state I+) | 


~ Tex 


as in (1.10.27). 
If cy, #4 0, c_ € 0, in (1.10.31), neither the physical system nor the 
apparatus is in a definite state.® 
More generally, the state |yo) in (1.10.29) evolves to state |7) in (1.10.30) 
via the unitary operator 
U=1+A+B (1.10.35) 


where 


A=-[A-il+)(414 0 +)I-)Cl] 


«SIF +IRMHI-I-)CH- FMI] 10.89) 


B =|(cos x — 1)(|+)(+] + |-)(-1) + isin «(|-) 41+ 1+)(-1)] 


- SIEM +|=)(-1+ |= (41+ +)(—|]. (1.10.37) 
That is (see Problem 1.10), 


U |u9) = |v) (1.10.38) 
with |w9), |v) given, respectively, in (1.10.29), (1.10.30) and 


Ulu =UUt =1. (1.10.39) 


Later (§8.7) we will see how the unitary operator U in (1.10.35)—(1.10.37) 
actually arises as an elementary interaction between the variables of the phys- 
ical system and the apparatus. The system consisting of the apparatus and 
the physical system based on (1.10.30) will be analyzed in detail below. 

For the interpretation of « in (1.10.30), (1.10.36), (1.10.37), suppose that 
a system is initially in the state |+), i.e., cy = 1, c_ = 0. After the interaction 
of the apparatus with the system, the probability that the apparatus will be 
in the state |++) is from (1.10.30) given by 


1 1 

7 | + cose)? + sin? r| = 5 (1 + cosh). (1.10.40) 
Now given that the apparatus is in the state |+), the (conditional) prob- 

ability that the physical state has made a transition to the state |—) is from 

(1.10.30), (1.10.40) with cy = 1, c_ =0, 


® Such a state is called an entangled state. The properties of entangled states will 
be studied in detail in Chapter 8, in particular, in §8.10. 
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= sin? « 1 


4 
(1 + cos «) 


- 2 

sin” Kk 

= 1.10.41 
2 (1+ cosk) ( ) 


Nile 


which for an ideal apparatus, x — 0, goes to zero. 

Conversely, the probability that the system remains in the state |+) after 
the interaction with the apparatus irrespective of the state of the apparatus 
is, from (1.10.30) (with c, = 1, c_ = 0), given by 


1 1 
Z (cos K+ 1)? + (cos — 1)?| = 5( + cos? kK). (1.10.42) 


Given that the system remains in the state |+), the (conditional) probability 
that the apparatus is in the state |—)} is then from (1.10.30), (1.10.42). 


1 (1 —cosk)? 
SNE 1.10.43 
2 (1 + cos? k) ( ) 


which would again vanish for an ideal apparatus, as expected. 

Finally, the probability that the system makes a transition to the state |—) 
and the apparatus is found in the state |—) is (sin? x) /4, again for c, = 1, 
c_=0. 

Figure 1.17 depicts experiments, where a spin 1/2 particle prepared in 
the state |+1/2,z) goes through a Stern-Gerlach set up. In the subsequent 
analysis, we make the identifications |) 1/2,Z) in (1.10.29), (1.10.30). 
In part (d) of the figure, an operating apparatus is inserted of the type just 
described. As in Figure 1.3 in §1.1, the numerical factor m in |m, z), with m = 
+1/2, correspond, respectively, to spin components along the +2 directions. 

The coefficients cy in (1.10.29), (1.10.30) are now given by 


cz = (41/2,2|+1/2, z). (1.10.44) 


In part (a) of the Figure, the probability that the particle emerges and a 
spin flip occurs, i.e., with the spin component being along the —z direction, 
is obtained from the successive measurements symbols 


A,(—1/2)Az(—1/2)Az(+1/2) = |-1/2, z)(+1/2, 2| 
x ((-1/2, z|-1/2, Z) (-1/2,2|+1/2,z)) (1.10.45) 
relative to the z-, Z-, z-axes, respectively, to be given by 
Kee Sy er) elon), (1.10.46) 
Similarly for the probability of a non-flip of spin, we have 


|(+1/2, z|-1/2, 2)|* (1.10.47) 
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(c) 


Fig. 1.17. A spin 1/2 particle initially prepared in the state |+1/2,z) goes 
through the Stern-Gerlach set up illustrated, with a non-zero (0 < 6 < 7) 
angle of orientation of the Z-axis relative to the z-axis. In part (a), the proba- 
bility that the particle emerges and has a spin flip, i.e., is in the —z direction, is 
\(—1/2, z| —1/2, z)|? |(—1/2, Z| +1/2, z)|?. In part (b), the corresponding prob- 
ability is |(—1/2, z|+1/2, Z)|? |(+1/2, Z|+1/2, z)|”. In (c), this probability is 
zero due to destructive interference. In part (d), an operating apparatus is 
inserted to determine the component of the spin, in the intermediate stage, is 
along the +7 or the —Z directions but the “experimentalist” does not take a 
reading and hence does not know this result. The probability in this case that 
the particle emerges with a spin-flip is, up to an additional term of the order 
(sin? «)/2, the sum of the corresponding probabilities in parts (a) and (b). 
For « arbitrarily small, the interference term becomes small. This idealized 
experimental set-up mimics the famous “double-slit” experiment. 


in part (a). 
For part (b), the probabilities that the particle emerges and with a spin 
flip or with a non-flip of spin are, respectively, 


|(—1/2, z|+1/2, Z)|? |(+1/2, Z| +1/2, z) |? (1.10.48) 


|(+1/2, z|+1/2,z)|*. (1.10.49) 


To consider the situation in parts (c) and (d), we first define corresponding 
density operators 
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po = Wo) (ol (1.10.50) 


p= |o)v| (1.10.51) 


where the states |o), |W) are given, respectively, in (1.10.29), (1.10.30). 
In part (c), the probabilities that the particle emerges with a spin flip or 
with a non-flip of spin are, respectively, 


0 =|(—1/2, z|+1/2, 2)|? = Tr [A.(-1/2) 0] 


=|c,|? |(+1/2,2|-1/2, 2)|? + |e_|? |(-1/2, Z| -1/2, 2)? 
+ [ee (+1/2,2|-1/2, 2) (-1/2, 2|-1/2,2) 
+ c% ce, (-1/2,2|-1/2, z) (-1/2, 2|+1/2,2) | (1.10.52) 
1 =|(+1/2, z|+1/2, 2)? = Tr [A.(+1/2) po] 


= 2 = 2 
=le+|? |(+1/2,2|+1/2, 2)|" + le-|? |(-1/2,2| +1/2, 2)| 


a ES (41/2, 2] 41/2, z) (41/2, 2|—-1/2,2) 


+ ct cy (-1/2, Z| 41/2, 2) (41/2, z| 41/2, 2) |. (1.10.53) 


The numerical values 0 or 1 on the extreme left-hand sides of (1.10.52), 
(1.10.53) are obtained from the consideration of the successive measurements 
symbols 


A,(#1/2)A,(+1/2) = |F1/2, z)(+1/2, 2| ((F1/2, z|+1/2, z) ) 


I 


F1/2, z)(+1/2, z| 0(Fz, +2). (1.10.54) 


The third terms in the square brackets on the extreme right-hand sides 
of (1.10.52), (1.10.53) are referred to as interference terms, showing that de- 
structive and constructive interferences, have, respectively, occurred in these 
experiments. 

Now we come to the interesting situation depicted in part (d), where the 
apparatus has been inserted. The corresponding probabilities are given by 


Tr [A,(¥1/2)p] (1.10.55) 


where the operating apparatus is inserted but no reading of the apparatus is 
undertaken. 


76 1 Fundamentals 


From (1.10.30), (1.10.51), (1.10.55) a lengthy but straightforward calcu- 
lation (see Problem 1.13) shows that the probabilities in (1.10.55) work out 
to be equal to 


a 2 a 2 
lex? (+1/2, Z| F1/2, z)|° + |e—|? |(-1/2, Z| F1/2, z)| 


+ (sin «) (le? — |cz|?) sink + i(e_ch. — ch.cy) cos r| 


(1/2, 2|+1/2,2)P — (F1/2, 21-1/2,2)0? ) 


+ (sin «) [(le-P — |c4|?) cosk +i(e_ch, — ch ey) sin r| 


x F(( 1/2,2|£1/2, 2) (1/2, z|+1/2,2) 


~ (41/2, 2|#1/2, z) (1/2, z| -1/2,2) ) (1.10.56) 


which is to be compared with the extreme right-hand sides of (1.10.52), 
(1.10.53) given, in part (c) of Figure 1.7 in the absence of the apparatus. 

For a negligibly disturbing apparatus, specified by a very small «, the 
interference terms in the square brackets on the right-hand sides of (1.10.52), 
(1.10.53) essentially disappear, as seen from (1.10.56) for « ~ 0, by the mere 
insertion of an unread apparatus! and completely disappear for k — 0. 

Let us be more quantitative. To this end, if the angle between the Z- and 
z-axes is 3, then as we will see later 


(+1/2,z|+1/2,z) = cos 5 = (-1/2,z|—-1/2,2) (1.10.57) 


(—1/2,z|+1/2,z) = sin 5, (+1/2, 2| 1/2,2) =sin 5. (1.10.58) 


Upon setting 
1 
5 sin? « = e? (1.10.59) 


the probabilities corresponding to parts (a), (b), (c), (d) in Figure 1.17 are 
directly obtained from (1.10.46)-(1.10.49), (1.10.52), (1.10.53), (1.10.56), by 
using (1.10.57)—(1.10.59), and are spelled out in Figure 1.18, where cz are 
defined in (1.10.44). 

In particular, we note from Figure 1.17 (a), (b) that blocking one of the 
“Z-outlets”, for a spin projection, increases the probability that a particle 
emerges with a spin-flip from a 0 value (versus part (c)) to a non-vanishing 
value of (sin? 3)/4, where in part (c) both “Z-outlets” are kept open. 

Unlike the situations in part (c) and (d), the blocking of the “Z-outlet” 
shown in part (a) (and similarly in part (b)), the particle does not necessarily 
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» 9 ¢ ¢ 
5 sin® 8 + e* cos* B 


Fig. 1.18. In parts (a), (b), the probabilities are given for the particle emerg- 
ing has a spin flip (spin non-flip), i.e., is in the state |—-1/2, z) (|+1/2, z)). In 
parts (c), (d), the probabilities are given for the particle emerging with a spin 
flip or with a non-flip of spin, where both of the “Z-outlets” are kept “open”. 
In part (d), the apparatus is inserted in the intermediate stage. For the cor- 
responding experiments refer to Figure 1.17. The angle between the Z- and 
z-axes is equal to 3. The probability spikes are not drawn to scale for any 
specific value of (3. 


emerge from the left-hand side of the experimental set-up and hence the sum 
of the probabilities (sin? 3) /4, for spin-flip, and sin*(@/2), for a spin non-flip, 
shown by the spikes in Figure 1.18 (a), is less than one. 

When the apparatus is inserted in part (d), but unread by the “experi- 
mentalist”, the probabilities, up to the te? cos? 6 terms, are the sum of the 
corresponding ones in parts (a) and (b), respectively. That is, for « arbitrar- 
ily small, and hence small ¢, the mere insertion of the apparatus essentially 
washes away interferences as occurring in part (c), and they completely dis- 
appear for k — 0. 

In part (c), the particle emerges with a non-flip of spin. The probabilities 
of 0 (for a spin-flip) and 1 (for a spin non-flip) are not simply the sum of 
the corresponding probabilities in parts (a) and (b). There are interference 


terms = 
__ (sin’ 8) (1.10.60) 


sD, 
, (in? 8) _ (: ee 5) (1.10.61) 


and 
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within the square brackets on the right-hand sides of (1.10.52), (1.10.53), re- 
spectively, leading finally to the value 0 and 1 for the probabilities mentioned 
above. 

That is, the fate of a single particle, in the experimental set-up in part (c) 
of Figure 1.17 is inferred from the intensity distributions computed in part (c) 
of Figure 1.18 and these probabilities are not obtained by simply summing 
the corresponding probabilities in parts (a) and (b) which would exclude 
the interference terms discussed above. This has led to such a statement, 
attributed to Paul Dirac, referring to a photon, that a particle interferes with 
itself! The interference, however, occurs between different amplitudes when 
adding them up to obtain a given probability of occurrence as in (1.10.52), 
(1.10.53). It is misleading to think of a particle in the quantum world as a 
particle in a classical sense, otherwise one would run into a typical classical 
argument on how a single particle may simultaneously go through both “+2- 
outlets” leading finally to the built up of the interference pattern observed in 
several experiments, with very low particle densities used at each given time, 
but are run for long periods of time so designed to avoid arguments based on 
particle-particle interactions as necessarily the source of interference. 

Finally we note in reference to part (d) in Figure 1.17, if the apparatus 
is read and found to be in the state |+), then one may enquire about the 
probability that the particle is in state |+1/2,2) before it enters the second 
Stern-Gerlach apparatus. As in (1.10.23), one is then dealing with a condi- 
tional probability stated as follows: given that the state of the apparatus is 
|+), then what is the probability that the state of the spin of the particle is 
given by |+1/2,2)? From (1.10.30), (1.10.44), (1.10.56)—(1.10.58), (1.10.24), 
this conditional probability works out to be (see Problem 1.15) 


B B a 
cos? = sin? « + sin? = (1 — cos x)? 
je 2 % (1.10.62) 


cos” ° (1 + cos x)? + sin? 8 sin? k 


As in (1.10.25), we see that “as if” the measurement carried out by the 
apparatus, and found to be in the state |+) has, for « — 0, forced the 
coefficient c, = cos 3/2 in (1.10.29) (see (1.10.44), (1.10.57), (1.10.58)) to be 
replaced by one and c_ = sin 3/2 to be replaced by zero. 

The experimental set-up in Figure 1.17 mimics the so-called “double-slit” 
experiment (see also §8.7—§8.9, §9.1, §9.5) elaborated upon repeatedly in the 
literature and notably by R. P. Feynman. 

Later on in §8.7, we will see how the unitary operator U in (1.10.35)— 
(1.10.38) arises as an elementary interaction between the variables of the 
physical system and the apparatus. 

There are many views, descriptions and interpretations on what measure- 
ment theory in quantum physics is and how it may be achieved. Each view 
involves internal consistency problems that have to be checked and further 
generalizations may be often needed based on such consistency checks. 
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Additional details on measurement theory and related aspects will be 
given in Chapter 8. In particular situations will be encountered where the 
orthogonality conditions in (1.10.3) are necessarily relaxed. We will see, as 
one may argue, that taking into account of the environment coupled to the 
meter (apparatus) variables and consisting of everything else monitoring the 
observables being measured, provides the different alternative readings of a 
meter being sought. That is, after a measurement, the system is found in 
one of its alternative states rather than in a superposition of them. The 
information thus obtained on the system can be then described in usually 
perceived classical terms. The destruction of such quantum superpositions, 
resulting from the interaction with the environment, is referred to as quantum 
decoherence. 


Problems 


1.1. Derive the expression for the measure of fraction of the number of 
particles transmitted in Figure 1.4 and Figure 1.5 (c). 

1.2. Show that the inner product in (1.4.42) does indeed satisfy the prop- 
erties in (1.4.46)—(1.4.48), and establish the description independence 
of the inner product as given in (1.4.51). 

1.3. Establish the inequalities in (1.4.52), (1.4.53) using (1.4.42), (1.4.49). 

1.4. For a sphere of unit radius in the n-dimensional Euclidean space, a 
point on the surface of the sphere may be defined in terms of the n 
variables 


1 = cosvy 
Ly, = sin V,---sinv__1 cosvVz, k=2,...,n—-1 


Lyn = sind,---sinVyp_2sinVp_1 
with O< 8; <7 (§=1,...,.n-2), OS O,1 < 2m, Soa? =1. 
i=1 
(i) Show that the surface element on the sphere is given by 
dQ = (sin? 01d01) ( sin”? 02d02) Nene (sin' Bn—2d0n—2) (dd,-1). 


(ii) Using the definition (27) = [x7 dQ/ fdQ, show explicitly that 


(29) == (e) = 


as in (1.5.27). 
1.5. Verify the property (1.6.10) which is relevant to the incompatibility of 
measurements of polarizations along two different directions. 
1.6. Derive (1.6.21) corresponding to the transmission of light through the 
arrangement of polarizers in Figure 1.13 (b). 
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1.7. 


1.8. 


1.9. 


1.10. 


1.11. 


1.12. 


1.13. 
1.14. 


1.15. 
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Establish the separability and completeness of the space ¢?(00) defined 
in (1.7.12)—(1.7.14). 

Extend the analysis given for the position operator through (1.8.35)— 
(1.8.39) to the momentum operator p = —ihV (§2.3) in the x- 
description. 

[ Hint. For any ¢ > 0, define the normalized function 


2 \3/4 : oe 

F5,(x) = (<5) exp (;. . x) exp (-F) 
and show that | (inv — po) FS, | <e.] 
Establish the properties in (1.8.16)—(1.8.22) of the projection operator 
P(A) associated with a self-adjoint operator A. 
Show that the unitary operator in (1.10.35)—(1.10.37) leads from the 
state |Wo) in (1.10.29) to the state |) in (1.10.30) /(1.10.38) and verify 
explicitly the unitarity condition in (1.10.39). 
Work out the expressions for the probabilities in (1.10.40)—(1.10.43). 
Derive the equalities on the extreme right-hand sides of (1.10.52), 
(1.10.53). 
Show that the probabilities in (1.10.55) are given in detail in (1.10.56). 
From the probabilities given in (1.10.57), (1.10.58), establish the prob- 
ability spikes given in Figure 1.18. 
Work out the conditional probability given in (1.10.62). 


2 


Symmetries and Transformations 


This chapter is an extension of the first dealing with the formalism of 
quantum theory being sought. The present one is based on symmetries and 
deals with their implementations and the mechanics of the transformations 
of the underlying variables in the theory. The main symmetries and the cor- 
responding transformations in non-relativistic quantum mechanics are the 
Galilean ones. Different labellings of an event by two Galilean frames (§2.1, 
§2.2) are continuously related and the corresponding transformations, which 
relate the two labellings, reduce to the identity one in the limit that the two 
frames coincide. Accordingly, we may infer from Wigner’s Theorem on sym- 
metry transformations (§1.9), that such symmetries must be implemented, in 
the underlying Hilbert space of the theory, by unitary, rather than by anti- 
unitary, operators as they must continuously reduce to the identity in the 
limit of coincident Galilean frames — the identity element being, of course, a 
unitary operator. Group properties of these transformations are derived and 
their corresponding generators are introduced. These group relations give 
rise to basic commutation rules to be satisfied by the generators and their 
physical meanings emerge naturally. One of such generators is the Hamil- 
tonian generating time translations and describing the dynamics of systems. 
Explicit Galilean invariant Hamiltonians will be constructed for several phys- 
ical systems of interest. We also introduce discrete symmetries, and consider 
further properties of the generator of spatial rotations. Special attention will 
be given to so-called spinors which allow the description of particles of any 
spin. Finally supersymmetry is introduced in analogy to the Galilean ones 
and basic properties are established which will find several applications in 
later chapters. 


2.1 Galilean Space-Time Coordinate Transformations 


Consider a Cartesian coordinate system F resulting from a given one F 
by carrying a c.c.w. rotation of the coordinate system by an angle y about 
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a unit vector n, translating its origin by a given amount and setting the 
corresponding frame in uniform motion relative to that of F' (see Figure 2.1). 
The corresponding structure of the rotation matrix with matrix elements RY 
is worked out in Figure 2.2. 

A point P with coordinate vector label x in F' will be assigned the coor- 
dinate vector label X in F’. The clocks in F will be set back relative to those 
in F' by a given amount. 


Fig. 2.1. A space-time point P is labelled, in F and F, respectively, by (x, t) 
and (x,t). These labellings are related through (2.1.1), (2.1.2). 


Accordingly, the labelling of point P: (x,t) and (X,7¢) in the frames F' and 
F, respectively, are related by the transformations 


E = RUzi — a? — v't (2.1.1) 
f=t—rT (2.1.2) 


(with a sum over j in (2.1.1) understood), where a’, v’ are independent of 
time t,—v denotes the velocity of frame F' relative to F’, and the rotation 
matrix R with matrix elements R” satisfies 


Ri RE = 55k (2.1.3) 
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(with a sum over 7) and from Figure 2.2, it is explicitly given by 
R* = 5'* — c*n) sin p + (5* — nin*) (cos y — 1) 
= R'*(y,n) (2.1.4) 
where 


+1, if (¢,j,k) is an even permutation of (1,2,3) 
eJK — ¢ 1, if (i,j,k) is an odd permutation of (1,2,3) (2.1.5) 
0, if two or three of the indices are the same. 


The transformations (2.1.1), (2.1.2) define the Galilean space-time coordi- 
nate transformations. We note that z* in (2.1.1) defines the space coordinate 
labelling in F at a given fixed time in terms of x and t. 

In particular, we consider an infinitesimal rotation by an angle dy and 
introduce the vector 

Sw = ndy (2.1.6) 


To second order in dy, the rotation matrix R may be written as 


R=1+A+B (2.1.7) 
where ( T for transpose) 
A'=-A (2.1.8) 
= A2 
Bl=B= a: (2.1.9) 
and - 
Ate = —ctth G5, (2.1.10) 
Also up to second order in dy 
R11 =1-A+B. (2.1.11) 


In establishing (2.1.9), the following identity 
ee eee (ee ae a (2.1.12) 


is quite useful. 
For infinitesimal first order transformations with parameters da, dv, dw, 
57 the Galilean transformations (2.1.1), (2.1.2) reduce to 


xX =x-—da-—dvt—d5w xx (2.1.13) 


t=t—65r (2.1.14) 
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Fig. 2.2. Rotation of a vector x c.w. by an angle y about a unit vector n. 
This is equivalent to the rotation of the coordinate system, instead, c.c.w. 
about n and keep the direction of x fixed. From the figure 


x’ =(x-n)n+h 


where 
h = [x — (x- n)n|cosy+x X nsiny 
or 


x’ =x—nxX xsing+ [x— (x-n)n] (cosy — 1) 


with X in (2.1.13) given at a given fixed time t. We set 
bx =x—xX=ba+dvt+d5w xx (2.1.15) 


St =t—#=6r. (2.1.16) 


We provide an exponential representation of the rotation matrix R = 
(R'*) in (2.1.24). 
To this end, we define the matrices M/, 7 = 1, 2,3, with matrix elements 


|" =iet#* (2.1.17) 


[M’ 


i,k =1,2,3. In particular, 


2.1 Galilean Space-Time Coordinate Transformations 85 


[mn - M]** = ie*n/ (2.1.18) 
[(n - M)?]"" = 6% — nin* (2.1.19) 
(n-M)?=n-M (2.1.20) 


and we may rewrite the matrix R as 
R=1+in-Msing+ (n+ M)?(cosy — 1). (2.1.21) 


Upon differentiation the latter with respect to y and using (2.1.19), 
(2.1.20) we obtain 


i =(in-M)R. (2.1.22) 
The integration of this equation, using the boundary condition 
Rl og = 1, (2.1.23) 
leads to the expression 
R=explign- M] (2.1.24) 
or using the standard notation 
w =n (2.1.25) 
(see also (2.1.6)), one has 
R= exp liw-M]. (2.1.26) 


The matrices M’, satisfy the commutation relations 
[M*, M*] = ie* M* C120) 


as is easily obtained upon using the identity in (2.1.12). 
These matrices are explicitly given by 


00 0 

M'=i(00-1 (2.1.28) 
01 0 
001 

M?=i{ 0 00 (2.1.29) 
-100 
0-10 

M?=il1 00 (2.1.30) 
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Also (with T for transpose, Tr for trace, det for determinant) 


(m?)'=M?, (M?)" =—Ms (2.1.31) 
M? = 21 (2.1.32) 

Tr [n - M] =0 (2.1.33) 
R'=Rt=R1 (2.1.34) 
RR'=R'R=1 (2.1.35) 

det R=1 (2.1.36) 

Tr (R) = 2cosyp +1. (2.1.37) 


Finally, we derive a useful identity involving the rotation matrix. To this 
end, from the expression of x’ given in Figure 2.2, we have 


d ri ! 
de =—-n xXx’. (2.1.38) 
Let 
A(y) = exp[—yn- (x x V)] xexp [pn: (x x V)] (2.1.39) 


which upon using the commutator 


[Mae beta” (2.1.40) 
leads to FF 
qe?) = —n X A(y). (2.1.41) 
Using the boundary condition 
A(y)|,-9 =X (2.1.42) 


and (2.1.38), together with the expressions (2.1.24), (2.1.26), one may infer 
the identity 


exp[—w - (x x V)] xexp [w- (x x V)] = exp [iw - M)] x (2.1.43) 


where (w+ Mx)’ = iuJe'J*x* in the sense of matrix multiplication. 


2.2 Successive Galilean Transformations and the Closed 
Path 


Consider two successive Galilean transformations written in matrix form: 
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Got) 
(2.2.2) 


(2.2.3) 


(2.2.4) 


These allow us to relate the final configuration to the initial one through 


(2.2.5) 


(2.2.6) 


A Galilean transformation is specified by the following quadruplet 


CG a, v, R). 


From (2.2.5), (2.2.6), the following group properties easily follow: 


(1) 
(7, 4,0, R)(7,a,v, R) = (7+ 7,a+ Ra — 07,0 + Rv, RR) 


(2) The identity element J is given by 
I = (0,0, 0,1) 


(3) The inverse is worked out to be 


(7,4,0,R)-' = (-7,-Ro(@+7r),-—R-+v, R-) 
as one easily checks that 
(7, a,v, R)~*(r7, a, v, R) = (7,0, v, R)(7,a,v, R)* 
= (0,0,0,1) 
(4) Finally, we have the associativity rule 
(73,43, U3, R3)[(T2, @2, v2, R2)(71, a1, 01, R1)] 


= [(T3, 3, U3, R3)(t2, a2, V2, Re)\(11, a1, 1, R1) 


O27) 


(2.2.8) 


(2.2.9) 


(2.2.10) 


(2.2.11) 


(2.2.12) 


Of utmost importance for the subsequent analysis are the following suc- 


cessive transformations forming a closed path given by 
(7,a,v, R) =(T2, 42, v2, Ro) (m1, a1, 01, Ri)" 


x (72, 42, V2, Ro) (71, 41, U1, Ri) 


(2.2.13) 
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represented pictorially by the box diagram 


emphasizing the reversal of the transformations in the third and the fourth 
segments of the path. 

The resulting elements in (7,a,v,) arising from the closed path trans- 
formation given in (2.2.13) are readily worked out from the group properties 
spelled out above. They are given by 


7T=0 (2.2.14) 
a= —R,' [a2 + vet. + Ry (a1 az — Roa, + v2T — v1T2) v2T2] 
(2.2.15) 
v = —Rz* [ve + RY (v1 — v2 — Rov1)| (2.2.16) 
R= RyRy RoR. (2.2.17) 


The infinitesimal transformation corresponding to (7,a,v, R) in (2.2.13) 
follows from (2.2.14)-(2.2.17). The corresponding infinitesimal parameters 
are readily obtained from the latter four equations and are given by 


Sr =0 (2.2.18) 
$a = Azgda, — Ay daq + v1 6T2 — 267, (2.2.19) 
Su = Aodu, — Arbue (2.2.20) 
5(R —1) = Ag A; — Aj Az = [ Ao, Ai] (2.2.21) 


where A is defined in (2.1.10) (see also (2.1.9)) with a corresponding infini- 
tesimal dw. 
In detail (2.2.19)—(2.2.21) are given by 


ga = — (5W. x da, me bw, x daz) + dv 1572 ed bv2d71 (2.2.22) 
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bv = (5wy X dvo — dW X dvi) (2.2.23) 

8(R—1) = —e)* 5a) (2.2.24) 
and 

bw = dw, X We. (2.2.25) 


Hence under the closed path transformation (2.2.13), we have the trans- 
formations x — xy, t — ty = t, leading to the infinitesimal expressions 


bx = x—xy = da+dvt+5w xx (2.2.26) 
St =t—ty =0 (2.2.27) 


(compare with (2.1.15), (2.1.16)), and the parameters 5a, 5v, 5w are given, 
respectively, in (2.2.22), (2.2.23), (2.2.25). 


2.3 Quantum Galilean Transformations and Their 
Generators 


The Galilean transformations include the trivial identity transformation 
I in (2.2.9) which may be obtained from an arbitrary Galilean transforma- 
tion, specified by the quadruplet (7, a,v,R), when the parameters in 7, a, v, 
(R—1) are led continuously go to zero. This continuity property of the trans- 
formations approaching the identity in the just discussed limit implies from 
the celebrated Wigner’s Theorem (§1.9), that Galilean symmetry transforma- 
tions on the states in quantum physics must be represented by unitary rather 
than by anti-unitary operators with the identity being obviously a (trivial) 
unitary operator. The transformations are then continuously connected with 
the identity. 

Accordingly, invariance under a Galilean transformation dictates that a 
state |y) obeys the transformation rule |) > |7): 


|b) =U |v) (2.3.1) 


with 

So (2.3.2) 
guaranteeing that a transition amplitude (|) transforms as a Galilean 
scalar, i.e., 


(| ¥) = (|). (2.3.3) 


For systems involving non-zero spins, |W) is a multi-component object (cf. 
(2.7.20), §2.8, (2.8.24)) and (2.3.3) involves a sum over such components, i.e., 
in detail it is of the form 


90 2 Symmetries and Transformations 
DFP) = Ve lv. (2.3.4) 


b 
In the coordinate description, invariance under space translations and 
boosts, providing a uniform motion of a frame relative to another, but not 
involving rotations, means that 


(x|B) = (ld), l= (UT (2.3.5) 


and the former equality may be rewritten in the more familiar form 
BR) = vo). (2.3.6) 


By definition of a scalar, (2.3.6) holds under rotations as well for spin 0. On 
the other hand for non-zero spins, the transformation law in (2.3.6) cannot 
hold true under rotations. This is most apparent for spin 1, with (x) as 
a vector (field), since under rotations, a component dv; (x) will be a linear 
combination of the components ~;(x) (see (2.7.18)). This is fully exploited in 
§2.8 for arbitrary spins and the transformation law (2.3.6), under rotations, 
in the general case is simply modified to an expression of the form 


WD (x) = AM p(x) (2.3.7) 


where the (A) are some matrices which depend on spin. 

As discussed above in the beginning of this section, since the Galilean 
transformations are continuously connected with the identity, then for infin- 
itesimal transformations, one may write a unitary operator U as 


U=1+iG (2.3.8) 


where G has the property that it vanishes when 67, da, dv, dw all approach 
zero (see, in particular, (2.1.15), (2.1.16)), and the i factor is chosen so that 
G is a self-adjoint operator Gi = G. 

The operator G in (2.3.8) is dimensionless, and the operator coefficients 
of 57, 5a, 5v, Sw, respectively! in G have, what is called in quantum physics, 
natural units. These operators, as we will see later, have important physical 
meanings and are, in general, counterparts of classical quantities which, how- 
ever, are defined in terms of different units. In order to make the comparison 
with these classical standards, an overall conversion factor is introduced by 
dividing G, in (2.3.8), by this conversion factor which necessarily has the 
dimensions of action. The operator coefficients of 57, 5a, dv, dw are then all 
defined in the same units as their classical standards. The unit of action in 
quantum physics found empirically is that provided by fh (Planck’s constant 


' See (2.3.13) for details concerning these operator coefficients whose physical in- 
terpretations will then follow. 
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divided by 27).? By introducing this conversion factor we may rewrite (2.3.8) 
as 


U=1+ ,G (2.3.9) 


in conformity with the classical standards. For example, as we will see below, 
the coefficient of 5a will be associated with momentum and hence by dividing 
by the unit of action A in (2.3.9) it will have the same units as in classical 
physics. 

We consider successive Galilean transformations corresponding to the 
overall closed path in (2.2.13). That is, we consider the unitary operator 


U =UluluL.U, (2.3.10) 


For infinitesimal transformations, the right-hand side of (2.3.10) is given by 


(1- 50+...) (1- 51+...) (14+ 540+...) (1+ 5+...) 


1 
=1+ 55[G1, | pevekes (2.3.11) 


where we note that G (respectively G2) are of first order in the infinitesimal 
parameters 571, a1, v1, 5w1 (respectively 572, 5a2, dv2, 5w2) and second 
order terms in these parameters, e.g. (57,)?, (5715a1),..., (respectively (572)?, 
(5725a2),...), cancel out. The latter fact is emphasized by writing dots ... in 
each of the factors on the left-hand side of (2.3.11). Upon writing U on the 
left-hand of side of (2.3.10) as in (2.3.9) we obtain 


1 
G= 7 Gu (2.3.12) 


The most general structure of G in (2.3.10) is 
G=dba-P+iév-N—-i7H+5w-J+5¢1 (2.3.13) 


guaranteeing the fact that G goes to zero when da, dv, 57, dw all go to zero 
provided 45¢, contributing to a phase factor, vanishes as well in these limits. 
The operators P, N, H, J are called generators of the Galilean transforma- 
tions. P generates space translations, N, which is sometimes referred as the 
“booster”, generates uniform motion, H generates time translations, and J 
generates space rotations. 

Similarly, for G1, G2 we may write 


? The question arises as to what happens if one chooses another constant with 
units of action, such as a multiple of h: kh for some &k > 0, as a conversion 
factor. The energy levels of the hydrogen atom (§7.2), for example, would come 
out to be —me*/2K7h?n?. By confronting theory with one or a finite number of 
experiments then fixes the value of this conversion factor. 
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G; = da;-P+iév;-N—67;H + 5w;-J (2.3.14) 


j = 1,2. Clearly, the addition of any multiple of the identity in (2.3.14) cannot 
contribute to the commutator [G),G2] in (2.3.12). 

We note that in (2.3.14) we have written —57;H rather than 57;H as 
this provides, in particular, the correct overall sign for H corresponding to 
the observable it represents. [This is spelled out in §2.5.] G in (2.3.12) is 
anti-symmetric in the interchange of the indices 1 and 2 in (Gi, Go). This 
property is explicitly verified in the expressions for dv, 57, 5w respectively, 
given in (2.2.22), (2.2.23), (2.2.25). [Also we note from (2.2.18) that 57 = 0] 
Accordingly, the coefficient 5¢ of the identity 1 in (2.3.13) must be also anti- 
symmetric in the indices 1 and 2 and is to be constructed out of dot products 
of a,, d5v1, 5W 1, dag, dv2, We. The most general structure of 5¢ is then 


b¢ = MM (5a; . dvo rz dao . dvi) +B (5w, . dve a bWe . dvi) 
+£# (51 * bay i dW. - da,) (2.3.15) 


The coefficients F, B will turn out to be zero. 

All told, we explicitly carry out the commutation relation in (2.3.12) by 
using the expressions in (2.3.14). Upon the comparison of the coefficients with 
those arising in (2.3.13), (2.3.15) and by using, in the process, the equalities 
(2.2.18)-(2.2.21), (2.2.22)—(2.2.25), we are led to Table 2.1. 

From the commutation relation 6 in Table 2.1, it is easily derived that 

Be ~ ages, baer cll (2.3.16) 
which is readily worked out to be zero (see Problem 2.4). Similarly, from the 
commutator 7 in the Table, it is easily shown that B = 0. 

Instead of transforming a state |W), under a Galilean transformation, one 
may keep |w) fixed and transform instead the observables A — A such that 
expectation values 


(b|A]P) = (Ald) (2.3.17) 


are unaltered. From (2.3.1), this requires that 
A=UtAU. (2.3.18) 


[This may be generally referred to as an observable in a Heisenberg-like pic- 
ture.] For infinitesimal transformations given in (2.3.9) we then have 


a 
bA=A-A=— [A, G]. (2.3.19) 


Under a Galilean transformation we may then write from (2.3.13), (2.3.14) 


ihS.A = Sa’ [A, P?] + 5v)[A, N?] 
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— 5r[A, H] + bw’ [A, J’). (2.3.20) 
As an application of (2.3.20), we consider the operator 
Q= os (tP — N) (2.3.21) 
M 
for 57 = 0. From (2.3.20) and Table 2.1 


SQ = Sat dvt + bw x Q (2.3.22) 


which upon comparison with the transformation law in (2.1.15), one identifies 
Q with the position operator which, in the sequel, is denoted by X. 

Table 2.1 and/or (2.3.22) then lead to the following commutation relations 
involving X: 


[X*, x7] =0 (2.3.23) 
[Ato | Sines x (2.3.24) 
[X*, P| = ind” (2.3.25) 


Table 2.1. Commutation relations of the generators of the Galilean trans- 
formations as obtained from (2.3.12) upon the comparison of the appropri- 


ate coefficient-parameters defining the transformations. 
COEFFICIENT RESULTING COMMUTATOR 
1. ba} dai, (Pe Pe "0 
2. (a) 572 — 05571) Wey cea) 
3. (Su}572 — 6v}571) [H, N*] = inp’ 
4. bw} dw Le | iher ey” 
5. (Sai5u3 — a5v)) [P*, N?| =ihMs" 
6. (5a) Sw} — 5a55w!) [P*, J7] = ik (e7* P* — BS") 
7. (wi dv} — 5w55v}) [N’, J] = ih (e*N* — BO”) 
8. (Sw} S72 — w5571) Eb jed” | =O 
9. vi bv3 NS ve | 0 
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inP* 


[LS] = re 


(2.3.26) 


On the other hand for a = 0, 5v = 0, 5w = 0, (2.3.20) gives for 5s 40 


2 


; P 
inbX* = —57[X*, H] = —-ihdr vi (2.3.27) 


where we have also used (2.3.26). Hence from 6X‘(t) = X‘(t) — X 
X(t) — X' (E+ &t), (2.3.27) gives 
Pp 


X=5 (2.3.28) 


where we have used the fact that X(¢) = X(t), for 5a = 0, 5v = 0, 5w = 0. 
Finally, we note from Table 2.1 again that 


2 4 
as = ~in— =A, |) (2.3.29) 


Here in writing the last equality in (2.3.29) we have used (2.3.26). From 
(2.3.29) and the commutators 1 and 2 in Table 2.1 and (2.3.25) one is led to 
the following general expression for H 


2 


P 
H=>_—+H 2:3: 
au ttt (2.3.30) 
where 
[Ay Pl =O (2.3.31) 
and 
(Hy, X*| = 0: (2.3.32) 


The expressions (2.3.28), (2.3.30) lead inescapably one to identify P with 
the momentum operator associated with the system in question, M with its 
mass, and H with the total Hamiltonian operator. 

The momentum operator P generates space translations. More specifi- 
cally, for an infinitesimal numerical quantity da, the unitary operator 


U(Sa-P)=1+ 58 “Pp (2.3.33) 
leads from (2.3.25) (see also (2.3.18)) to 
Ul(5a- P)XU(5a-P) = X — da. (2.3.34) 


Similarly, ; 
U(—dp-X)=1- 8p “x (2.3.35) 
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for an infinitesimal numerical dp, leads from 
Ul(—85p- X)PU(—5p-X) =P — 5p (2.3.36) 


to a shift of the momentum operator P by the numerical quantity —dp. 
For a rotation by an infinitesimal angle §¢ about a unit vector n (6w = 
nd) we have also seen that 


U(Sw+J)=1+ bw I (2.3.37) 
which leads to 
Ul(5w + J)KU(S5w-J) = K—-5w x X (2.3.38) 


where we will see in §2.7, that J is identified with the angular momentum 
operator. 

A finite space translation by a numerical quantity a may be obtained by 
writing 5a = a/N, and consider successive N infinitesimal translations each 
by an amount a/N for N — ov: 


a N i 
(u (= P)) > exp (52 P) (2.3.39) 
(see also Problem 2.5). 


Similarly upon writing 5p = p/N, 6w = w/N we have for finite momen- 
tum shifts and finite rotations: 


(u (-£ a) — exp (-; ; x) (2.3.40) 


(Cea) es) ca) 


for N — ov, respectively, where w = n@, for numericals p. 
That is, under a shift of a coordinate system by an amount a, a state 
vector |) changes in the following way 


|b) > exp (za . P) |¢). (2.3.42) 


Similarly, when a coordinate system is rotated c.c.w. through an angle ¢ 
about a unit vector n, a state vector |W) changes by the rule 


|b) — exp (jn . 3) |b) = |’). (2.3.43) 


We now provide explicit representations of the generators P, X. 
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In the X-description, under an infinitesimal space translation (see (2.3.5)) 


(x|%) = (x|y) (2.3.44) 
where = 
|b) =U |p) (2.3.45) 
with 
(x| = (x — da| (2.3.46) 
giving 
(x — dalUw) = (x|v) (2.3.47) 
or 
(Uw) (x) = W(x + Sa). (2.3.48) 
A Taylor expansion of the right-hand side expression leads to 
(Uw)(x) = U(x) + ba- Voi(x) (2.3.49) 
from which we may infer that 
U=1+4a-V (2.3.50) 


leading to the following representation for the momentum operator 
P = -ihAV. (2.3.51) 
Similarly, in the P-description, under a momentum shift (translation), 


(Pv) =(p|4), P=p- 4p (2.3.52) 


leading to 
(Uy) (p) = Y(p + dp) (2.3.53) 


and from a Taylor expansion, we may infer that 
U=1+65p:-Vp5 (2.3.54) 


thus giving the following representation for the position operator (see 
(2.3.35)) 
X =ihV . (2.3.55) 


Generation of time translations and coordinate rotations will be dealt 
with in detail later in 82.5 and §2.7, respectively. 
Space translations are provided by the unitary operator (see (2.3.42)) 


U = exp (52 P) (2.3.56) 


Invariance under space translations implies the vanishing of the commutator 
of P with H (see entry 2 in Table 2.1) and the transformation provided by 
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the unitary operator in (2.3.56) is referred to as a symmetry transformation. 
The latter, in particular means, that the momentum operator satisfies the 
time independent property 


P(t) = e#/" peik/h _ p (2.3.57) 


and implies the conservation of momentum. In particular, (2.3.57) establishes 
(see also (2.3.17)) the time independence of the expectation value 


(HIP)|d) = (Ply) (2.3.58) 


in a state |q). 
Similarly, rotation of a coordinate system, is implemented by the unitary 
operator (see (2.3.43)) 


U =exp (Fen 5) (2.3.59) 


in the underlying Hilbert space. As a symmetry operation, rotational invari- 
ance, leading to the vanishing of the commutator of J with H (entry 8 in 
Table 2.1), implies the time independence of J(t), similarly defined as P(t) 
in (2.3.57), and gives rise to the conservation of angular momentum, and, in 
particular, to the time-independence of expectation values (¢ |J(t)|w). 

Of particular interest in using (2.3.20) is to derive the equations satisfied 
by X(t) and P(t), under pure time translations. These may be obtained from 
(2.3.20) by setting a = 0, 5v = 0, 5w = 0. For example, 


~[X(t), H]57 = 6X(t) = X(t) — X(2) 


l 
sa 
pe 
| 


(f + 67) (2.3.60) 


or 


ihX(t) = [X(t), H] (2.3.61) 
since X(t) = X(é). Similarly, we have 
ihP(t) = [P(t), H]. (2.3.62) 


Equations (2.3.61), (2.3.62) are referred to as Heisenberg’s Equations of Mo- 
tion. 


Finally, consider a system of n particles of masses m1,..., 7, with asso- 
ciated position operators X,,...,X, and define the center of mass position 
operator 

m 
x=)> Rie (2.3.63) 
a 


where M = 5° ma is the sum of the masses of the particles. Invariance under 


a 
an infinitesimal coordinate translation means as before 
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(xpreag ha CYS Cigan soe Pee. (2.3.64) 

This leads to the following chain of equalities 
(x, — ba,...,Xp — ba|P) = (x1,...,Xn |) (2.3.65) 
(x1,-..,Xn|Up) = (x + ba,...,X%n + dal?) (2.3.66) 


(Uw) (x1, eens Xn) = W(x1 a da, eee Xn + da) 


= ( +da-S> v. W(X1,...;X%n). (2.3.67) 
From (2.3.50), (2.3.51), we may introduce the total momentum operator 
a (2.3.68) 


with 
P, = -ihVa (2.3.69) 
denoting the momentum operator associated with the a‘” particle as ex- 


pected. A property consistent with (2.3.23), (2.3.63), is that the operators 
X, associated with different particles commute as well, 


xi, x5] =; (2.3.70) 
From (2.3.69), one also has 
xi, Pa] = i555 (2.3.71) 
and sg 
[Pi PB] =i: (2.3.72) 


2.4 The Transformation Function (x|p) 


In this short section we derive the explicit expression for the transfor- 
mation function (x|p) from the momentum: p-description to the position: 
x-description. 

The resolution of the identity in the p-description is written as 


3 
a= [slp wl. (2.4.1) 


The transformation function (x|p) from the p-description to the x- 
description arises through (see also (1.4.17)) the relation 
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3 
xls) = fee Gxip) (ld). (2.42) 


We consider infinitesimal shifts by numerical factors 5x, dp: x — x — 5x, 


p — p-— 5p provided through unitary operators (see §2.3, (2.3.5), (2.3.46), 
(2.3.39)) explicitly given by 


(x — dx| = (x| (1 - 15x . P) (2.4.3) 
and (see (2.3.51), (2.3.52), (2.3.54)) 


Ip — 5p) = € = “sp : x) IP) - (2.4.4) 
The formal identifications 
P |p) =p |p) (2.4.5) 
(x| X = x (x| (2.4.6) 
then lead for infinitesimal numericals 5x, 5p to 


5 (x|p) = (x|p) — (x — 6x|p — dp) 


* (x|(5x+P + 5p-X)[p) 


i 
= 7 (ox- p+ dp- x) (x|p) 
i 
= 5 (xlp) 6(x- p) (2.4.7) 
and upon integration to 
i 
(x|p) = exp & . p) (2.4.8) 


where the integration constant has been set equal to one by adopting the 
normalization condition 


3 
[eee (*1P) (wie) = 8% - x), (2.4.9) 


The importance of the expression (2.4.8) for the transformation function 
(x|p) cannot be overemphasized. From (2.4.2) it leads to the Fourier trans- 
form 


3 
W(x) = | Shi e™P/! a(p) (2.4.10) 
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as a physical transformation from the p-description to the x-description. 
The inverse transform from the x-description to the p-description simi- 
larly follows to be given by 


w(p) = [ox e P/M a(x) (2.4.11) 


with ; 
(p|x) = exp (-7 P) (2.4.12) 


Finally, the resolution of the identity in the x-description may be written 
as 


1= [expo (x| (2.4.13) 
with the normalization condition (2.4.9): 
(x’|x) = 63 (x’ — x). (2.4.14) 


From (2.4.8), (2.4.13), we then have the following normalization condition 
for momenta 


(p'|p) = (2h)?5°(p! — p). (2.4.15) 


2.5 Quantum Dynamics and Construction of 
Hamiltonians 


2.5.1 The Time Evolution: Schrodinger Equation 


Had we written +57,;H instead of —d7,;H in (2.3.14), the commutator 
[H, N4] in Table 2.1 would have been equal to —ihP? instead of +ihP’. This 
in turn would have led H in (2.3.30) to be of the form — [P?/2M + Hj], 
where H} is some operator which satisfies (2.3.31), (2.3.32) [as before, P? 
does not commute with X], with the wrong sign for the kinetic energy term. 
That is, for an infinitesimal transformation t — t — 67 (see (2.1.16), (2.1.2)) 
the corresponding unitary transformation must be given from (2.3.14) to be 


U(-8rH) =1- ;orH (2.5.1) 


with H denoting the Hamiltonian of the system having the general form (see 


(2.3.30)-(2.3.32)) 


P 
H = —+H 2.5.2 
aM I (2.5.2) 


where Hy is referred to as the internal energy and satisfies 


[H},P] =0 (2.5.3) 
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(Hi, X] = 0 (2.5.4) 


for a Galilean invariant theory. 

To obtain the unitary operator for finite time translations we write 
57 = T/N, as in §2.3, (see, e.g., (2.3.39)) and consider successive infinitesimal 
transformations for N — oo: 


(u (-tH))" — exp (-;7#) : (2.5.5) 


Let |) denote a state determined at time ¢ in the frame F (see §2.1—§2.3). 
For infinitesimal 57 this state is given by |¢,) in F: 


[P:) = (1 7 ort) |e) - (2.5.6) 


For pure time translations, we recall from §2.1, that the frames F and F are 
the same, except that time readings are carried out, at the same instant of 
time, by two clocks with one set simply —67 units of time back relative to 
the other. That is, 4) denotes the state |¢145,). Hence from (2.5.6) 


Wersr) = (1 a ort) |W) (2.5.7) 
or ; 
lees) — he) = — 5 87H |) 


which by taking the limit 57 — 0 gives the familiar Schrodinger equation 


iA ey) = He). (2.5.8) 


In particular, by setting |Wo) = |W) for t = 0, (2.5.8) may be integrated 
to give 


we) = e H/F [yp (2.5.9) 


for the time evolution of a state |~). 


2.5.2 Time as an Operator? 


It is important to emphasize that time is a parameter and is not promoted 
to an operator on physical grounds. To see this, suppose that top stands for the 
corresponding “operator”. Then for 5a = 0, dv = 0, dw = 0, and numerical 
57 #0, (2.3.20) implies that 


ih Stop = —57|top, H] (2.5.10) 
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which upon comparison with (2.1.16) leads to? 
[top, H] = —ih. (2.5.11) 


This then implies that for any real numerical FE 


ag (;, Bt») Hexp (- 5% =H+E. (2.5.12) 


Now let Ao be in the spectrum of H. Whether Ap is in the discrete or in 
the continuous spectrum, given any € > 0 (see §1.8, (1.8.34)), we can find a 
vector |w-) such that 

( — do) bell <e. (2.5.13) 


Upon setting |¢-) = exp (} Etop) |We), (2.5.13) in turn implies that 


| lexp (; Bt») HA exp (- [Bt - 9 be 


or from (2.5.12) that 


<eé (2.5.14) 


Il ZZ — (Ao — £)] dell < €. (2.5.15) 


That is, Ao — FE is also in the spectrum of H for all EF. One may choose 
F, in particular, an arbitrarily large positive number implying that A is un- 
bounded from below — a physically unacceptable property of a Hamiltonian 
and for the stability of the system in question. That is, time is to be treated 
as a parameter and not as an operator. [Some authors have, nevertheless, 
suggested promoting time, under some circumstances, to the status of an 
operator, but we will no go into this here.] Special attention will be given 
later in §3.3, §3.4 to the boundedness of Hamiltonians from below and the 
corresponding stability problem. 


2.5.3 Construction of Hamiltonians 


We now proceed to construct Hamiltonians consistent with (2.5.2)—(2.5.4). 

To this end consider a system of particles with masses m,, associated po- 
sition and momentum operators X,q and Py, respectively, where a = 1,...,n 
(see (2.3.63)—(2.3.72)). The operators Xq, Py associated with different par- 
ticles commute. 

With X representing the position of the whole system, that is of the 
center of mass position operator, P its total momentum and M the sum of 
the masses, we here record the definitions 


3 A “relativistic version”: [P“, X”] = —ihg"” for which (g“”) = diag[—1, 1,1, 1], or 
[P", X”] = ihg”” for which (g””) = diag[1, —1,—1, —1], also gives (2.5.11) with 
P= H/c, X°= ctop and c denoting the speed of light. One, however, does not 
need to use this to obtain (2.5.11) and our result follows directly from (2.3.20) 
and the identification in (2.1.16). 
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Ma 
X= » ax (2.5.16) 
Peay Ps (2.5.17) 


M=)J ma. (2.5.18) 


In Table 2.2, we consider commutation relations of various combinations 
of the particles’ operators with PJ and XJ. 


Table 2.2. Commutation relations of various combinations of the particles’ 
operators with P’? and X’. 


UAE Aa) P |= a Reha) Aoet7)) 
= ind"? DX (dap — 5a) = 0 


fa 


2. [(Xa— Xp), X?] =U PL (Xa — Xp) Xp] = 0 
p 


3. [ (Xi — X*) , P'] => [Xi, P#] — ins =0 


6. [ (Pi — PA) .X!] =e (PE X4] - [P5, x2) 


From the Table, we see that Ay in (2.5.2), consistent with (2.5.3) and 
(2.5.4), may be chosen, in particular, a function of: (Ka — Xg), (Ke — X), 
Pa — (ma/M)P. 

That is, for a very large class of Hamiltonians consistent with (2.5.2)— 
(2.5.4), we may choose 


Pp? a 
H= M 2.5.1 
2M d. Mea ve (26212) 
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where Pe e 
V = S0 Vag (Ke — Xa) + 5) Va (Ka — X). (2.5.20) 
aXzZB a 
Additional conditions may be spelled out for V by invoking, for example, 
rotational invariance as given in entry 8 in Table 2.1. 


In the remaining part of this section, we work out some details of the class 
of Hamiltonians in (2.5.19) of physical interest. 


2.5.4 Multi-Particle Hamiltonians 


In detail, we may write 


5 (P= 2p ae | i Pp? 
yee) = Pe EP Pat Fy 
"PF? P? 
= d a oe (2.5.21) 


That is, the expression on the left-hand side amounts automatically in re- 
moving the center of mass motion, and (2.5.19) leads to a familiar expression 


H=)°-*4V (2.5.22) 


2Mo 


where V has the structure in (2.5.20). 
For example, for a system of charged particles of charges gq, and masses 


Ma, @=1,...,n, with Coulomb interactions, in the coordinate description, 
one has 2, : e 
P 1 dads 
H= a4 = ; (2.5.23) 
Doma 2 Pa — xa 


It is important to realize that the Hamiltonian (H’ = Hj): 


(Pe eB) 
HS (Pa=G#P) V 2.5.24 
2 Ting ue 
provides, and is appropriately referred to as, the internal energy as it has, as 
seen in (2.5.21), the center of mass motion removed. 


2.5.5 Two-Particle Systems and Relative Motion 


We consider the structure given (2.5.19) for n = 2, in a coordinate de- 
scription, and consider the motion of one particle relative to the other. To 
this end, we write 
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r= xX, — X2 (2.5.25) 
Se ine) (2.5.26) 
= —(m 1x, + m2x 5. 
wor 2X2 
It is easily verified that 

m 

Vi- ave =V; (2.5.28) 
m 

Vo- arV® =-V, (2.5.29) 


which directly lead from (2.5.19) to the expression 


Re ss —— 
A = —-— —_—vV 2.5. 
with i ‘ i 
Be sh (2.5.31) 


Lt My ms. 
and js defining the so-called reduced mass of the system of two particles. 
Here we note that since x} —R = mor/M, x2 — R = —mr/M, V(r) is 
some function of r. The first term in (2.5.30) describes the center of mass 
free motion. The remaining part describes the relative motion of the two 
particles. 


2.5.6 Multi-Electron Atoms with Positions of the Electrons 
Defined Relative to the Nucleus 
Consider a system of N electrons with 


Mae=m, a=1,...,N (2.5.32) 


and a nucleus of mass my+1 = ™o. 
That is, M4 = Nm-+ mp. We consider a coordinate description of the 
system. To this end we define the variables 


Yo =Xa—-Xn41, a= l,...,N (2.5.33) 
1 
R= ae (m (x1 +... + xy) + moxn41)- (2.5.34) 


It is readily obtained that 


Vx. — 47 VR = Vee a=1,...,N (2:5.35) 
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N 
m 
Vit = ar VR SNe Vs (2.5.36) 
a=1 


The expression in (2.5.19) then gives for the multi-electron atom in the co- 
ordinate description, the Hamiltonian 


2 
Pe seP. W 
H= & Py 
2M - Me 2m 7 2mo dX 
N N 
1 e? Ze? 
! (2.5.37) 
3 Ds ieee 2 Teal 


where Zle| is the charge of the nucleus. For a neutral atom N = Z. Here 
Py = —ifV,,,. Since mo, M > m, one, in practical applications, may neglect 
the first and the third terms on the right-hand side of (2.5.37). In any case, 


2 
N 

since P? and (x: Po are positive operators, that is, their expectation 
a 


values are non-negative, we have the following lower bound for atoms 


(b|A |b) > (b| Marl) (2.5.38) 


for the expectation values, where 


N D N 2 N 2 

P 1 e Ze 
Hap = y a S 5 : 2.5.39 
- fame " Z a#f Ita Fal fq Pal 


2.5.7 Decompositions into Clusters of Particles 


As a final application of the expression (2.5.19) for a Hamiltonian, we 
consider the class of two-body particle-particle interactions given by the first 
term in (2.5.20). The interest here consists in grouping the n particles into 
k, disjoint subsets of particles, which we refer to as clusters, with the first 
containing n, particles,..., and with the k*” containing n;, particles. 

The above grouping is most convenient if one is interested in studying the 
properties of the different clusters as separate entities. In a scattering exper- 
iment, for example, one may have, as a final state of the process, particles 
emerging (within experimental limitations) into localized clusters (with non- 
vanishing intra-clusteral interactions) which are widely separated one from 
the other (with negligible inter-clusteral interactions). [Intra-clusteral interac- 
tions refer to interactions occurring between particles within the same cluster, 
while inter-clusteral referring to interactions between different clusters, due 
to particle-particle interactions with the particles belonging to the different 
clusters.] That is, these clusters of particles emerge as separate entities and 
the properties of some or all of them may explored. 
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The positions of the particles in the clusters are conveniently labelled as 


{Xi1,---)Xini})---){Xk1,-+-)Xknz } (2.5.40) 


and their masses as 


{m11, +--+) Miny }y---s (MK. +s Men, }- (2.5.41) 


The center of mass positions of the clusters are then defined by 


1 
B=1 
with 
Ma = >) mop (2.5.43) 
B=1 
a=1,...,k, and M, denoting the sum of the masses of the particles in the 


a‘ cluster. For the center of mass position of the k clusters we have 
ic 
R= — MaRa 2.5.44 
7 d (2.5.44) 


with 
M — 


a 


My. (2.5.45) 


k 
=1 


1 PARTICLES WITH THE oy Nk PARTICLES WITH THE 
SUM OF MASSES 1, ar SUM OF MASSES /;,, 


\\ fy 
CENTER OF MASS POSITION Rj SW CENTER OF MASS POSITION Rz. 


Fig. 2.3. Grouping of n particles into k clusters of particles. Such a grouping 
is convenient for the study of properties of these clusters as separate entities. 


Upon defining the a” cluster total momentum operator 


Nes 


Py =>) Pap (2.5.46) 
B=1 
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and using the identity 


Na Mes 
SS” ( Pas — Po.] =0 (2.5.47) 
p=1 ( Ma 


one may write the Hamiltonian in question as 


2 
oP oye Poe Pe) Sh Pa HPy 
~ 2M 2mMag 2M. 
a=1 B=1 a=1 
+Vet+Va (2.5.48) 
where 
Ke) Rie 
Va = >> 3° Vay (Kap — Xe) (2.5.49) 
a=1 p4y 


k k na NB 
Ve =>) 3-303. Vay (Kay — Xpp) « (2.5.50) 
a=1 B=1 y=1 p=1 
(aA) 


The potential energies Vz and V4 are, respectively, responsible for the inter- 
clusteral and the intra-clusteral interactions. 


From (2.5.21) we note that, in reference to (2.5.48), one has the center of 
mass motion of an a” cluster conveniently removed in 


2 
Map 
— (Pas - Ma P..) _<S Pas _ Pi. 


. 2.5.51 
ye 2MNap De 2Mag 2M ( ) 
B=1 B=1 
Similarly, in 
k M. 2 k 2 2 
Py. — =#P P P 

S ( iP) = S a (25:52) 

= 2Ma = 2M, 2M 


one has the center of mass motion of the & clusters removed. 

At this stage, if one wishes, one may introduce variables as in (2.5.33), 
(2.5.34) in reference to each cluster, and also introduce such variables for the 
system of the k clusters. That is, one may set 


ree— ae Xan. PH lavyte—l (2.5.53) 


for a= 1,...,k and Ry as already given in (2.5.42). On the other hand for 
the system of clusters, one may set 


No =Ra—Re, a=1,...,k—-1 (2.5.54) 
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and R as defined in (2.5.44). Such variables may not always be the most 
convenient ones and different variables may be introduced instead. 

As an example, consider two clusters (k = 2) with the first consisting of 
one particle (n; = 1) and the second consisting of two particles (ng = 2). 
This may correspond, for example, to a situation in a scattering process 
involving a positron et and a hydrogen atom with the latter consisting of an 
electron-proton bound state. The variables (2.5.53), (2.5.54) in question are 


r=x1=Ri (2.5.55) 
Yo1 = Xo1 — X22 =F (2.5.56) 
n=R,-R»2 (2.5.57) 


and the following Hamiltonian 
h? ‘iar au 1 Ro aA 1 
Hie VE V- Vv; 
IM 9 €. a) B29 G mn) x 


+ Va(r) + Vi (n + my) + Ve (1 Me r) (2.5.58) 
2 


is of the form in (2.5.48), where Va(r) is responsible for the intra-clusteral 
interaction of the second cluster, and Vg = Vi + V2 is a potential energy 
responsible for the inter-clusteral interaction, that is, the interaction between 
the two clusters. Here M, = m1, Mz = mo, + mag. The decomposition in 
(2.5.58) may, for example, be convenient if Vg vanishes sufficiently rapidly 
for |n| — oo and, initially and/or finally in some scattering process, one is 
dealing with two non-interacting clusters. 

Other examples of clusters decompositions may be also carried out (see 
Problem 2.7) in a similar manner. 

In the appendix to this section, the time evolution of states given in (2.5.9) 
will be generalized to time-dependent Hamiltonians. 


Appendix to §2.5: Time-Evolution for Time-Dependent 
Hamiltonians 


The purpose of this appendix is to describe the time evolution of a state for 
time-dependent Hamiltonians. A state |¢,15,-) approaches the state |y,) for 
d5¢ — 0, and must coincide with (2.5.7) for a time-independent Hamiltonian. 
Quite generally we may then write 


lbrssr) = (1— ZorH(0) [de (A-2.5.1) 
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to first order in 7, where the Hamiltonian H(t) is assumed to be, in general, 
time-dependent. As in (2.5.8), (A-2.5.1) leads, by taking the limit 57 — 0, to 
the Schrédinger equation 


3) 
ina. be) = A(t) |ve) . (A-2.5.2) 


To integrate (A-2.5.2), we denote the initial state |¢9) by |). To this end, 
we have by an elementary iterative formal procedure 


Wt) = v) 


wy sty (Z) fae mete) 
v4 (Z) fen mens (ZB) fae [Wan meneten ie) 


and finally obtain 


we) = |) +d (F a fam fe “atm frat H(tn)H(tn—1)...H(t1) |?) 


nel 


(A-2.5.3) 
where we note that 
Bi Se (A-2.5.4) 
and the important ordering of the operators 
A (ty) A(tn-1)... 1 (te) (t1) (A-2.5.5) 


in (A-2.5.3) from right to left corresponding to the ordering in (A-2.5.4). 
To rewrite (A-2.5.3) in a more manageable form, we introduce the chrono- 
logical time ordering operation defined by 


(A(t) H(t), = (A()H(t))+ = A) A(t) (A-2.5.6) 
if t >t’, and more generally 
(H(t)... H(ty))4 = (H (tis). H(ti,))4 
= H(t,)...H(t,) (A-2.5.7) 


if ty, Stn-1 >... > ti, and {t;,,...,t;, } is any permutation of {t1,...,tn}. 
From the definition in (A-2.5.6), we have the following equality 
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| ae i: a (A(t2)H(t1))4 = i; ae | nats H(t2)H(t1) 
+ fae [oa H(t,)H(t2) 


=2 | dts | dt; H(ts)H(t,;)  (A-2.5.8) 


and more generally from (A-2.5.7) 


[ots [ate ie fat aH tn)... H(t1)) + 
=nt fata dij asf dt, H H(ty). (A-2.5.9) 


Equation (A-2.5.9) allows us to rewrite the solution in (A-2.5.3) as 


Wy = ee" Me fate [ dtp. [ats cme H(t) + |) 


n 


ay im (ke a(t) . (A-2.5.10) 


le) = (eo; [ae A(t )) (A-2.5.11) 


generalizing the solution in (2.5.9) for time-dependent Hamiltonians. 
We note that the adjoint transformation of (A-2.5.7) leads to 


[(H(t1)...H(tn)),]' = [(A (ti) --- H(ta,)) 4]! 


= H(t1)... H(tn—1)H(tn) (A-2.5.12) 


if t; < ... < ty_1 < ty. Thus we may introduce the chronological time 
anti-ordering operation defined by 


(H(ty)... H(tn))_ = (H(ti,) ... H(ti,))_ 
=A). tea) A (ta) (A-2.5.13) 


if ty <... <tp_1 < ty, and {t;,,...,¢;,,} is any permutation of {t;,...,tn}. 
From (A-2.5.11)—(A-2.5.13), we then have for the adjoint of the time evo- 
lution operator 


(oj fa ne) | = (ow; fev H(t) (A-2.5.14) 
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2.6 Discrete Transformations: Parity and Time Reversal 


In this section, we consider two discrete transformations. The first consists 
of the operation of reflecting the sense of every spatial coordinate axis. This 
amounts to the transformation x > x’: 


x’ > —x (2.6.1) 


as a reflection through the origin of the space coordinate system. The other 
is the reversal of the direction of time flow. We will see that on the physical 
states, a space reflection is implemented by a unitary operator P, referred 
to as the parity transformation, while time reversal is implemented by an 
anti-unitary operator which we denote by 7. The problem encountered in 
trying to represent P by an anti-unitary operator rather than a unitary one 
will be discussed explicitly below. The analysis of the problem encountered 
in representing J by a unitary operator rather than by an anti-unitary one is 
similarly carried out and is left as an exercise to the reader (see Problem 2.8). 
A third discrete transformation, referred to as charge conjugation, denoted by 
C, will be discussed later in Chapter 16. As mentioned earlier, if one combines 
special relativity with quantum mechanics, leading to what is called quantum 
field theory, then one may establish that the combined transformation CPT 
is asymmetry transformation based on such a merger. 

A point Q in a given (right-handed) coordinate system labelled by x, will 
be labelled by x’ in a (left-handed) coordinate system arising from the initial 
one by reflecting the sense of every spatial coordinate axis (see Figure 2.3). 

For infinitesimal Galilean transformations, we recall that (see (2.1.15), 
(2.1.16)) 


bx = da+ dvt+ dw x x (2.6.2) 
St = 57 (2.6.3) 


where Sw is defined in (2.1.6). For the combined Galilean transformation 
and space reflection, for example, we have 


(6x)/ = —5a — dvt — Sw x x (2.6.4) 
(Ot) =F (2.6.5) 


which amount to the replacements, 5a — —da, dv — —dv, dw — dw, 
b7 > Sr. 

With the change of description given in (2.6.4), we associate an operator 
P, operating on physical states, whose nature will be soon established. 

In reference to Wigner’s Theorem (§1.9), to establish whether P should be 
anti-unitary or unitary, we consider the case in (2.6.4) with a = 0, 5v = 0, 
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(b) 


Fig. 2.4. A point Q labelled by x in the right-handed coordinate system in 
(a) and is labelled by x’ in the left-handed coordinate system in (b) obtained 
from the initial one by reflecting the sense of each of its coordinate axes. We 
note that 0 = 7— 60’, 6 = ¢ — a, x = —x’, or equivalently 0’ = a — 0, 
og =o4+7, x’ =—x. 


5w =0 first. Since the operation consisting of a space reflection followed by 
a time shift 57 in one’s clocks readings and the operation of a time shift 57 
in one’s clocks readings followed by a space reflection are equivalent we may 
infer that 


(1 _ ort) Pap € 3 1 5r(H + aa)) (2.6.6) 


up to a phase factor specified by an additional structure (1/h)5y1, with 
5y = —cdr, as appearing on the right-hand of (2.6.6), with c some real 
constant. This equality implies that 


iHP = Pi(H + cl). (2.6.7) 


If P is anti-unitary, that is, in particular, it complex conjugates numerical 
factors, we obtain after re-arrangement of terms 


PHP | =-(H +l). (2.6.8) 


A Hamiltonian, in general, is unbounded from above. This is true even for a 
free particle. Equation (2.6.8) says that for every eigenstate |7) of H with 
a given non-negative energy E arbitrarily large, H has an eigenstate P |y) 
with corresponding energy —(£ +c). With F arbitrarily large, this in turn 
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implies that the Hamiltonian is unbounded from below and is physically un- 
acceptable. The latter argument may be made rigorous by stating that for 
any € > 0, however small, 


|(H — E)pell <e (2.6.9) 


if and only if 
l[H + (EZ + ¢)]Pyel] <e (2.6.10) 


implying the unboundedness of the spectrum of H from below for arbitrarily 
large non-negative E. That is, P must be unitary. 

With P as a unitary operator, we consider the more general cases corre- 
sponding to (2.6.4), (2.6.5). 

Since the operation consisting of a space reflection, followed by an infini- 
tesimal Galilean transformation, specified by the parameters da, dv, 57, dw, 
is equivalent to first performing the infinitesimal changes —da, —dv, 57, dw, 
and then a space reflection, we may infer that 


[14+ 5 (0a-P+5v-N— Sri +5w-3)|P 


=P lit zt Sa-P—5v-N Sri + 8a -3)| (2.6.11) 


where we have dispensed with a phase factor specified by an additional struc- 
ture (1/h)5y1 within the square brackets on the right-hand side (see Prob- 
lem 2.9). 

Unitarity of P then implies from (2.6.11) that 


HP —PH=0 (2.6.12) 
JP —-PJ=0 (2.6.13) 
PP +PP=0 (2.6.14) 
NP +PN = 0. (2.6.15) 


From (2.6.14), (2.6.15) and (2.3.21) we may also infer that 
XP +PX =0 (2.6.16) 


as expected. 

If parity is not asymmetry transformation, such as in beta decay (a weak 
interaction process), the Hamiltonian is so constructed to reflect this fact by 
ensuring, in particular, that HP — PH F 0. 

A very similar analysis to the one given through (2.6.6)—(2.6.10) implies 
that time reversal is implemented by an anti-unitary operator rather than 
by a unitary operator (see Problem 2.8). 
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The infinitesimal Galilean transformations in (2.6.2), (2.6.3) followed by 
the reversal of the time translation t — 57 — —t+ 87 give 


(8x)’ = a+ dvt+ bw x x (2.6.17) 
(8t)’ = —5r (2.6.18) 


where dv — —dv, dw — dw under the reversal of the direction of time flow. 
We note the invariance of the product dvt. 

By using the identification of the position operator in (2.3.21), we spell 
out an infinitesimal Galilean transformation, at a given time t, as 


1+ = [(6a + tv) -P — Mbv-X — brH + bw- J). (2.6.19) 


Since the operation of time reversal followed by an infinitesimal Galilean 
transformation specified by 5a, dv, 57, dw, at a given time t¢ as the latter 
is shown in (2.6.19), is equivalent to an infinitesimal Galilean transformation 
specified by 5a, —dv, —d57, 5w, at time —t, followed by a time reversal, we 
obtain 


(14+ 5 [(6a+ t6v)-P Mbv-X—brH +8 -3]) T 


=T (1 + (5a + tv) -P+ Miv-X+5rH +bw- 3) (2.6.20) 


where as for the case of the parity transformation in (2.6.11), we have dis- 
pensed with a phase factor contribution in (2.6.20) (see Problem 2.9). It is 
worth recalling the complex conjugation nature of numerical factors of the 
operator T. Accordingly, (2.6.20) leads to 


HT -TH=0 (2.6.21) 
JT7+T7J=0 (2.6.22) 
PT+TP=0 (2.6.23) 
and 
({P — MX)T + 7T(tP + MX) =0. (2.6.24) 


The letter two equations then give 
XT —-TX=0. (2.6.25) 


We note that the operator J does not operate on the real parameter t in 
(2.6.24). 
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2.7 Orbital Angular Momentum and Spin 


In this section we investigate the nature of the generator J (see (2.3.13), 
(2.3.43)) of space coordinate rotations. Important commutation relations in- 
volving J are given in Table 2.1 below (2.3.15) and in (2.3.24), where we recall 
that EF = 0, B = 0 in the Table. From these commutation relations, and from 
(2.3.21) and (2.3.20) it is not difficult to derive that for infinitesimal Galilean 
transformations 


5J = (a+ tiv) x P— Mdv x X+5w x J. (2.7.1) 


On the other hand from (2.3.22), (2.3.27) 
P 
5X = (da+ tdv) + dw x X — 675 (2.7.2) 


and from (2.3.20), Table 2.1, 
5P = Miv + dw x P. (2.7.3) 


From the latter two equations or directly from (2.3.20) (see Problem 2.10), 
one has 


5(X x P) = (a+ tdv) x P— Mév x X + dw x (X x P) (2.7.4) 
where we note, in particular, that 
[(H,X x P] =0. (2.7.5) 


As a matter of fact X x P satisfies the same commutation relations in Ta- 
ble 2.1 and in (2.3.24) as J. Hence upon the comparison of (2.7.1) with (2.7.4) 
we conclude that J has the very general form 


J=(XxP)+S (2.7.6) 


where the operator S necessarily, as is easily established, satisfies the com- 
mutation relations 


[PS | =0 (2.7.7) 
Lees? |=0 (2.7.8) 
[H, S*] =0 (2.7.9) 
[S* 6? eee S*, (2.7.10) 


The operator 
XxP=L (2.7.11) 
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denotes the familiar orbital angular momentum. On the other hand S in 
(2.7.6) is a translational independent contribution to J and is referred to 
as the spin or the internal angular momentum. We note, in particular, that 
(2.7.7), (2.7.8) imply that 

[E58] 0: (2.7.12) 


To obtain further insight into the nature of the spin operator S, we con- 
sider the transformations of a Galilean scalar (field) and a Galilean vector 
(field) under a rotation of the coordinate system. 

A Galilean scalar (field) ~(x), under a rotation of the coordinate system, 
is defined by (see (2.1.1)) the transformation law 


(Rx) = U(x) (2.7.13) 


or by 
w(x) = o(Ro*x). (2.7.14) 


Upon using (2.3.43), and (2.1.26), (2.1.34), (2.1.43) we obtain from 
(2.7.14) 


((ex ~w 5) v) (x) = exp [w+ (x x V)] U(x) (2.7.15) 
thus leading to the representation 
J =x x (-ifV) (2.7.16) 
and 
S=0 (2.7.17) 


reflecting the spin 0 character of a scalar (field). 
A Galilean vector (field) ®(x) under the rotation of the coordinate system, 
is defined by the transformation law4 


6"(Rx) = RYO! (x) (2.7.18) 


or by Pe 
6" (x) = RY @)(R7'x) (2.7.19) 


where, in the same way as the vector x itself (see (2.1.1)), the presence of the 
matrix (R’’) on the right-hand sides of (2.7.18), (2.7.19) reflects the vector 
character of ®. Of course, we have to check that the spin associated with 
® is indeed equal to one which is the spin content of a vector (field). The 
presence of the matrix (R’’), in turn, implies the invariance of an amplitude 
under a rotation of the coordinate system: 


* Vector fields also arise as a particular case within a spinor analysis context as 
given in §2.8. 
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3 3 
S7 (op) = 3° (@4y") (2.7.20) 
i=1 i=1 
where we have used (2.1.3), and (2.1.36) for the evaluation of the Jacobian 
of the transformation which is equal to one. 
From (2.1.26), (2.1.34), (2.1.43), we have for (2.7.19) the explicit expres- 
sion 


&'(x) = exp [i- . (x x (—ihV) + AM)] ®(x) (2.7.21) 
leading from (2.3.43) to the representation 
J=x x (-ihV)+hM (2.7.22) 
and hence to 
S=iM (2.7.23) 


where the matrices M/ are defined in (2.1.17), (2.1.28)—(2.1.30), (2.1.32). In 
particular, 
S? = 2h1 (2.7.24) 


which upon using the definition that for a particle of spin s (see §5.1, §5.4) 
we must have S? = fi?s(s + 1)1, (2.7.24) then gives s = 1, thus verifying the 
spin content of ®. 

Now we turn to a system of n particles. Using the definitions given in 
(2.5.16)—(2.5.18), we rewrite (2.7.6) as 


J= 50 (Xa X Pa + Sa) (2.7.25) 
a=1 


with X and P given in (2.5.16), and (2.5.17), respectively, and S is to be 
determined. J may be rewritten in the form 


J=XxP+)  [(Ka—X) x Pa+Sal (2.7.26) 
a=1 


On the other hand since 


“Me 
—(X,-X)= 2.7.2 
a7 Ko -X) (2.7.21) 
and hence - 
Mea 
— (X,-X P= 2.7.2 
Do G7 Ka -X) x (2.7.28) 


one may finally rewrite (2.7.25) as 


J=XxP+ s [Xa _X)x (Ps = “2P) 4 Ss. (2.7.29) 


a=1 
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This leads to the following expression for the total internal angular mo- 
mentum S of the system of particles as one composite object made up of the 
n particles: 


S= 3 [(Ka—X) x (Pa - 7eP) +S. (2.7.30) 


with (X,. — X) denoting the operator for the position of the a!” particle 
relative to the center of mass, and (Py, —m,.P/M) denoting the operator 
of its momentum as determined in the center of mass. The first term in the 
summand in (2.7.30) represents the orbital angular momentum operator of 
the a” particle in the center of mass system. The internal angular momentum 
S in (2.7.30) is the total angular momentum of the system of particles relative 
to the center of mass. It is easily checked from (2.7.30) that 


PPPs) Os. ee) 0 (2.7.31) 


as they should be for a proper definition of spin (see (2.7.7), (2.7.8)). 

Consider the decomposition of J in (2.7.25) into k F 1 clusters (see §2.5), 
rather than one, as in (2.7.29), (2.7.30), composed, respectively, of n1,..., 
particles. To this end, it is more convenient to introduce the operators Xqg, 
Pas, Xa-, X, Pa-, P where 


Na Map 
aa Xo re 
ay Ta (2.7.32) 
B=1 
BOM. 
X= = 2X. 2.7. 
ey Ar (2.7.33) 


denoting, respectively, the position operator associated with the center of 
mass of the a‘” cluster, and the position operator of the center of mass of 


k 
the k clusters with the latter coinciding with the center of mass of the )7 nq 


particles. The operators Pyg, Pa., P and the masses mag, Ma, M are defined 
in §2.5. In particular, M,, denotes the sum of the masses of the particles in 
the a‘ cluster. 

Suppose S,g denotes the spin of the 3" particle in the a‘” cluster. Then 


k ne 
J=S°S° (Kags X Pas + Sap) (2.7.34) 
a=1 pa1 
may be rewritten as 
B M, 
J=XxP+)> | %. —X)-x (P.. = TP) + Sa) (2.7.35) 


a=1 
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where 


— — a — Map 
=i [Kas — Xa.) x (Pas — “22 a.) +80] (2.7.36) 


is the total internal angular momentum of the a‘” cluster as a composite 
object made up of nq particles. The following commutation relations again 
hold true 
bes s?,| =0, eae 25, (2.7.37) 
Also 
bes Si] = 0, si -~0 (2.7.38) 


for all 7,7 = 1,2,3. 
For a two cluster decomposition (k = 2), consisting, respectively, of n1 
and nz particles, J in (2.7.35) may be rewritten as 


2 


J=XxP+>- ce —X)x (P.. = ~P) +5.0| 


a=1 


=X x P+L,+S8q) + 8,) (2.7.39) 


where L,. is the total orbital angular momentum of the two clusters residing 
in their center of mass given by 


= Yk —X) x (P.. - =P) 


(2.7.40) 


= (Xj. — X2.) x (Sa — a) 


M,+ Me 


and L, + S$) +S a) is the internal angular momentum of the two clusters. 
Pertinent properties of the angular momentum operator, in general, as 
well as of the orbital angular momentum operator and spin, in particular, 
will be studied in detail later in Chapter 5. Arbitrary spins are also studied 
in §2.8. 
Finally we note from the transformation law (2.3.43) of a state |¢) under 
a coordinate rotation 


|:") = exp (pen i 5) I) , (2.7.41) 


in a coordinate description, we have 


wie) = (ex (jen 5) ») (x) (2.7.42) 
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and from (2.7.6), (2.7.11), (2.7.12) and the identity (2.1.43) 


Wiad) = (ew (Fon-8) ¥) @) (2.7.43) 


where x’ = Rx, with R defined in (2.1.26). Note that in (2.7.42), (2.7.43) 
the argument of ~ is x when the right-hand side of the equation of its 
transformation law involves J, while it is x’ when it involves only the spin S, 
respectively. 


2.8 Spinors and Arbitrary Spins 


In generalizing the concept of spin 0 and spin 1 studied in 82.7, we intro- 
duce the concept of a spinor. Spinors are essential objects needed to describe 
spin 1/2 and, more generally, half-odd integral spins. One may even describe 
integer spins by using these curious objects as we will see below. 


2.8.1 Spinors and Generation of Arbitrary Spins 


We begin by defining on how a spinor transforms under a coordinate 
rotation. Spinors of so-called of rank 1 and their transformation rule under 
a coordinate rotation are introduced from first principles in the appendix to 
this chapter. At a first reading, one may skip the reading of this appendix. 
The burden of this section is to see how these mathematical entities together 
with their underlying definitions describe arbitrary spins. 

A spinor (field) of rank k, w*1--**(x), depending on k indices a1... ax 
each taking the possible values 1 and 2, has the following transformation law 
under a coordinate rotation by an angle y about a unit vector n: 


: aby : a4 dk 
Qe = (exp [sen . o|) st (exp [sen . o|) apr Pk (x) 
(2.8 


where a summation over the repeated indices b,,...,b% is understood. In the 
notation of §2.1, x’ = Rx. Here o = (01,02,03), where 01, 02, 03 are 2 x 2 
matrices introduced by Pauli 


o1= 4 , O2= G ) , O3= € :) (2.8.2) 


which together the unit matrix 
10 
- (39) ass 


constitute a complete set of matrices in the vector space consisting of complex 
2 x 2 matrices. That is, any element in such a vector space may be written 


122 2 Symmetries and Transformations 


as a linear combination of the matrices 1, 01, 72, 03. [-]?° denotes the matrix 
elements of the matrix in question. 

In the sense of §2.7, spin 0 (see (2.7.17)), is described by a spinor of rank 
0. To describe arbitrary spins, we are interested in symmetric spinors of rank 
k. That is, spinors w*--* (x) which are completely symmetric in the indices 
Q1...Qk. 

The matrix exp(ipn-o/2) in (2.8.1) may be rewritten in the simple form 


1cos . +i(n+o) sin - (2.8.4) 


To see this note that 


i 1 cos - +i(n-o)sin | = sn :o [2 cos . +i(n-+o)sin . (2.8.5) 


where we have used the useful identity 
aio; = 6% +iet* oy. (2.8.6) 
Upon integration of (2.8.5), we obtain 


exp (ifn . c) = 1cos . +i(n-o) sin? (2.8.7) 


with the boundary condition 


=1. (2.8.8) 


exp (ifn : c) 
y=0 
[Another way of deriving (2.8.7) is given in Problem 2.13.| 
As the indices a1,..., ax, take on the values 1 or 2, acompletely symmetric 
spinor ~*-** (x) define (k+1) functions which we denote by w9(x),..., Wx (x) 
and, in particular, we may choose 


wit tl (x) = dy (x) (2.8.9) 
p?--22(x) = Yo(x). (2.8.10) 
For arbitrary a1,...,a,% taking the values 1 or 2, we may expand w%-“* (x) 
in terms of these functions. To this end, let 2 of the indices a;,...,ax take 


on the value 1 and the (& — 7) remaining of them take on the value 2. We 
introduce the orthonormal set consisting of the objects 
C1 Ok (2) = 


(Ore ag OPT One”) (2.8.11) 


permut. 


1 
Jkil(k — 3)! 


with i=0,1,...,k. Here permut. stands for a summation over all permuta- 
tions of {a ,...,a,}. The orthonormality property reads: 
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Ct ()OM-% (i’) = Oj. (2.8.12) 


The orthogonality property for i 4 7’ is obvious since the number of ones 
(and hence also the number of twos) will be different in this case. The overall 
normalization factor in (2.8.11) arises in the following manner. Out of the k 
permutations of {a1,...,a,}, i! (k — 7)! are redundant. Hence we may write 


GEM, Ape ae” 58) 


permut. 


/ 


SFR a) (6 en COME 5 OR") (2.8.13) 


permut. 


/ 


where (anu OTS M412 OP) eeatt stands for a summation over all non- 


redundant permutations. The latter will consist of exactly CP = k!/i!(k — 1)! 
orthonormal terms. That is, 


Die... On se OOO 8) 


permut. 


x (6a1 gadgets? 5x2) 


= (i(k -a)?CF=kal(k—a! (2.8.14) 


leading to the expression in (2.8.11). 
Hence we may rewrite 


k 
apart (x) 2 S- C1 Ok (1) i(x) (2.8.15) 
1=0 
with, in particular, 
C1 Ok (k) = pul "ef Pelee (2.8.16) 
Yer tk (0) = 12 niet Sak (2.8.17) 


to be compared, respectively, with (2.8.9), (2.8.10). 

The structures C%!-¢*(k) in (2.8.11) may be rewritten in a more conve- 
nient form as follows. We introduce two independent variables gi, gz. Then 
it is not difficult to see that (2.8.11) may be rewritten in the equivalent form 


1 a a 
Ski (k— i)! 89a, Aga, 


At this stage, it is more convenient to introduce the parameters s and m 
defined by® 


C21--48 (7) (91)'(go)**. (2.8.18) 


> To simplify the notation in what follows, we use the symbol m rather than the 
more common one ms in (2.8.20). 
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ge ED yes (2.8.19) 
m=i-—s, +=0,1,...,k. (2.8.20) 


Hence s and m take the possible values 


1. 3 
oS lyse (2.8.21) 
m=-—s,—-st+l,...,s—1l,s. (2.8.22) 

We may then rewrite (2.8.15) as 
par-29(x) = $2 C%-2*(5 4+ m)b(x,m) (2.8.23) 


m=—s 


where w(x,m) = Ws4m(x) in a coordinate description. 
From the orthonormality condition (2.8.12), we have the unitarity of the 
transformation in (2.8.23) 


S- Jo’ (qptr-225 (x)) ptt a2s (x) 


Q1,.--,€2s=1,2 


= a Jo W* (x, m) (x, m) (2.8.24) 
and, in particular, 


l= SS |]y22-- 22 2 ss Jos [ab(x, m)|? (2.8.25) 


Q1,---,42s=1,2 m=—s 


when normalized to one. 
More generally, we may write instead of (2.8.23) in general 


api 428 = oD (41-925 (s+ m)w(m). (2.8.26) 


m=—s 


The matrix in (2.8.7) is easily worked out, by using in the process (2.8.2), 
to be 


i in?) sin $ 


I 


a) . 3 . 2) . 
i cos 5 + In sin 5 i( 
exp gD 


i (nt + in?) sin€ cos § — in® 


in 2 
sin $ 


=A (2.8.27) 
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where n = (n*, n?, Ww). The inverse of the matrix A is obtained by replacing 


yp by —y@ in (2.8.27): 


cos € — in® sin £ i(n’ — in?) sin $ 
2 eee ; (2.8.28) 
—i(n'+in?)sin€ cos $ + in? sin $ 
The transformation 
Aart | Ar2sbasgbr--.b2s (2.8.29) 


corresponding to the right-hand side of (2.8.26) may be explicitly carried out. 
To this end, we introduce two new variables h;, h2 defined by 


ha = 9 [A?]"* (2.8.30) 
leading to 
ae 
[A] a. Oh, (2.8.31) 
Ga = hy [A]. (2.8.32) 


Accordingly, from (2.8.1), (2.8.18)—(2.8.20), (2.8.26), we have 


8s 


Ale1et .. [4]2sb2s qjb1--b25 — 1 
4 4) ¥ 2 ae 


Q a stm s—m 
ead Aye Ale? 28. 
oh oh ee) Rela) v(m)) (2.8.33) 
Upon using the elementary binomial expansion 


(a+b)"= 5° — _(a)t(by'4 (2.8.34) 


‘= a(n — 9) 
for any non-negative integer n, as applied to 
s+m s+m 
(hel AJ)" = (ay [A] + ha Ap24)** (2.8.35) 


(h,[A]©)*"™ = (Ay [A]?? + he[A]??) > (2.8.36) 


the right-hand side of (2.8.33) may be rewritten as 


S- C%1-2°(g + m)q(m) (2.8.37) 


m=—s 
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where 


w(m) = Dy V(s+m)l(s — m)l(s + m/)\(s — m’)! 


s+m—q)'\(m' —m+q)!q'!(s — m! — gq)! 


" (jaysyt 4 (ao ({A]?2)2 (ay2)-" wm’) (2.8.38) 


giving the transformation law for ~)(m), and the q-sum is over all non-negative 

integers q for which the arguments of the factorials involving q are non- 
negative integers. 
Upon writing 

v(m) = (s,m|~) (2.8.39) 

the spin operator S is defined through (see, in particular, (2.7.43), (2.8.23), 
(2537 36.1), 


é i 
(s,m) = (s,m| exp (Fen . s) \~) (2.8.40) 
which from (2.8.38) gives 


(s,m exp (-en-s) = SS V(s + m)l(s — m)l(s + m')I(s — m’)! 


s+m-—q)\(m' —m+q)!q'!(s — m’ — g)! 


q,m! 


y (jay ot ([aj22)™ —™*4 ([A]!2)2 ([A}22)°" 9 (s,m! 
(2.8.41) 


By differentiating the latter with respect to y, using the expression for 
the matrix elements [A]? in (2.8.27), and setting y = 0, one easily obtains 


(s,m[n-S= (n} + in2)5 /(e— my(o +m +1) (s,m + 1| 


+ (n' in*)5V(s tm)(s—m-+1)(s,m—1| 


+ n?hm(s,m|. (2.8.42) 


Hence, in particular, 


S*|s,m) = 7c m)(s +m-+1)|s,m+1) 


+ 2 Met mye— m+) |, —1) (2.8.43) 


S?|s,m) = in Js m)(s +m+1)|s,m+1) 
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| iA Vs t m)(s —m-+1)|s,m—1) (2.8.44) 
S3|s,m) =hm|s,m). (2.8.45) 


Upon applying the operators $1, $7, S° to (2.8.43)-(2.8.45) and using 
the latter equations all over again, one immediately obtains 


S? |s,m) = h?s(s + 1) |s,m) (2.8.46) 
[S*, $4] |s,m) = ifle™*S* |s,m). (2.8.47) 


Using the definition that a particle is of spin s if S? = h?s(s + 1)1 (see 
5.1, §5.4), (2.8.46) establishes the fact that the symmetric spinor 7)°!'%s is 
associated with spin s, and that the components of the spin along the °-axis, 
as dictated by (2.8.45), are given by —fis, h(—s+4+1),..., h(s — 1), hs. The 
spin matrix [(s’,m’|S|s,m)] for any spin may be readily constructed from 
(2.8.43)-(2.8.45). 

Equations (2.8.45), (2.8.46) also lead, respectively, to 


A(m — m’) (s',m'|s,m) = 0 (2.8.48) 
h? [s(s + 1) — s'(s’ + 1)] (s', m'|s,m) =0 (2.8.49) 

establishing the orthogonality (orthonormality) of the eigenstates |s, m) 
(s',m'|s,m) = 5s',s6m/m- (2.8.50) 


With the normalization condition in (2.8.25), w(x,m) denotes the 
wavefunction of a particle of spin s. In a matrix notation ¢(x) = 
(w(x, s),-.., W(x, —s))', one has, from (2.8.1), (2.8.23), (2.8.38), (2.8.40), the 
transformation law in (2.7.43) with the matrix elements of exp (ign-S/h) 
given below in (2.8.51), as directly obtained from (2.8.41), that 


(s,m| exp (pen : s) |s’, m’) 


— Oss! 


JV(s+m)!(s — m)!(s + m’)!(s — m’)! 
| 


m+ q)!q'!(s — m! — gq)! 


53 ([ajyet4 ({aj2)"—™"*4 ({A]!?)! ({ay?2)°-"" 8 (2.8.51) 


where the sum is over all non-negative integers q such that the arguments of 
the factorials are non-negative integers. 
For example, for n = (0,1,0), we have from (2.8.27), (2.8.51) 


(ssmlexp (05%) Is!) 
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JV(s+m)l(s — m)!(s + m’)!(s — m’)! 


= 5s! 7 (stm —q)l(m! — m+ g)lq/l(s — m' — 4g)! 
2s—2q+m—m’ Qqt+m’/—m 
ry ee in? 
x (-1) (cos 4 (sin 4 : (2.8.52) 
For spin s = 1/2, 
(x) = pea € + m) u(x,m), a=1,2 (2.8.53) 
Ca 5" >. Coy So" (2.8.54) 
and the spin matrix is from (2.8.43)—(2.8.45), 
h 
S= 5° (2.8.55) 


and, for example, from (2.8.52) 


A cos$ sin $ 
lexp 281] = (2.8.56) 
—sin€ cos$ 


From (2.8.53), (2.8.54), the spinor may be written in a column matrix form 


wee) = (Be | = (g) ven +172) + (7) oe-1/2. 87 


For spin s = 1, 


s 


puer(x)= S$ C%2(1 + m)y(x,m) (2.8.58) 


m=—1,0,1 


and from (2.8.18) or (2.8.11): 


CMe (141)= fart geal (2.8.59) 
Ome (1 4 0) = S Ce om ab port) (2.8.60) 
Cee _ 1) = 6212 6422 (2.8.61) 


and the spin S = (S', $?, 9°) has from (2.8.43)—(2.8.45) the components 


010 
SSO. (2.8.62) 
v2 \o10 


or 
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fost a 
Ss? =i—_|1 0 -1 (2.8.63) 
v2 \o 1 0 
10 0 
S?=h|00 0 (2.8.64) 
00-1 


in the diagonal representation for S°. For (2.8.52), we have 
(1+cosy)/2 (siny)/V/2 (1—cosy)/2 
lexp Zs" = | —(sin y)//2 cos y (siny)/V/2 |. (2.8.65) 


(1—cosy)/2 —(siny)//2 (1+ cosy)/2 


We will recover the structures (2.8.59)—(2.8.61) in §5.5 when adding two 
spin 1/2’s. It is remarkable that the system (2.8.18)—(2.8.20) provides the 
general solution for describing a spin s out of 2s spin 1/2’s and generalizes 
the expressions (2.8.59)—(2.8.61). 

As mentioned before spin 0 may be described by a spinor of rank 0. That 
is, it transforms as 


v(x!) = v(x) (2.8.66) 


(see also (2.7.13)). It may be also described by a second rank anti-symmetric 
spinor ®“!%2(x) (see Problem 2.15). 


2.8.2 Rotation of a Spinor by 27 Radians 


An intriguing result which distinguishes half-odd integral spins (s = 
1/2,3/2,...) from integral spins (s = 0,1,2,...) is that under a rotation 
of a coordinate system by 27 radians, about any axis, thus bringing us back 
to the same initial situation, a half-odd integral spin spinor, is read to ac- 
quire an overall minus sign multiplied by the initial spinor. This relative 
phase change for such spin values is physically observable and will be dis- 
cussed later in §8.12. Particles with half-odd integer spins are referred to as 
fermions, while particles with integer spins are referred to as bosons. 

To see how this phase change occurs put y = 27 in (2.8.7) to obtain 


2 
exp (Fn . °) =1cos7+i(n-o)sinz = —-1. (2.8.67) 


Hence (2.8.1), with k = 2s, leads to 


op! t1 428 (x) = (—1)P2 ptt 92 (x) (2.8.68) 
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establishing the relative phase change which results under a 27 rotation for 
s = 1/2,3/2,.... Only through a rotation by 47 (or by integer multiples of 
4) a +1 phase is obtained for half-odd integral spins! 

For the matrix element (2.8.51), we also have 


2 
(s,m| exp Ga . s) |s’,m’) = (—1)??8 50mm (2.8.69) 


and hence also for the wavefunctions (see (2.8.23), (2.8.38)) 


w' (x,m) = (—1)?8¢)(x, m). (2.8.70) 


2.8.3 Time Reversal and Parity Transformation 


We next consider the time reversal operation for arbitrary spins. To this 
end, consider first the situation for spin 1/2. 

In a vector space generated by 2 x 2 matrices, the most general expression 
for the time reversal operator T for spin 1/2 is 


T = (la+in-o)K (2.8.71) 


expanded in terms of the complete set of 2 x 2 matrices 1, o;, 1 = 1,2,3, 
where a and 0 are, in general, complex numbers, n is a unit vector, and K is 
the complex conjugation operation to ensure, in particular, the anti-unitary 
nature of T (see §2.6). The latter must satisfy the anti-commutation relation 
in (2.6.22) with the spin operator S in (2.8.55). That is, we must have 


(la +ibn- o)o!* + 0/(1la+ibn-o) =0 (2.8.72) 


for 7 = 1,2,3. It is easily checked that the solution consistent with (2.8.72) 
is 


a=0, n= (0,1,0) (2.8.73) 


1.e. 


T = ibook. (2.8.74) 


The anti-unitarity nature of T then implies that |b]? = 1, where we note, 
in particular, the reality of the matrix (iog). That is, up to an overall phase 
factor, b, which we define for a basic spin 1/2 state to be one, (2.8.74) gives 


T = (io2)K. (2.8.75) 
For two successive time reversals, we obtain 
T*=-1, (2.8.76) 


A spinor then transforms as 
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'*(x) = K[io2]?p?(x) (2.8.77) 

under time reversal. Using the identity 
fig]? = 6275? — 927.5%" (2.8.78) 


we obtain, from (2.8.53), the following transformation law of a spin 1/2 wave- 
function 


w(x, +1/2) > w*(x, -1/2) (2.8.79) 
w(x, —1/2) w* (x, +1/2) (2.8.80) 
causing, in particular, a spin flip. 


For an arbitrary spin s, we define from (2.8.77), the following transfor- 
mation law under time reversal 


pre (xe) = K fig]. fira]tarbeeapbo-be (x) (2.8.81) 


where an overall complex conjugation operation is retained to ensure the 
anti-unitarity nature of the time reversal operation. 

To see how a wavefunction 7(x,m) (see (2.8.23), (2.8.25)) transforms 
under time reversal, it is easily proved from (2.8.18), (2.8.78), that 


[io] on [ing] 2222 C1 bas (s 4 m) 
= (TP Ott 28g 9), (2.8.82) 
Hence 
yl @2s (x) = S- (1 eee (s = m)w* (x, m) 
= y C81 @s (5 + m)ap'(x,m) (2.8.83) 
where 
a’ (x, m) = (—1)°-™p* (x, —m) (2.8.84) 


giving the transformation law w(x,m) — w’(x,m). For s = 1/2 we recover 
the expressions in (2.8.79), (2.8.80). 

Again we have a distinction between half-odd integral spins and integral 
spins when the time reversal operation is applied twice in succession. From 
(2.8.81), (2.8.78), we have 

7? = (-1)**1. (2.8.85) 


The parity operator is similarly treated, using now the commutation re- 
lation in (2.6.13), and one may infer that (see Problem 2.17) 
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b'(x) = €h"(—x) (2.8.86) 


where € is a phase factor, and a similar expression for a spinor of rank k then 
also follows. 

An important consequence of time reversal as a symmetry operation arises 
from the following consideration. 

Upon multiplying the Schrédinger equation 


a) 
ins WW) = Ale) (2.8.87) 


from the left by 7, we may rewrite the resulting equation as 


— ihe (T |W) = He (TW) (2.8.88) 


where 


Hy =THT™' (2.8.89) 


and we have used the complex conjugation nature of T in (2.8.81) replacing i 
by —i in (2.8.88). For time reversal as a symmetry operation, (2.6.21) implies 
that 

THT '=H. (2.8.90) 


A theory which is invariant under time reversal, that is satisfying (2.8.90) 
gives rise to the following degeneracy problem of energy levels discussed be- 
low. 


2.8.4 Kramers Degeneracy 


For time reversal as a symmetry operation for which T? = —1, each 
eigenvalue of H is at least two-fold degenerate. This result easily follows by 
noting that 

H \w) = Bl) (2.8.91) 


implies, from (2.8.90), that T |y) is an eigenstate of H with the same eigen- 
value EF. Also from the chain of equalities 


(TY|Y) = (ITv |Tv)" = -(b|To)y" = — (Ty |), (2.8.92) 


we note that J |w) and |w) are orthogonal and are independent eigenstates 
of H. This type of degeneracy is referred to as Kramers degeneracy. For 
example, for the simple hydrogen atom, each energy level is at least two-fold 
degenerate due to the electron spin. 
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Appendix to §2.8: Transformation Rule of a Spinor of 
Rank One Under a Coordinate Rotation 


We are interested in the transformation rule of spinors under a coordinate 
rotation. To introduce spinors with this in mind, we define for each point 
labelled by x in R°, a complex 2 x 2 matrix 


3 1_ 3,2 
= x uv —ix 
X=x-o= (,. ih eas ) (A-2.8.1) 
where 0 = (01,02,03), with o1, 02, 03 denoting the Pauli matrices (see 


(2.8.2)). 
Some properties of the Pauli matrices are collected here for convenience 
and are given by 


Oj;0; = Oi + lei jkOk (A-2.8.2) 

Oj0;0k = Oi0jk = 075 0ik + Ki; + le ijk (A-2.8.3) 
1 1 F 

Tr(o;) = 0, 3 Nt (a:93) = ij, 3 It (710308) = ieijr (A-2.8.4) 


and €;j;~ = e'* is defined in (2.1.5). 
From the second equality in (A-2.8.4), we may solve for x in terms of X 


x= 5IoX), (A-2.8.5) 
We consider the transformations X — X’ defined by 
X'=AXA'=x'-o (A-2.8.6) 
where x’ (see (2.1.4)) is given by 
a” = R4(y,n)x? (A-2.8.7) 


for all 2 x 2 complex unitary matrices A = A(y,n), corresponding to coordi- 
nate rotations by angles {¢} about unit vectors {n}. 
We note that 


det X = —x-x (A-2.8.8) 
implying from (A-2.8.6) and the fact that 

x’+x’ =x-x (A-2.8.9) 
the invariance property 

det X = det X’ (A-2.8.10) 


under a coordinate rotation. Equation (A-2.8.10) also follows directly from 
the unitarity condition of the matrices A in (A-2.8.6). 
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For a subsequent coordinate rotation 
a” = RY (y',n')x" (A-2.8.11) 
X" = A'X'A = (A'A)X(A'A)' = x" 0 (A-2.8.12) 


Since the product of unitary operators defines a unitary operator, the set of 
all such transformations in (A-2.8.6) forms a group. 
The most general structure of a 2 x 2 complex matrix A is given by 


A=1la+ib-o (A-2.8.13) 
giving 
AtA = |al? + b> b* + i(b**bc9* — b*¥a + ab" )oy. (A-2.8.14) 
Unitarity then implies that 
lal? +b-b* =1 (A-2.8.15) 
bbc — b*Fa + a*b* = 0. (A-2.8.16) 


On the other hand (A-2.8.6), (A-2.8.13) lead to the equality 
(lal? — b+ b*)o; + (b*b- o + b’b* - a) — (ab™ +.a*b')eM* oy, 
=cosyo;+nin+o(1—cosy)—n'sinye*o, (A-2.8.17) 


where we have made use of (A-2.8.2)—(A-2.8.4). 
Upon multiplying (A-2.8.17) by oj, we obtain 


3|a|? —b- b* = 2cosy4+ 1 (A-2.8.18) 
ab*-o+a*b-o=n- osiny. (A-2.8.19) 
Equations (A-2.8.15), (2.8.18) then imply that 
a =e cos(y/2) (A-2.8.20) 
b) = es nd sin(y/2) (A-2.8.21) 
where 5, 5; are real. On the other hand (A-2.8.19) is equivalent to the con- 
ditions 
ab“) + a*b? = nJ sing (A-2.8.22) 
which from (A-2.8.20), (A-2.8.21) imply that cos(5 — 5;) = 1 for 7 = 1,2,3, 


and hence that ae 
b) = end sin (p/2). (A-2.8.23) 
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Accordingly, a rotation by an angle y about a unit vector n induces the 
transformation X — X’ where X, X’ are, respectively, defined in (A-2.8.1), 
(A-2.8.6), with the matrix A, up to an overall phase factor which by conven- 
tion we set equal to one, is given by 


A=1cosy/2+in- osiny/2. (A-2.8.24) 


It is readily checked that 
det A= 1. (A-2.8.25) 


Conversely, it is easily shown (see Problem 2.18) that the set of all trans- 
formations X — X’ with 
X'=BxXBt (A-2.8.26) 


for any 2x 2 complex unitary matrices B of determinant one each one induces 
a coordinate rotation. With B having the general structure as given on the 
right-hand side of (A-2.8.13), det B = 1 implies that 


a?+b-b=1. (A-2.8.27) 


This together with the unitarity condition in (A-2.8.15) imply that a and b 
are real and may be parameterized as in (A-2.8.20), (A-2.8.21) with e® = 1. 
Equation (A-2.8.18) is then automatically satisfied. 

The matrices A in (A-2.8.24) operate on two dimensional objects 


y= & (A-2.8.28) 


referred to as spinors or more precisely as spinors of rank 1, guaranteeing the 
invariance property of the inner product 


S> yrtyie = SO preys (A-2.8.29) 


a=1,2 a=1,2 
under a coordinate rotation, where 
ple = [A]? (A-2.8.30) 


and [-]?° denotes a matrix element of the matrix in question. 

Equation (A-2.8.30), with A given in (A-2.8.24), provides the rule of trans- 
formation of a spinor of rank 1 under a coordinate rotation by an angle y 
about a unit vector n. The transformation law for a spinor of rank k un- 
der a coordinate rotation is defined in (2.8.1). Equation (2.8.7), shows the 
equivalent exponential form of the operator A in (A-2.8.24): 


A=exp (ifn. c) (A-2.8.31) 
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2.9 Supersymmetry 


In quantum field theory, supersymmetric transformations give rise to 
transformations between bosons and fermions. As a unifying principle, super- 
symmetric Lagrangians in field theory have this appealing invariance prop- 
erty built in. Such a symmetry then puts constraints on physical processes 
involving the underlying particles. Although nature is not necessarily super- 
symmetric, the imposition of such asymmetry on a field theory has turned out 
to have some advantages over its non-symmetric counterpart. In particular, 
it has bean shown, in general, that the number of divergences (encountered 
in defining coupling parameters, masses and related physical quantities) are 
consequently reduced. Also the imposition of a symmetry narrows down the 
class of interactions that one may write down. With the hope of recovering 
more realistic theories, one may then break such a symmetry guided by some 
physical principles. 

Supersymmetric transformations may be also defined in quantum me- 
chanics, in general, as an abstract notion which turn out to have far reach- 
ing consequences and yield to several physical applications. For example, in 
the elementary Bose/Fermi oscillator, as a prototype of field theory models, 
the explicit transformations between Bose and Fermi states may be readily 
demonstrated. This and other examples will be given later (see, e.g., §6.5, 
84.7). 

In this section, we define supersymmetric transformations which parallel 
the treatment of the Galilean transformations carried out in §2.2, §2.3. The 
closed path transformations lead to basic commutation/anti-commutation 
relations of the supersymmetry generators and of the Hamiltonian in a way 
similar to the earlier study. Consequences of these relations and physical 
applications will be given later as mentioned above. 

Together with the time variable t, as a c-number, we introduce anti- 
commuting c-number variables 0, 6* having the following transformations: 


tot—T+i€O-iP#*E=t (2.9.1) 
9>0+€=6, 0 =O +e (2.9.2) 

where by anti-commuting c-numbers 0, 6* one means 
{0,0}=0, {0*,A*}=0, {6,07} =0 (2.9.3) 


where 


{A,B} = AB+BA. (2.9.4) 
Equations (2.9.3), in particular, imply that 


(Gr =O. fe) =o: (2.9.5) 
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The variables 0, 6 are referred to as Grassmann variables.°® 

As before, rT is a c-number. On the other hand, 0, 6*, €, &*, €, Easy 
anti-commute with each other but commute with t, 7, T,.... 

For a subsequent transformation to (2.9.1), (2.9.2), we have 


0-i0 € (2.9.6) 
O@=O0+€ (2.9.7) 


and similarly for 6 , or 


i=t (- 7T-i€ €4 ig*€) | i (e* &) 0 i0* (€+ €) (2.9.8) 


0=04(E+6). (2.9.9) 
The transformations in (2.9.1), (2.9.2) may be specified by the pair 
(7, ) (2.9.10) 


which for simplicity of notation we have suppressed the entry €*. 
From (2.9.1), (2.9.2), (2.9.8), (2.9.9) we have the following group proper- 
ties: 


(1) 


(72, €2) (71, £1) = (T1 + 72 — 1€3€1 + i€T Eo, &1 + Ea). (2.9.11) 
(2) For the identity element I 
I = (0,0). (2.9.12) 
(3) The inverse is given by 
(7,€) "= (-1,-8) (2.9.13) 
as is easily checked that 
(—7, —€) (7,) = (0,0). (2.9.14) 
(4) Finally, we have the associativity rule 
(73, £3) [(72,€2) (71, £1)] = [(73, €3) (72, €2)] (71, &1) - (2.9.15) 
We consider successive transformations following a closed path given by 
(T2,€2)' (T1,61) (72, €2) (71, &1) (2.9.16) 


represented pictorially by the box diagram 


® A fairly detailed treatment of Grassmann variables will be given in §10.6. 
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emphasizing the reversal of the transformations in the third and the fourth 
segments of the path. 

The group properties (1)—(4) above then lead to the following rule asso- 
ciated with the closed path: 


(T2,€2)~* (71,€1)* (2,2) (1,1) = (7,8) (2.9.17) 

where 
PED E PIS (2.9.18) 
G0: (2.9.19) 


Since the above transformations are connected with the identity, the corre- 
sponding transformations in the vector space of physical states is represented 
by unitary transformations. For infinitesimal transformations, we may write 
for the overall transformation 


U=14+56+... (2.9.20) 
where 
G = —$rH + i5€*Q —iQ' 5¢ (2.9.21) 
and 
U;=1+ =Gi aby (2.9.22) 
G; = —51jH + i8€% Q —iQt 86; (2.9.23) 


j = 1,2, and Q, Q? are generators of supersymmetric transformations. 
Upon writing 
U =U, 'U, U2, (2.9.24) 
we have the identity 
1 
Cae (G1, G2] (2.9.25) 
i 
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and, from (2.9.18), (2.9.19), 
Sr = —2id€3 5€, + DiSEF 5g (2.9.26) 


6€ =0 (2.9.27) 


5€,, d5€7, 5, 5€5 anti-commute with Q, Qt and anti-commute with each 
other. 5€, d5€7, 5€2, 5€3 however, commute with 57), 672, H. 

The commutator on the right-hand side of (2.9.25) is easily worked out 
to 


[Gi, G2] = (id72 bE] — 167) 53) [HQ] 


t (i571 5&2 — i572 5€1) (QT, H] 
+ (5&7 8&3) {Q, Q} 
+ (8&1 5&2) {Q',Q"} 


+ (S€7 2 — 5€ 541) {Q,Q"} (2.9.28) 


which, from (2.9.21), (2.9.25)—(2.9.27), and upon the comparison of (2.9.28) 
with ifG in (2.9.21) yields to 


[H,Q] =0 (2.9.29) 
[H,Q"] =0 (2.9.30) 
{Q,Q}=0 or (Q)?=0 (2.9.31) 
{QV o}=0 o (Qt)? =0 (2.9.32) 
and 
1 
H= aR es O"}- (2.9.33) 


Applications of the results obtained above will be given later (e.g., in §4.7, 
86.5 also in §8.2). 


Problems 


2.1. Establish the properties (2.1.18)—(2.1.20), (2.1.27)—(2.1.30) for the ma- 
trices M,, M2, M3 involved in the expression for the rotation matrix 
Rin (2.1.21) 

2.2. Work out in detail the group properties (2.2.8)—(2.2.12) of the Galilean 
transformation and also the infinitesimal changes in the coordinate 
labels given in (2.2.26), (2.2.27) together with (2.2.22)—(2.2.25) under 
the closed path transformation (2.2.13). 
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2.3. 


2.4. 


2.5. 


2.6. 


2.7. 


2.8. 


2.12. 


2.13. 


2.14. 


2.15. 


2.16. 


2.17. 


2.18. 


2.19. 


2 Symmetries and Transformations 


Derive the entries in Table 2.1 below (2.3.15) by using the commu- 
tation relation in (2.3.12) and the expressions for the G and G; in 
(2.3.13), and (2.3.14) /(2.3.15), respectively. 

Obtain the expression for the coefficient F in (2.3.16) and show explic- 
itly it is equal to zero. Carry out a similar analysis for the coefficient 
B in entry 7 of Table 2.1, to show finally that it is zero as well. 
Provide another derivation of (2.3.39) for finite translations without 
building it up from infinitesimal translations. 

Using the position variables defined in (2.5.55)—(2.5.57), show that the 
Hamiltonian in (2.5.58) is of the form (2.5.48) for the two cluster prob- 
lem with one cluster consisting of one particle and the other involving 
two particles. 

Extend the analysis of the two cluster case given in Problem 2.6, to the 
case where each cluster consists of two particles as in the interaction 
of two hydrogen atoms. Choose your variables conveniently. 

Repeat the analysis in §2.6, where it was shown that the parity trans- 
formation is to be implemented by a unitary operation, to show that 
the time reversal transformation is to be implemented by an anti- 
unitary one. 


. Show that consistently that no phase terms need to be introduced on 


the right-hand sides of (2.6.11), (2.6.20). 


. Show that (2.7.2), (2.7.3) lead to (2.7.4) for 5L, where L denotes the 


orbital angular momentum operator, in (2.7.11). 


. Establish the commutation relations for spin in (2.7.7)—(2.7.10) as they 


follow from the earlier commutation relations involving J, X, H, P. 
Show that the spin of a system of particles, as one composite object, 
obtained in (2.7.30) satisfies the commutation relations in (2.7.31). 
Since g é % 

exp (ifn. c) =cos (Fn. c) +isin (Sn. c) 
use the fact that (n- 0)? = 1 and the facts that the cosine and sine 
functions are, respectively, even and odd functions, to obtain another 
derivation of (2.8.7). 
Show that the expression in (2.8.11) may be rewritten as in (2.8.18) 
in terms of the two variables g1, ge. 
Prove that spin 0 may be described by a second rank anti-symmetric 
spinor and verify explicitly, in the process, the spin 0 content of such 
a description. 
Establish the identity in (2.8.82). 
Repeat the analysis given through (2.8.71)—(2.8.77), (2.8.81) for time 
reversal, to the parity operation to derive, in the process, (2.8.86). 
Provide the details to show that the set of all transformation X — X’, 
with X’ given in (A-2.8.26), induce a coordinate rotation. 
Work out the closed path exact supersymmetric transformation law 
given in (2.9.17)—(2.9.19). 
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2.20. Prove the commutation rule [G),G2] given in (2.9.28) involving the 
supersymmetry generators and the Hamiltonian, and finally derive 
(2.9.29)—(2.9.33). 
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Uncertainties, Localization, Stability and Decay 
of Quantum Systems 


Uncertainties, localization and stability are significant features concerning 
quantum systems. These are aspects which distinguish the latter systems 
from classical ones. This chapter is involved with such key points including 
related ones as the boundedness of the spectra of Hamiltonians of physical 
systems from below. This property of boundedness from below is important as 
otherwise a system would collapse to such a level as to release, in the process, 
an infinite amount of energy which is physically meaningless and the system, 
in question, would be unstable. The first section deals with analyses related 
to investigating the probability of the “fall” of a particle into another in a 
bound state as well as in determining a lower bound for the average spatial 
extension of such a system and in deriving an upper bound for the average 
kinetic energy into consideration in addition to other details. This includes 
investigating the nature of the resistance of a system with a large number of 
Fermi particles to the increase in its density. The boundedness of the spectra 
of Hamiltonians from below is the subject of §3.2-83.4 for several classes of 
interactions and §3.4 is involved with multi-particle systems. The final section 
§3.5 is concerned with the decay of quantum systems and special emphasis 
is put on the celebrated Paley-Wiener Theorem for describing a physically 
consistent theory of quantum decay in which the underlying Hamiltonians 
are bounded from below. 


3.1 Uncertainties, Localization and Stability 


3.1.1 A Basic Inequality 


Our starting inequality to study uncertainties and localization in quantum 
physics is the following one: 


W 


J2°x|I00 ~ia) + a — b) 960) 809)? 0 (3.1.1) 
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where a, b are real constant vectors, a is a real parameter and g(x) is a real 
function. For greater generality, the space dimension, denoted by the natural 
number », is left arbitrary. 

Upon expanding the above inequality and multiplying the latter by f?, 
we obtain 


((p a ah)”) — ah? (V + [(x — b)g(x)]) + 02h? (x — b)2g?(x)) > 0. (3.1.2) 


Here we have used the notation (A) for (w|A|w). Minimizing the left-hand 
side of the above over a gives the useful bound 


h? (W + (x —b)g(x))” 


(Pa) > FBP eC) 


(3.1.3) 


3.1.2 Uncertainties 


For example, for g(x) = 1, (3.1.3) leads to the familiar Heisenberg uncer- 
tainty principle inequality 


((x—b)*) ((p — ah)”) > a (3.1.4) 


Equation (3.1.4) puts a lower limit on the product of the expectations of the 
deviations squared of the position and momentum about the constant vectors 
b and ah, respectively. For b = (x), ah = (p), these lead to the definitions 
of the variances or the standard deviations squared. 

An uncertainty relation of the form in (3.1.4) is also obtained for other 
two non-commuting self-adjoint operators A, B with 


A-B-B-A=iC (3.1.5) 


giving rise, formally, to a self-adjoint operator C. 
Upon using the Cauchy-Schwarz inequality 


| (Aw |-Bu) — (By |-Av) | < 2\/(A?)\/ (B?) (3.1.6) 


(Ay|-By) = De (Ath | Bub) 


[(C) | < 2y/(A?),/(B?). (3.1.7) 


For A=x-—b, B= p-ah, C= vf, (3.1.7) leads to (3.1.4). 
Another useful inequality which follows from (3.1.2) is the following 


7 me (x?g°(x)) < (F ~ cue (3.1.8) 


where 


we obtain 


obtained by setting a = 0, b = 0 and choosing a = p/h?. 
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3.1.3 Localization and Stability 
An interesting application follows from (3.1.3) by setting g(x) = 1/|x — b| 


and a = 0. This gives the important lower bound for the expectation value 
of the kinetic operator for v = 3: 


2 2 2 
p A 1 
> 3.1.9 
) aa (cu) ( ) 
for a given mass VM. 


On the other hand, we may invoke the Cauchy-Schwarz inequality 


1 = faex weap = fat PET bx — bl) 


1 1/2 i 
<(—y) (|x — b]) (3.1.10) 


for a normalized state, to obtain from (3.1.9), (3.1.10) 


(Fe) x bl)? > so. (3.1.11) 


By using the symbol X for the random variable associated with the po- 
sition, we have for the probability of occurrence of the event {|X — b| < 5} 
for any 6 > 0: 


Prob [|X — b| < 6] = [ex |ab(x)|? @(5 — |x — b]) (3.1.12) 


where O(c) is the step function, i.e., equal to 1 for c > 0, and zero for c < 0. 
Since in the integral we have the constraint 


1<6/|x—b| (3.1.13) 


and O(c) < 1, we have from (3.1.12) the bound 


1 
Prob [|X — b| < 6] < (5)s (3.1.14) 


which from (3.1.9) leads to 


Prob [|X — b| < 6] e wees " 


F SA og (3.1.15) 


The physical interpretation of this result is clear. Jf for arbitrary small 
6, Prob [|x —bl< 6], is non-vanishing, then the expectation value of the 
kinetic energy is necessarily arbitrarily large. That is, for a particle which 
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has a non-vanishing probability of being found within a sphere of radius 6 
about a fixed point, specified by the vector b, its average kinetic energy is 
necessarily large for an arbitrarily small sphere of localization. 

In collision theory, for example, we may identify M with the reduced 
mass p of two particles, set b = 0 in (3.1.15), with |x| denoting the distance 
between the two particles. Accordingly, in order to bring the “colliding” parti- 
cles arbitrarily close to each other, there must exist an arbitrarily large lower 
bound to the average kinetic energy of the relative motion. 

For a negative-energy state, the average kinetic energy cannot be arbitrarily 
large and (3.1.15) may be used in the following manner. Consider the Hamil- 
tonian of relative motion of a two-particle system (§2.5), 

2 


f= Fy (3.1.16) 


2p 
where = m1m2/(m1 + mg) is the reduced mass. 
Suppose that V(x) remains invariant under the scalings m, > Km 1, m2 > 
Km, by an arbitrary positive parameter «, and the scalings 


Aq eae («)™ 1, r2 = (K)* do, soe (3.1.17) 


where Aj, Az, ... are coupling parameters, and 61, d2, ... are some real num- 
bers. For the Coulomb interaction, for example, V is independent of m1, m2, 
and Ay = q1q2, 61 = 0 where qi, gz denote the charges of the two particles. 
For the Newtonian gravitational interaction, with A; chosen to coincide with 
the gravitational coupling constant G, 6, = —2. 

In Chapter 4, the nature of the spectra of Hamiltonians is studied under 
some general sufficiency conditions satisfied by the potentials. In general, we 
consider a strictly negative energy-state of a Hamiltonian, if there exists one, 
| (m1, m2, A;)) satisfying, by definition, 


(b(m1, M2, As) |A|y(m1, m2, Ai)) < 0 (3.1.18) 


with the understanding that energy is required to “break up” the system by 
separating the two particles and make the energy of the system non-negative. 
Here we have labelled the state |w(m1, mz, A;)) by m1, mg and by 44, standing 
for Ay, Ag, ..., for a reason that will become clear below. 

A physical system is one that has the spectrum of its corresponding Hamil- 
tonian bounded from below (§3.3), that is, its spectrum does not go down to 
—co. Otherwise, the system would collapse to such a level as to release an 
infinite (!) amount of energy which is physically meaningless and the system 
would be unstable. 

Suppose that the ground-state energy —E[m1, m2, A;] of the Hamiltonian 
(3.1.16), that is, corresponding to the lowest point of its spectrum (§3.3), is 
finite and is strictly negative. Then for the state |w(m 1, mg, A;)) satisfying 
(3.1.18) we may write 
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—E [m, m2, Xi] < (w(m, m2, di) |A\b(mi, m2, di)) <0. (3.1.19) 


Let hes ae (is di) denote the state |W(m,, mz, ;)) with m1, ma, 


: 5; 
Ai replaced, respectively, by 41, ™, (3) Ni 


Clearly, the state we, ae (ay ri) ) cannot lead for the expectation 


0; a 
my, mez (1\" my me {1\" 
where H is given in (3.1.16), a numerical value lower than —E[m1, mg, Ai], 


for the given H, otherwise this would contradict the fact that —E[m1, mo, Aj] 
is the ground-state energy of H. That is, 


value 


—E|m1,mz2, ri] 


< (of ™,™ (EVN (Pay 
: ASG) (E+ ) 


my mg, 1 
wp 9? 9” (5) a 
3.1. 


(3.1.21) 


—E[2my, 2me, (2)* Ai] 


< (wim, m2, Xi) 


(= +v) fom, ma, as). (3.1.22) 


On the other hand, the inequality (3.1.18) implies, for the Hamiltonian 
(3.1.16), that for the expectation value of p?/4y: 


2 


(W(am,ma,ds) ° w(am,ma,&)) 


2 
G + v) Goma, x) 
UL 


<- (v(m, m2, Ai) 


< E[2m1, 2mz, (2)* Ai] (3.1.23) 


where in writing the last inequality we have used (3.1.22). 
Equation (3.1.23) gives the following upper bound for the expectation 
value of the kinetic energy operator for such a system 


2 
(FE) < 2E[2m1,2mo, (2) Ai]. (3.1.24) 
m 
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From (3.1.11), with b = 0, we have the following non-vanishing lower 
bound for the expectation value (|x|) of the separation distance between the 
two particles 


h ; -1/2 
(|x|) > \e (2E[2m, 2mz, (2)* d;J) (3.1.25) 


as follows from (3.1.24). 
From (3.1.15), (3.1.24) we also obtain, with b =0, M — p, 


1/2 
Prob [|X| < 6] < 26 (Sz [2m1,2me, (2) di] , (3.1.26) 


That is, for a physical system for which 0 < E[2m1,2mz,(2)*'\;] < 00, 
(3.1.26) gives the satisfactory result of a vanishingly small probability, for a 
vanishingly small 6, and rigorously vanishes for the “fall” of one particle into 
the other. 

A similar inequality to (3.1.26) will be also derived for multi-fermion 
systems in (3.1.37), and also later on, in Chapter 14, in our study of the 
important problem of the stability of matter with special attention given 
to the number of particles involved and the Fermi character, that is of the 
underlying Pauli exclusion principle, of the electrons. The stability of the 
hydrogen atom is studied in 87.1. 


3.1.4 Localization, Stability and Multi-Particle Systems 


Here we are interested in the localization problem of identical Fermi 
particles, such as electrons. Consider the (anti-symmetric) wave function 
W(x101,...,Xnown) of such N particles in the coordinate description, where 
01,---,On are spin indices. A single particle probability density, normalized 
to one, may be defined by 


A=. [0° . Pxy |b(x101,...,XNon)|?. (3.1.27) 


The probability that any one of the particles is found within a sphere of 
radices 6, about some point in space specified by a vector b is given by 


Prob [|X1 — b| < 6] = [ox h(x) 0(6 — |x — b]) (3.1.28) 
where O(a) is the step function, i.e., O(a) = 1 for a > 0 and O(a) = 0 for 


a<0. 
By Hélder’s inequality in Appendix II, we may bound 


[ox h(x) @(5 — |x — bl) < ( [os nor) 
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x ( fax Q(6 — eb) (3.1.29) 


where we have used the fact that (Q(a))°/? = O(a). 
Hence from (3.1.28), (3.1.29) we have 


1 2/5 a 3/5 
Prob [|X1 — b| < 6] (=) < ( fox (h(x)) ) (3.1.30) 
where 4 
v= = (3.1.31) 


is the volume in which a particle is confined. 
In the sequel, we use the notation, 


«| = (3.1.32) 


for the expectation value of the kinetic energy of the N particles, where m 
denotes the mass of a particle. 

In §4.6, (see (4.6.24), (4.6.16), (4.6.17)), we derive the following bound on 
the average kinetic energy TJ’, where the exclusion principle plays a key role, 


3/5 3/5 
T 
( fax (ne0)*”) <o>y (3.1.33) 
with 7 is 
1 2(2 1 
geet ver) (3.1.34) 
3h? 37 
and s is the spin of a fermion. 
Since 


Prob [|X; — b| <6,...,|Kn — b| < 6] < Prob [|K; —b| <6] (3.1.35) 


(3.1.30), (3.1.33) give 


N\2/8 p\3/s 
Prob [|X1 — b| <6,...,|Xw —b] < 6] (*) <e (=) . (3.1.36) 
As an application of this bound, consider a system of fermions, which may 
be interacting, and with a non-vanishing probability of occurrence of the event 
{|Xi — b] < 6,...,|Kw — b| < 6}, ie., of being localized as indicated. As N 
becomes larger and larger, i.e., we increase the particle density, the left-hand 
side of (3.1.36) goes to infinity for N — oo. This can be true only if T goes 
to infinity as well, and not slower than N. This shows a kind of “resistance” 
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of Fermi particles to the increase in density, by the increase of their average 
kinetic energy. 

As another application , consider a negative-energy state of the fermions. 
For astable system, the ground-state energy E'y,—Ey cannot grow faster than 
N. For example,—E'y cannot growlike N° witha > 1.The reason is that other- 
wise the formation of a single system consisting of 2N particles will be favored 
over two separate systems brought into contact, each consisting of N particles, 
and the energy released of such a contact, being proportional to [(2N)°— 
2(N)*], will be over whelmingly large for large N, e.g., N ~ 10?3 (see also 
Chapter 14). 

Accordingly, with a ground-state energy Ey, with typical bounds —aN 
< En < —a@N, where a, @ are some positive constants, almost an identical 
reasoning as the one lading to (3.1.24) for the average kinetic energy, with 
similar scaling properties of the interactions, for example with Coulombic ones, 
shows that T < AN,! with A denoting some positive finite constant indepen- 
dent of N. 

For such a negative-energy state, we may then further bound the right- 
hand side of (3.3.36), giving 


N 2/5 
Prob [|X, — b| < 6,...,|Ky — b| < 6] (=) < cA3/5 (3.1.37) 
U 


where the right-hand side is some finite constant. For a non-vanishing prob- 
ability of having the fermions within a sphere of radius 6 as indicated, in 
order that the left-hand side of (3.1.7) remains finite, in conformity with its 
right-hand side, as N grows without bound, it is necessary that the volume 
v grows at least as fast as N. That is, in particular, the radius of spatial ex- 
tension of the fermionic system grows not any slower than N‘/3 for N — oo. 
These properties will be quantitatively analyzed in Chapter 14 in considering 
the problem of the stability of matter, where the exclusion principle, plays a 
key role and is based on Manoukian and Sirininlakul (2005). 


A lower bound to the expectation value of x |x;| /N, as a measure of 


i 
the extension of the above system is also readily obtained. 
To the above end, 


N | | N 
os * ) d fetm...dxw (>: 
w=1 O1,-+-,0N t=1 


But for any 6 > 0, 


I 


x; 
| |wb(x101,-..,xNon|" 


I 


ox |x| h(x). (3.1.38) 


' Actually just the bound —aN < En < 0 is sufficient to establish this, in the 
process, that T < AN (see (3.1.19)). 
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Jes |x| h(x) > ii d?x |x| h(x) > 6 / d°x h(x) (3.1.39) 
[x|>6 |x|>6 
and 
/ d°x h(x) = Prob [|X| > 6] = 1 — Prob [|X| < 4]. (3.1.40) 
|x|>6 


Accordingly, from (3.1.35), (3.1.37)—(3.1.40), 


N 2/5 
xi vase 3/5] _ 4m 3 3/5 
ob: ~~ )>s 1- (=) c AS! = §|1— (58 cA 
(3.1.41) 


where in writing the last equality, we have used (3.1.31). Upon optimizing 
the right-hand side of (3.1.41) over 6, gives 


6= aN\e (6) I (3.1.42) 
~ \ An lle Al/2 Fr 


which from (3.1.41) leads to 


N 5/6 1/3 
i N 1 
S ES a (a s (3.1.43) 
N /7 ii \ite dn) AIP 


i=l 


giving a lower bound proportional to N!/3, where c is defined in (3.1.34). 
Bosonic systems behave differently, and so-called “bosonic matter” will be 
analyzed in detail in Chapter 14 (see also Problem 14.10). 


3.2 Boundedness of the Spectra of Hamiltonians From 
Below 


If the spectrum of a self-adjoint operator A is bounded from below then 
its spectral decomposition (§1.8, (1.8.15)) may be explicitly written as 


Aa] ¥aR,O) (3.2.1) 


where |L.4| < oo. For any vector |f) in the domain of A, we then have the 
following lower bound for its expectation value 


iain =f “yd (FIPACAI) > La [ “a(F|Pa(if). (8.2.2) 


Hence for all |f) in the domain of A, the resolution of the identity (see 
(1.8.18)) 
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1= “abe (3.2.3) 
La 
gives the bound 
(fFIAIF) > Lallfll?. (3.2.4) 


Conversely, suppose that for some self-adjoint operator A, and for all 
vectors | f) in its domain, the bound (3.2.4) holds true. Then we will show that 
for all A in the spectrum of A, A > L.4. To do this we assume otherwise and 
run into a contradiction. That is, suppose that for some Ap in the spectrum 
of A: Ag < La, and let 

_ La Xo 


a eS 0: 2. 
E ee (3.2.5) 


Then (§1.8), 
[PaOo te) = Pin — oH (3.2.6) 


is not empty. Let |fo), in the domain of A, be a vector belonging to (3.2.6), 
i.e., 

[Pa(Ao + €) — Pa(Ao — €)] |fo) = |fo) - (3.2.7) 
Hence 


Ao+E 


Aote 
(folAlfe) = [AAPA fl? <Oo+e) [ dllPaQ)fol? 6.28) 


te toe 


or using, in the process, the equality in (3.2.5), we have 


(fo|Alfo) < (Ao +) Ilfoll? = (Za — €) IIfoll? < La ||foll” (3.2.9) 


which is in contradiction with (3.2.4). 

That is, if for all |f) in the domain of A for which (3.2.4) is true, any A 
in the spectrum of A, is such that \ > Ly. 

One way of obtaining a lower bound of a Hamiltonian H is through the 
examination of its resolvent (H — €)~'. According to the treatment given in 
§1.8, if € is a real parameter belonging to the spectrum of H, then either 
(H —€)~' does not exist or if it exists then it is an unbounded operator. 
Thus, if one may find a real number £y such that for all € < , (H—€)' 
exists and is a bounded operator, then clearly €) provides a lower bound to 
the spectrum of H. Such a method will be applied in some of the subsequent 
investigations carried out in examining the boundedness of Hamiltonians from 
below. 


3.3 Boundedness of Hamiltonians From Below: General 
Classes of Interactions 


This section is entirely involved with the investigation of the boundedness 
of Hamiltonians from below (§3.2) for large classes of interactions. These 
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classes will be dealt with in various parts [A] to [G] given below. Needless to 
say, an interaction, specified by a given potential, may belong to more than 
one of the classes considered. Lower bounds of Hamiltonians for multi-particle 
systems will be considered in the next section. 

The lower bounds derived are not necessarily optimal ones but they es- 
tablish the important property of boundedness of Hamiltonians from below 
given that some sufficiency conditions are satisfied by the potentials. 

[A] The simplest potential V(x) is one that is bounded everywhere. That 
is 


|V(x)| <C (<0) (3.3.1) 
for all x. Accordingly (for normalized |w)), 


Mb |VIb)| < Cll]? =C (3.3.2) 


and the positivity of the kinetic energy operator implies that (p? = —h?V") 


(o|(® 4v)Je) sc 83) 


An example of such a given potential is 
V(x) = A,0(R — |x|) + Age 9 (3.3.4) 

with 0 < R < ow, @ > 0, and one may take 
IV (x)| < |Ar| + [Ao] = C (3.3.5) 


[B] We may use the inequality in (3.1.8) to obtain lower in bounds for 
Hamiltonians with a class of potentials V(x) related to a real function g(x) 
given by 


V(x) = -3V + (xg(x)). (3.3.6) 


From (3.1.8), this leads to? 


= oa (x?9?(x)) < (EF + V(x) (3.3.7) 


The applications of (3.3.6), (3.3.7) are endless. These couple of equations 
will be used to obtain lower bounds for Hamiltonians for several specific 
interactions later on, notably in §4.2, 86.1, §7.1. 

As another application of (3.1.8), where the potential, in this case, is not 
chosen exactly in the form in (3.3.6), consider the following general class of 
potentials, (|x| =r, space dimension v = 3), 


? Note that parts or all of x?g?(x) may be also reconsidered as part of the potential 
energy itself. 
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= dye" Fi" AQ A3 


V(x) bine (3.3.8) 
r ror 
where 
is 3.3.9 
0,A3 > -—— 3. 
By > 0,A3 > Bu ( ) 
which are obviously bounded from below as follows 
[Ai] | A2 , As 
s , 3.1 
V(x) + = +3 (3.3.10) 
Upon choosing g(x) in (3.1.8) to be the function 
g(x) = 242 (3.3.11) 
fe Ne 
with Sih ‘ 
3 
h? A3 
a 1 14 ul 
B= 9, ( Su *) (3.3.18) 
we obtain from (3.1.8) by a re-arrangement of terms, 
2 
Hoo [Pp (Ail—A2) | As 
eee ae | 3.3.14 
one (E r r2 ( ) 
which from (3.3.8), (3.3.10) gives 
2 
Mog p 
-=s — F 3.1 
p21 < (F +e) (3.3.15) 


[C] We now consider another class of potentials. To this end we first write 
the potential in 


cope 
Bo (3.3.16) 

as 
V =Ve(V) + VeE(-V) (3.3.17) 


where O(a) is the step function and hence O(a)+0(—a) = 1. Since VO(V) > 
0 we have 
V >VO(-V) =-v (3.3.18) 


where vu > 0. The step function @(—V) picks up the points x for which the 
potential V is non-positive. Another standard notation for VO(—V) is 


VeE(-V) = -|V|_ (3.3.19) 
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where |V|_ > 0 
In order to obtain a lower bound for H in (3.3.16) it is sufficient, from 
(3.3.18), to consider instead the Hamiltonian 


H’==—-—-v (3.3.20) 
2h 
since 
(H) > (H’). (3.3.21) 
To the above end, set 

ve =U. (3.3.22) 

We consider a class of potentials for which 
[ox d°y u(x) ———, uly) < oo. (3.3.23) 

Ix —y| 


The latter integral may be also rewritten as 


a@/2 PI | (< 00) (3.3.24) 


by using, in the process of the derivation, the integral expression 


1 Anh? 3 e ip:x/h 
= 3.3.25 
pin any [ep a) 
where (p) is the Fourier transform of u(x): 
a(p) = faPxuioje Pl, (3.3.26) 


From the identity 


(dh |ujb) = [exe’y (u|V—v? |x) K(x,y) (y|V—v?|u) (3.3.27) 


where 
1 1 


and an application of the Cauchy-Schwarz inequality in R®, we obtain 


1/2 
(olule) < (o|(-W2)|») ( faxaty|%¥)P) (3.3.29) 


[exe’y |K(x,y)|? = Jes (y 


where 
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_h f dp |ii(p)/? 
8 leas. |p| Coed 
or 
(plulb) <aT, u=v’? (3.3.31) 
with , 
_/P 
don (Ez) (3.3.32) 
lu dp _ me 
a= Fp Aa72 (2 ja()) ; (3.3.33) 


We use (3.3.31) to derive a lower bound for H’. 
Upon dividing the following inequality by 4T 


AT \/ (v2) < AT? + (v?) (3.3.34) 
and using the bound (3.3.31) we arrive at 
Jw) <T+ t (3.3.35) 
or equivalently at 
2 ; Sop n/t), (3.3.36) 


Finally, the application of the Cauchy-Schwarz inequality to 
(ded = |b? = |v|II) 


(v) = [excep lel (3.3.37) 


gives 


(v) < V/(v?). (3.3.38) 


This together with (3.3.36) yield, from (3.3.16), (3.3.20), (3.3.21), the 
following lower bound for the Hamiltonian 


ae (= <b v) . (3.3.39) 


As an application consider the potential 
— Br 
vr’ 


, we have for u(p) in (3.3.26), the expression 


VSS \>0,8>0 (3.3.40) 


which with u = v? 


And? h2 
iu(p) = Ee (3.3.41) 


giving for a in (3.3.33) 
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V2pr? 
thus obtaining the bound 
2 pr? Be Nene 
uN < (F 7 " (3.3.43) 
[Lb r 


[D] The lower bound derived in this subsection is important in that it 
provides a sufficient condition for a potential so that the Hamiltonian is 
positive thus giving a No-Binding Theorem. 

The method of development is similar to the one given in part [C]. To this 
end, we use the bound in (3.3.18), and obtain directly from (3.3.29)—(3.3.31), 
now working with v(x) rather than with v?(x), the bound 


p” 

< — (O5 

(blvld) < (Eh (3.3.44) 
where oe 
Ll i ae 

b = —~—~ — , 34 

aan ([F wyr) (3.3.45) 
Therefore 


(Fav)> (EF -v)sa-(E). (3.3.46) 


Since (p? / 21) > 0, the Hamiltonian in question is positive for 
b<l (3.3.47) 


providing a sufficient condition for “no-binding”. 
For the Yukawa potential, for example, r = |x|, 


er 
V(x) =-A ae —v(x) (3.3.48) 
with A>0, 6 >0, 
Arh? 
v) oo . A 
v(p) eRe (3.3.49) 
giving 
2 
fe Ap/2 (3.3.50) 


h? GB 
and implying the positivity of the Hamiltonian 
p? re 8" 
2U r 


(3.3.51) 
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for 


nh? 
vA< 3.3.52 
i ( ) 
or equivalently for 
2 
v2n B (3.3.53) 


he 
as a sufficiency condition to be satisfied by (. 

[E] The analysis given in this part deals, in general, in vy = 3,2, 1 dimen- 
sions of space. We consider the following class of potentials. A potential V in 
this class is defined as any potential which is square-integrable over a finite 
region about the origin and is bounded beyond this region. That is, we may 
find finite constants R, > 0, C, > 0 such that 


i d’x |V(x)|? < ay < 00 (3.3.54) 
|x|<Rp 
and 
|V(x)|<CL, for |x| > R,. (3.3.55) 


Below we will derive the following inequality 


|Vel| < A, || Hovl| + Bp (3.3.56) 
where 
p” 
Hg == (3.357) 
Qu 


and A,, B, are positive constants, depending on v, and 0 < A, < 1. Such 
an inequality is usually referred to as a Kato bound.3 

We first show how inequality (3.3.56) leads to a lower bound to the Hamil- 
tonian H = Hj) + V. We recall, (see §1.8, §3.2) that if a real parameter € 
belongs to the spectrum of H, then either (H — éy! does not exist or if it 
exists then it is an unbounded operator. We will see that for all reals 


be Bi(1= A) (3.3.58) 


(H — ee exists and is a bounded operator, where, for simplicity of notation 
here, we have suppressed the index v of space dimension in A,, B,. That is, 
—B/(1 — A) provides a lower bound to the spectrum of H. 

To reach the above conclusion, we note that, for complex €, for example, 
one may formally write 


1 1 1 1 
Gp Gio (ian ae 


3 Cf. Kato (1966, 1967) and an earlier classic paper: Kato (1951a). 
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1 1 1 
~ (Ho - 8) (H-8 oy (Hy — 8) (3.3.59) 
leading to the formal expansion 
: 1 k 1 k 
= ial is 3.3.60 
(i =) (5 2 ) | Tee ( ) 
For a strictly negative €, we may use (3.3.56) to obtain 
: -1 B 
sgl [$4 oro as Bi 
. (4+ a) Fal (3.3.61) 


where we have used the fact that Hp is a positive operator. For € < 
B/(1-— A), A+ B/|é| < 1, we then have 


1 1 By 
ies If <gmee At 
laa l<adl tia) Wa 
1 
~ (aa-ay— a! 
1 
— (<€(1- A) — B) | fl < 0 (3.3.62) 
for all |f), and € satisfying (3.3.58). 
Hence we may conclude that for v = 3,2,1: 
p? 
(EF v) Be Gd) (3.3.63) 


with 0 < A, < 1,0 < B,, as defined through (3.3.56). Therefore it remains 
to find such constants A, and B, in deriving (3.3.56), in general, for v = 
3, 2,1. The Coulomb potential qiq2/ |x|, for example, belongs to the class of 
potentials considered for v = 3, i.e., it satisfies (3.3.54) and (3.3.55). 

The derivation of (3.3.56) follows. To this end we use the definition of the 
Fourier-transform in yv dimensions: 


vx) = f Bree) (3.3.64) 


and derive the following chain of inequalities: 


wool < | abe 


J(p)| 
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eo ft +0.()] ben(eyy 


(20h)”/? (20h)”/? 


(3.3.65) 


or 


moor <1] f 2. bw +e f 2P. (2) eel] 3.65 

x = Vv (27h)” p T Vv (27h)” Qu p aaa 

where in (3.3.65) we have multiplied and divided the integrand by 
1/2 

E + b, (p?/2u)"] , and in writing (3.3.66) we have used the Cauchy- 


Schwarz inequality, and 


d’p 1 pr! 1 
i= 7 = 3.3.67 

lar a Sea eee | 88 
1+b, (=) ¥ 


For each v, b, is so far an arbitrary strictly positive constant, v = 1, 2,3. 
Accordingly, from (3.3.66), (3.3.67), we have 


Woo? <1, {1 +b, |Houll’}. (3.3.68) 
We will use this upper bound for |e (x)|” for |x| < R, only. Quite generally, 


[e’xivco? wool? = J a’xiveoP woor 


|x|<R, 
4 / d’x |V(x)|? |ab(x)|?. (3.3.69) 
|x|2>Rv 
For the second integral we have from (3.3.55) 
d’x |V(x)|? |b(x)? < C2 pI? = C?. (3.3.70) 
|x|2>R 


On the other hand for the first integral on the right-hand side of (3.3.69), we 
use the bound (3.3.68). All told, we finally obtain from (3.3.54) 


|Vbll < Ay || Howl] + BL (3.3.71) 


where 


A, = (a,b, 1,)? (3.3.72) 


B, = (C2 +a,1,) (3.3.73) 
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and we have used the inequality \/c? + c3 < ci + cg for real and positive c; 
and Co. 
Now for each v = 3,2,1, we choose the positive constant b, such that 


0<A, <1 (3.3.74) 


thus obtaining the lower bound for the Hamiltonian as given in (3.3.63). 
For each v = 3,2,1, we spell out a convenient expression for the lower 
bound just derived for the corresponding Hamiltonian. 
To the above end, for v = 3, let 


3/2 \ 2/3 
— f a3pt 
‘Vy = (seca) (3.3.75) 
1/6 
3/2 
A= {—" (3.3.76) 
Ih3C3b3/ 


to obtain A3 = y3/A, and for (3.3.63): 
2 JTL) 
(E + v) > avin, 


, v=3 SOT 
2h A— 73 ( ) 


I 


where now 4 is and arbitrary positive parameter such that \ > 73. For a given 
constant y3, as defined in (3.3.75), A may be then fixed by optimization. 
For v = 2, let 


ag- 
= 3.3.78 
2 4h2C> ( ) 
1/2 
a2U 
7 = | ——— 3 (3.3.79) 
oa :) 


to obtain Ag = 72/7, we then have the lower bound 


2 1 2 
(F KV) > GN TE. “yes (3.3.80) 
m 


ny Y2 


with 7 an arbitrary positive parameter, such that 7 > y2, and may be fixed 
by optimization. 


For v = 1, let 
2 
1/2 
ayy 
Wa (3.3.81) 
a ) 
3/2 
1/2 
ayy 
p= | —— (3.3.82) 
(saa) 


giving A; = 7, /p. This finally leads to the lower bound 
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2 /} 2/3 
(F 4s v) SO aeet (3.3.83) 


2 p—- Yi 


with p an arbitrary positive parameter, such that p > 71, and may be fixed 
by optimization. 

[F] We consider a class of one dimensional potentials. To this end, we 
bound the potential V in the Hamiltonian 


A= — 3.84 
i +V (3.3.84) 
as 
V>-v (3.3.85) 
with v > 0 as defined in (3.3.18). The class consists of any u(x) > 0 such that 
0< / dz v(z) <a < co (3.3.86) 
|z|<R 
where 0 < R < ov, and 
u(x) <Cc for |r| >R (3.3.87) 
where0<C<o. 
For b > 0 
love) dp 1 p 1/2 . 
was f lite] [sm] 8.88) 
oo 27h py il 2 
[i+ oe 


or 


lv (a)? < Jats c 4 6(F) : (3.3.89) 


We use this inequality for |z| < R only. 
Therefore from (3.3.86)—(3.3.88), 


(v) < af () a (c te o/s) (3.3.90) 


= (3.3.91) 


We may choose 


thus obtaining from (3.3.90) 


2 2 
_ pa p 
(c | < (F v) (3.3.92) 


and hence finally leading to the lower bound 
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2 2 
pa Pp 
< : LO. 
(c+f5)<(E sv) (3.3.93) 
[G] As a by-product of counting the number of the bound-states of given 
potentials, the theory of which is developed in §4.5 for Hamiltonians in v = 
3,2, 1 space dimensions we have the following bounds. The proofs are given 


in that section.* As before, we let v(x) be defined as in (3.3.18). 
For v = 3, if 


[ox (v(x))°/? < 0 (3.3.94) 


then (see (4.5.92)) 


(= | v) ze ~ ey [ox (v(x). (3.3.95) 


For v = 2, if 


[ox Wild)? <6 (3.3.96) 
then (see (4.5.97)) 


2 
p 3 2 2 
Ls Soe 3 
(Fav) 2 — Fp d*x (v (x)) (3.3.97) 
Finally, for v = 1, if 
i dx (v(a))?/? < 00 (3.3.98) 


(F rf v) pen le Mees (3.3.99) 


—oo 


In the next section, we provide lower bounds to Hamiltonians for multi- 
particle systems. 


3.4 Boundedness of Hamiltonian From Below: 
Multi-Particle Systems 


In the present section, we derive lower bounds to Hamiltonians for multi- 
particle systems. First we treat systems of interacting particles without im- 
posing any statistics on the particles. This analysis is then followed with some 
estimates involving Coulomb interactions of fermions then of bosons taking 
their appropriate statistics into account. 


* For definiteness, here one may assume that on the negative real axis, the corre- 
sponding Hamiltonians have, at most, only eigenvalues. 
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3.4.1 Multi-Particle Systems with Two-Body Potentials 


We consider a Hamiltonian in the form 


HS oe + 52 Vij (%: — xy) 
w=1 


i<j 


(3.4.1) 


where necessarily the interaction part involves negative contributions, other- 


wise H is a positive operator. 


The two-body potentials V;; are chosen to satisfy the sufficiency conditions 


d®x |Vij(x) |? < aij < 00 
Ix|< Rij 
where 0 < Rij < oo and 
|Viz (x)| < Ciz < 00 for |x| > Rij. 
We note that upon the change of variables 

x1 —-X2= Xi, X1 + Xo = XQ 

in 
Via? = fsa ---axy [Vials — 32)? [Orr ---.xv)P 

for example, we may write 


1 
|View|| 45 [ex d?X» d?x3---d3xy |Vi2(X1)|? 


Xo+X, Ko-—X, x 
2 ’ 2 9 AB, + +> 


ac 
This suggests to set 


Xo+X, Xo- xX 
2 : 2 


v( 


,X3,---,XN) = 0(X), Xo, x3,..., Xn) 


then it is easy to show that the Fourier transforms are related by 


rd 1=.Ppy—P2 Pi +P2 


Y(P1,P2Pa,--- Pw) = 32 9 9 ,P3,---;Py)- 


Consider the integral 


Tia (Xi) = fa XpdPxq-+- dey [BOK Kay...) 


(3.4.2) 


(3.4.3) 


(3.4.4) 


(3.4.5) 


(3.4.6) 


(3.4.7) 


(3.4.8) 


(3.4.9) 
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2 
d?q5 d°ps 7 py 


=f (Qrh)* (20h)? — (2rh)* 


Wai ig, -X1/ng 
ie 5 en * Baap Ps) 


27h) 
(3.4.10) 
We now use the Cauchy-Schwarz inequality with b > 0 in the following 


2 
-1/2 


d3q 2 1/2 
/ 4 [1 +5 (ai) | (a1,de,Ps5--+sPw) 


ia-%1/h | 1 4b (gq? 2 
(20h) : [t+ 0 (ai)"] 


d3q sais ; 
<1f 1 [1 + 5(ai) |®(a1, 42, Ps,-+-+Px)| 


(20h)° 
(3.4.11) 
where 
r = d?q 1 
(2eh)° [1 # b(a2)’| 
= : (3.4.12) 
 Ay/2 7h3p3/4 ~ 


to obtain for I, (X;) in (3.4.10) the Xj-independent bound 


3 3 3 3 
ha(X) <1 [* aa d qa d PS. 5; d Pw 
(2rh)° (Qrh)” (27h) (2h) 


2 iad 2 
x [1 +0 (a?) I® (a1,4e,Ps,---,Pw)| 


= | d’p, d?p, < apy 
(2rh)° (20h)? — (27h)® 


2 


(p: — ps)” + 
. 140(® a (Pee PEEP ps Pa] 


2 
dp d°p P; —p>)° . 2 
=sr i ian uh 1+b — | (P1,Pa,Pas---sPw)| 
(3.4.13) 


where we have finally used (3.4.8). 
Since 
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N p? 
< (mat ma) a (3.4.14) 
we obtain the bound 
Ti (Xi) < 81 {1 +.B (me + ma)? ||How||"}. (3.4.15) 


We use this inequality in (3.4.6) for |Xi| < Rig only. 
From the definition (3.4.7), (3.4.9), (3.4.10) and (3.4.2), (3.4.3) we have 


Mob? < Tar {1 +b (my + ma)’ || Howl? } + C2, (3.4.16) 


or 
|View|| < V Tay2b(m, + m2) || How| + \V/ Cr + Tajo. (3.4.17) 


Accordingly 
N N 
SY viel] < Civil 
i<j i<j 
<Al|Hovll +B (3.4.18) 


giving a Kato bound (see also (3.3.71)), where 


N 
A= ~(m,+my) Ale (3.4.19) 
= 4/2 rh8 


a ee 
4.2 
Bad 4/2 wh8b8/4 toe 


and we have used the value of the integral J in (3.4.12). 
As in (3.3.63), the above then gives the lower bound 


(ne P? +5 66 -»))> SH (3.4.21) 


i<j 


with A, B defined in (3.4.19), (3.4.20) and the relevant constants in (3.4.2), 
(3.4.3). The positive constant b in (3.4.19) is so chosen to make A < 1. 


3.4.2 Multi-Particle Systems and Other Potentials 


A straightforward extension of the basic inequality in (3.1.8) for multi- 
particle states which may include many-body potentials is obtained by intro- 
ducing real vector fields Fj (x1,...,xw),j =1,..., N, of the position vectors 
of N particles, and define the potential energy 
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N 
V (x1,..-,%v) = — 5) Vy + Fj a, -..,Xw)- (3.4.22) 


Positivity implies that 


AV; 2m; 
F. 
(= 7 h i) 


which upon integration by parts yields the elementary bound 


Np? ‘2m; 
2 
(SEE + Vn) > oa EP) (3.4.24) 


2 
>0 (3.4.23) 


Note that the lower bound on the right-hand side of (3.4.24) applies also to 
other potentials U (x1,...,xj) which are bounded below by V (x1,...,xw), 
ie., for which U (x1,...,xn) > V (X1,.--, XN). 

Although the estimate in (3.4.24) is, in general, far from being optimal, 
it is nevertheless useful in establishing boundedness from below for some 
specific interactions. For an application of (3.4.22), (3.4.24), see Problem 3.5, 
and see also (3.4.29)—(3.4.32) below. 


3.4.3 Multi-Particle Systems with Coulomb Interactions 


We consider the multi-particle systems with Coulomb interactions de- 
scribed by Hamiltonian 


SZ Zje? 
w= Se ye ee By 3425) 


t=1 g=1 


consisting of N negatively charged particles of charges e, masses m, and k > 2 


positively charged particles of charges Z; |e|,..., Z% |e], such that 
k 
So Z4=N (3.4.26) 
i=1 


i.e., we consider neutral systems. Here the positive charges are considered to 
be fixed. The Hamiltonian of an atom with atomic number Z is obtain from 
(3.4.25) by deleting the last term in it and setting k = 1, Z, = Z. 

The derivation of the lower bounds for systems of fermions and bosons 
require special tools and will be given in detail in Chapter 14. Here we simply 
record the bounds obtained there. 

For identical spin 1/2 (fermions) negatively charged particles, such as 
electrons, we have for k > 2 the bound® 


° The numerical values 8.310 and 5.235 in (3.4.27) and (3.4.28), respectively, may 
be further improved, i.e., decreased but we will not attempt to do so. 
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1/27? 
met tee 7/3 
> : t : ‘ 4, 
(H) » > 8.31055 N 1 w2% (3.4.27) 


On the other hand, for identical spin 0 (bosons) negatively charged par- 
ticles, we have°for k > 2 


é 1/27 2 
1 
(Hye 5035 a ef e311 4 (32”"] ; (3.4.28) 


An expression for the ground-state energy of an atom, as a function of 
the atomic number Z, will be obtained in Chapter 13 based on physical 
grounds as an extension of the so-called Thomas-Fermi atom in which the 
latter corresponds to the large Z limit. We here derive a conservative lower 
bound to the ground-state energy for atoms. To do this we use the simple 


bound 
ieee oe (2-2) (3.4.29) 


where we have put the nucleus at tis origin, i.e., R = 0 and used the posi- 
tivity of the second term in (3.4.25). A conservative lower bound for an atom 
is —(me*/2h?)Z?. This is easily obtained from the right-hand side of the 
inequality in (3.4.29) by knowing the ground state energy of a hydrogenic 
atom or, for example directly from (3.4.29) by defining 


Ze? x; 
| ena (3.4.30) 
2 |x; 
which gives 
N N 
Z 2 
-\OV,-F=-yo eS (3.4.31) 
rae ay Isl 
and % 
2m? 5 WE? x 
-S> Sas mee (3.4.32) 
j=l 


with Z = N, leading from (3.4.24) to the rough lower bound stated above. 
One may also derive improved bounds (cf., Problem 13.14). A fairly detailed 
investigation of the ground-state energy of atoms, as a function of Z, will be 
carried out in Chapter 13. 


3.5 Decay of Quantum Systems 


Consider a Hamiltonian H which is bounded from below by a finite num- 
ber L H; 
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(b|H|b) > La (3.5.1) 


for normalized |7) (§3.2), where Ly is often negative. One may then conve- 
niently introduce the Hamiltonian H’ = H — Ly and write the inner product 


(|) as (see also (§3.2)) 


(|v) = | “aa Gb|E (A — EI) =1 (3.5.2) 
where F(\) 
(W|5(0— HI) = EO (3.5.3) 


denotes the probability density, per unit energy, of finding the energy of the 
system in the state |W) around the value X in the energy scale translated by 
the amount —Lyz. 

In particular, if ya (A) denotes the characteristic function of a set A, i.e., 
xa (A) = 1 if A € A, and ya (A) = 0 if A g A, then 


| © AQF) (3.5.4) 


represents the probability of finding the energy of the system in the state |w) 
to have values in the set A in the shifted energy scale. 
Given that a system in the state |w) = |w(0)) has developed in time to 
the state 
[h(t)) = e PA" fb) (3.5.5) 
a quantity of physical interest is the probability of finding the system in the 
same state |w) at time t. This is given by (§1.2-§1.5) 


Tr [lb) Hl 1b @) OL] = 11d). (3.5.6) 


According to (3.5.2), (3.5.3), the corresponding amplitude to the above 
probability may be written as 


(|p (t)) =e #h#/F A(t) (3.5.7) 
where ay . 
A(t) = if : en ee (3.5.8) 
as a Fourier transform with the constraint 
F(A) =0 for vA <0. (3.5.9) 


Since |.A(t)|? denotes the probability that the system at time t is found 
in its initial state, it is also referred to as the survival probability of the 
system. Guided by this interpretation, one may formally define 


6 | A(t)|? equivalently represents the following probability. Given that the system 
is in the state |), it represents the probability that it was in the state |q (t)) at 
time t < 0. 
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sf. AAprsr (3.5.10) 


as a measure of the lifetime of the system in the state |). 

An interesting application which follows from (3.5.10) is the following 
one. Suppose that the energy of the system is confined to some interval A of 
length A (A). That is, F(A) =0 for A ¢ A, and from (3.5.2), (3.5.3) 


ie a xa (\) F(A) =1 (3.5.11) 


in the notation of (3.5.4). An elementary application of the Cauchy-Schwarz 
inequality then gives 


1=([° Mure) <(fo Awe) (f Seon) 


= oer 2T (3.5.12) 
where we have used the property 
i — |F(A)|? = - dt|A(t)]’. (3.5.13) 
From (3.5.12) one then obtains an energy-time uncertainty principle, 
h 
5 < A(A)T. (3.5.14) 


That is, the shorter the energy “width” A (A) of the state |y) is, the longer is 
its lifetime against decay. In the limiting case of zero “width” A (A) — 0, one 
obtains a non-decaying system of infinite lifetime! The mere fact that atoms 
in excited states decay from one energy level to a lower one, is an indication 
of the finite “widths” of such energy levels. 

Some properties of A(t) which follow directly from its definition in (3.5.8) 
are 


A(0) = 1 (3.5.15) 
where we have used (3.5.2), (3.5.3), 
A* (t) = A(-t) (3.5.16) 
and formally 


ihA’ (0) = [ \dd F (A) (3.5.17) 


n|A’(t)| < [rare (3.5.18) 
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where we have used (3.5.9). The integral on the right-hand sides of (3.5.17), 
(3.5.18) denotes the mean energy in the state |~), in the shifted energy scale. 
Upon using the notation 


A*(t)A(t) = P(t) (3.5.19) 


for the survival probability in (3.5.6), we note that 


P(t) = —A’(-t) A(t) + A(t) A’(t) (3.5.20) 


—P(t)) =0 (3.5.21) 


for the derivative of the survival probability at the origin. 

In particular, one learns from (3.5.21) that the familiar decay law 
exp(—I't/h), for t > 0, cannot hold near the origin t — +0. A concrete 
example of a physical situation, where (3.5.21) may be explicitly verified is 
readily given. Consider a particle of spin 1/2 of magnetic dipole moment tu 
in a uniform time-independent magnetic field B = (B,0,0). Then as a spe- 
cial case of an analysis that will be given later in §8.8, leading to (8.8.47), 
the survival probability P(t) with the initial state of the spin in the state 
|+1/2,z) is given by 


P(t) = cos? (5) (3.5.22) 
and in the neighborhood of the origin, this probability has the behavior 
pi) 1-H EE (3.5.23) 
consistent with (3.5.21) at t = 0." 
The exponential decay law with the amplitude A(t) given by 
A(t) = eot/heTt/2h (3.5.24) 


for some Ay, cannot certainly hold true for all t. In particular it cannot hold 
true for t > oo. The reason is that it yields a density F(A) which is non-zero 
for A < 0, for arbitrary large |A| which is inconsistent with the boundedness 
of a Hamiltonian from below.® This is seen by the explicit evaluation of the 
integral 


” This same behavior in (3.5.23) follows more generally in an oscillating magnetic 
field B = (Bcoswt, Bsinwt, Bo) as will be seen by examining (8.8.45) later on. 

8 The expression in (3.5.24), as it stands, is likewise inconsistent with analyticity 
properties when extended to the complex time domain t — t + ir. 
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Co 
= / dte Pot/Pe-Tlt|/2hgita/h (3.5.25) 
—co 


which gives the familiar Breit-Weisskopf-Wigner expression 


BOY rp 
2h — m (A— Xo)? + T?/4 


(3.5.26) 


which, as mentioned above, does not vanish for A < 0, for |\| arbitrarily large. 
Thus the exponential decay law cannot be true for all ¢. 

On the other hand, if one chooses the density in the form in (3.5.26) to be 
true only for \ > 0, then one has to normalize it first, obtaining the density 


-1 
Fo) _ oJ T/2 [(A = Ao)? +1?/4] . asa 
2rh 0, \<0 


(3.5.27) 


where 


2 
C= _ +tan7* (=) : (3.5.28) 
A contour integration in the complex A-plane, enclosing the pole at A = 
Ao — iD /2, Ao > 0, for t > 0 gives (see Problem 3.6) 


A(t) /C = me~Pot/Re-Tt/2h + R(t) (3.5.29) 


where for t — co i 
|R(t)| = o(7) : (3.5.30) 


That is, for t — oo, |A(t)| would vanish slower than an exponential law for 
t— ow. 

The exponential decay law, however, is not ruled out, however, for t not 
close to the origin and for ¢ not in the truly asymptotic region |t| > oo. An 
interesting example of an exponential decay will be worked out in detail in 
88.1 (see (8.1.94)) for a two-level system interacting with an infinite number 
of harmonic oscillators. 

The above analyses have shown the following typical behaviors P(t) = 
1 — O(t?) for t ~ 0, P(t) = O(1/t) for t > oo for the survival probability, 
and that the classic exponential law for intermediate t¢ is not ruled out. 

In the appendix to this section, we prove a theorem due to Paley and 
Wiener® tailored to our physical problem at hand. Given that the Hamil- 
tonian of the physical system is bounded from below and |A(t)| is square- 
integrable then we will see that 


qe All OM 
fia Roses: (3.5.31) 


° Paley and Wiener (1934). 
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To gain further insight into the finiteness property in (3.5.31), note the 
following. For any real c > 0, 


|Inc| = 20(c— 1) Inc—Inc (3.5.32) 


where O(x) is the step function, i-e., O(x) = 1 for x > 1, O(x) =O fora <1. 
Also 
O(c—1)Inc< 2 (3.5.33) 


|Inc| < 2c? —Ine (3.5.34) 
and with c = |A(t)|, we obtain from (3.5.34) upon integration over t 


jm lA@ll a JAI i. In|A(@)| 
[i« sre ta) A err aad as ee (3.5.35) 


Since 


a ae AOr < a dt | A(t)| (3.5.36) 


1+? 


then in order to establish (3.5.31), with the square-integrability condition of 
|A(t)|, satisfied, it remains to obtain a finite lower bound (number) for the 


integral 
ee | In|A(t)| 
dt 5. 
- Le? 2881) 


n (3.5.35). 
A question that arises in reference to (3.5.31), (3.5.35), (3.5.36) is the 
following: “If we have square-integrability of |A(t)|, why bother with the lower 
bound on the left-hand side of (3.5.35) — that is with the expression in 
(3.5.31)?” The answer is that the condition in (3.5.31), via (3.5.35), follows 
if it is given, in particular, that the Hamiltonian of the system is bounded 
from below. The exponential law in (3.5.24) here comes to the “rescue” as a 
counter example for which the spectrum, as seen in (3.5.26), is unbounded 
from below. It is easily checked that |A(t)|, with A(t) in (3.5.24), is square- 
integrable, while 
Im|A@I| _ lel 2 
1 te 1+ t? 2h 


is, clearly, not integrable. The integral on the left-hand side of (3.5.31) pro- 
vides a rigorous constraint on A(t), and hence on the survival probability 
P(t), to be satisfied! for a physically consistent theory of quantum decay in 
which the Hamiltonian is bounded from below. 


(3.5.38) 


1° The importance of the Paley-Wiener theorem was emphasized by Khalfin (1957). 
A general lucid overview treatment by Fonda e¢ al. (1978) should be also noted. 
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Appendix to §3.5: The Paley-Wiener Theorem 


To prove! the finiteness condition in (3.5.31), it is more convenient to 
start from the complex conjugate 


A*(t)= / FF (Ajerwe (A-3.5.1) 


instead of A(t). 
Consider the following transform 


Pye. Ber 
Lr) = dt A-3.5.2 
C7) =e (¢-—v)2477? ( 3.5 ) 


the so-called Poisson integral formula for a half-plane, where t’, r’ are reals. 
The part of the integrand multiplying A*(t) in (A-3.5.2) has the following 
properties: 


1 7 


m(t—t/)?4+7 d(t — t’) for tT’ > +0 (A-3.5.3) 
formally, and 
1% ‘ 7! a n F 
ee ee (A-3.5.4) 
1 co z'eliat : ante 
__ jia(t’+ir’) 
™ wet : @—v24+7? ese) 


for a > 0. 
Accordingly, from (A-3.5.1), (A-3.5.2), (A-3.5.5), 


oo lo) I airt/h 
1,7) = | dA F(a) | dt ’e 


= nh _oo T (t—t/)2 +7? 
_ dn iX(t! Hit’) /h 
=f apt Ae 
= B(t' + ir’) (A-3.5.6) 


and the transform in (A-3.5.2) has the remarkable property of extending 
the Fourier transform of A*(t) to the complex domain thus introducing the 
function of the complex variable t + ir: B(t +ir). 

Since F(A) = 0 for \ < 0, however, we have for 7 > 0 


= + \j2 x dy 2.—27A/h 
/ dt|B(t+ir)| =} ap FO) e 


—oCo 


11 The proof of this theorem may be omitted at a first reading. 
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= : salPoyr =f atiaw@r (A-3.5.7) 


and (A-3.5.6) defines the Fourier transform in the complex-time upper plane 
7 > 0 and is analytic in this region.” 
From (A-3.5.6), (A-3.5.2), we rewrite 


co * / 
B(t +ir') = -| eal 
Tv 


-co Gp ar (A-3.5.8) 


and consider the transformation of the upper complex plane into the unit 
circle 


= rel? (A-3.5.9) 


where 


zZ=t +ir’ (A-3.5.10) 
and r < 1. The boundary of the unit circle is given by r = 1. In reference to 
the real variable ¢ in (A-3.5.8), we transform the integration along the real 
axis to an angular one along a unit circle defined by the transformation 


rt agi. (A-3.5.11) 
t+i1 


The following are easily established 


dt sin 0 


dé = were a eT (A-3.5.12) 
2r sind Pete 
oe ee a A-3.5.13 
1+r2—2rcosg’ * ~ 147? —2rcosd : 
1+? — 2rcos(¢— 0) 
faeay) Pie 2 _ 9 A-3.5.14 
( ar | z'| (1+ r2 — 2r cos ¢)(1 — cos 8) ( 
and hence 
14 rete 
a ui (A-3.5.15) 


(t-t/)?+7 9 (1+ r? — 2rcos(¢— )) 


Therefore denoting A*(t), in terms of the new variable e'®, by a(e'®), and 
B(t' + ir’) in terms of re by b(re!#), we have from (A-3.5.15), (A-3.5.12) 
and (A-3.5.8) 


12 A classic reference on the connection between analyticity and the Fourier trans- 
form is Titchmarsh (1937). For a relatively modern treatment, see, Rudin (1966). 
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ea es 1—r? 
id) _ = id e 
b(re'?) = an : dé a(e’’) (i Ee ree 0) (A-3.5.16) 


showing that b(re®) is the so-called Poisson integral (for a circle) of a(e’®), 
where the former is obtained for r < 1, i.e., inside the unit circle, from an 
integral of a(e!®) along its circumference. 

Using the integral 


L [ao erie =5 (A-3.5.17) 
Qn Jo (1+ r2—2rcos(d—6)) — _ 


it is easily shown (see Problem 3.9), that 


21 20 
i. do |b(re'?)|" < ‘i dd |a(re®)|”. (A-3.5.18) 
0 0 


To establish the finiteness condition in (3.5.31), we consider the integral 


1 27 : 
=| d¢@ In |b(re'*)| . (A-3.5.19) 


Using the notation pe'? = w, p < r we note that although b(w) is analytic 
in the circle of radius r, it may have zeros, and in (A-3.5.19), we are interested 
in In |b(w)|. Accordingly, suppose that b(w) has zeros at: 


w=0O oforder m, 


W = Q1,.--,Qp of orders c1,..., Cn such that jai] <r,...,lan| <1, 
W = On41,---,@n Of orders Cn41,---,¢n Such that Janiil > 7,...,/en] > 7 
and conveniently define!? 


7 ‘wor \% au a \% 
h(w) = n(#5) (1 (=) (A-3.5.20) 


=1 i=n+1 


thus removing the zeros from b(w). 
Upon setting a; = |a;|exp(id;), we note that for the expression in the 

j-product in (A-3.5.20) 

azw — r? 

r(w — a5) 


=1. (A-3.5.21) 


p=r 


Directly from (A-3.5.20), 


'3 Cf. Rudin (1966), pp. 299-300, except here we are allowing zeros at the origin 
as well. 
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(w) ) n r Cj 
= a A-3.5.22 
wo) = ((2] Iz (A-8.5.22) 
and for p =r, w = rexpi¢, 
N 
In |h (re'®) | =In |b (re'®) | —minr — S- cin 1 — ei(-8))  (A-3.5.23) 


i=n+1 


where in writing the latter we have used (A-3.5.21). 
Since in (A-3.5.20), we have removed all the zeros of b(w), we have from 
Cauchy’s theorem 


ee aa In |A(w 
a dé In |h (re'* ==. paw nd = In|A(0)|. (A-3.5.24) 
Qn 0 
Using the integral!4 
20 ; 
| dé In 1 2 el(-9) =0 (A-3.5.25) 
0 
we have from (A-3.5.22)—(A-3.5.25), 


‘a bea ; b(w) 
a dé In |b (re'*) | =n 


+mlinr. (A-3.5.26) 


+ Yam] = 


w=0 


This is known as Jensen’s formula. 

Since c; > 1, |r/a;| > 1 in (A-3.5.26), we have the following bound 
b 
“w) +milnr. (A-3.5.27) 


w=0 


1 20 : 
= id 
=f d¢n |b (re'®)| > In 


Using (3.5.34) in conjunction with (A-3.5.27), with c = |b (re'®)| gives, 


as “ do [In |b (re'®) || < é i‘. d@ |b (rel®) |? 
27 0 = T Jo 
In ee) minr. (A-3.5.28) 
~ w=0 


On the other hand, we may use (A-3.5.18) to obtain from (A-3.5.28) 
1 27 , 1 20 ‘ 2 
xf delin|o (rel < =f aoa (e%)| 


™ Cf. Rudin (1966), pp. 299,300, and Problem 3.10. 
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b(w) 


wm 


in| minr. (A-3.5.29) 


w=0 


Upon taking the limit r — 1, and using the t variable in (A-3.5.11), 
(A-3.5.12) we have from (A-3.5.29) 


co Oe 7 
/ a B4Oll <5 / Fy ee emg (A-3.5.30) 
ae 1+ 72 wen 14+ w™ aa 
and since 2 
* AO! if * 2 
Z 3.5. 
fies <f jaw) (A-3.5.31) 


the square-integrability of |A(t)| implies the finiteness of the integral in 
(3.5.31) by noting finally that from (A-3.5.27) we have the lower bound 


fae a) >7ln oe 


1+? ue 


(A-3.5.32) 


—Cc w=0 


is finite as a zero of order m of b(w) at w = 0, if there is one, has been 
removed. 

The key assumption in the above analysis is the boundedness of the Hamil- 
tonian from below, i.e., that |Ly| is finite. As a matter of fact, if H is not 
bounded from below, then we cannot define the Hamiltonian H’, as done in 
the beginning of §3.5. In the latter case, if we define 


Wey = fore (A-3.5.33) 
with F(A) not zero for A < 0, then instead of (A-3.5.7), we obtain 


jf clpesine =f A (F(A))e 


—b 
> ebar/e f = I BO)|* (A-3.5.34) 


where 0 < b < a for which F(A) 4 0 in the interval (—a, —6). Clearly for 
T positive and arbitrary large, the right-hand side of the above inequal- 
ity (A-3.5.34) increases without bound, destroying the square-integrality of 
|B(t + ir)| in t as opposed to the case in (A-3.5.7). 


Problems 


3.1. Show that the integrals (3.3.23) and (3.3.24) are equivalent. 
3.2. Derive the expressions in (3.3.25) and (3.3.30). 


3.3. 


3.4. 


3.5. 


3.6. 


3.7. 


3.8. 
3.9. 


3.10. 
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Apply the formula (3.3.77) to find lower bounds for a Hamiltonian 
with potential V(x) in (3.3.8) for A3 = 0, Ay < 0, A2 < 0, 6, > 0 in 
v = 3 dimensions, and compare your bound with the one in (3.3.15). 
Use the general bound in (3.4.21) to find a lower bound to the Hamil- 
tonian in (3.4.1) with 


Gmm,; 


Vig (Xi — Xj) = 
il i) eee 
for the Newtonian gravitational potential. 

Introduce a vector fields Fj (xi,...,xw; Ri,...,Rx), for j = 
1,...,N,Ry,..., Rez fixed, to generate the potential energy for matter 
in (3.4.25) by applying the definition of the potential in (3.4.22). Us- 
ing your expression for F; obtain a lower bound for the ground-state 
energy by the application of (3.4.24). This estimate is rather rough in 
comparison to the ones in (3.4.27), (3.4.28). Is it possible to choose the 
position vectors Rj,..., Rx of the nuclei optimally to get an improved 
estimate? 

Verify the normalization of the density in (3.5.27)/(3.5.28). By a 
contour integration in the complex A-plane enclosing the pole at 
A= Xo — iD /2, Ao > 0, for t > 0, show that A(¢) in (3.5.8) is given as 
in (3.5.29) and obtain an expression for |R(t)|. What is the next order 
behavior to 1/t for t > co? 

Suppose that the density in (3.5.26) is non-zero only for 0 < A < «¢, 
where c is a finite constant. Does the amplitude A(t), in this case, 
involve an exponentially damping term? 

Derive the expressions for the integrals given in (A-3.5.4), (A-3.5.5). 
Prove the inequality in (A-3.5.18). [Hint: Note, in the process of the 
demonstration, that 


* (il io’ iO) «(10 id) |2 ei? 
a*(e'*) a(e™) + a(e%) a*(c'®’) < |a(e®)|” + fale”) 
and use (A-3.5.17).| 


It may be amusing to explicitly evaluate the integral in (A-3.5.25) and 
show that it is equal to zero without using contour integration. 
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Spectra of Hamiltonians 


The chapter is involved with several aspects concerning the spectra of 
Hamiltonians. In §4.1, §4.3, §4.4, the nature of the spectra of some general 
classes of Hamiltonian is determined under sufficiency conditions satisfied 
by the underlying potentials. These sufficiency conditions are readily verified 
and if satisfied, the nature of the spectra may be inferred. This is obviously 
very important if the exact solution of the problem in question is not known. 
Even if the solution may be explicitly obtained, the knowledge of the nature 
of the spectrum of a Hamiltonian prior to its determination is quite useful. 
Bound-states are studied in §4.2. In this section sufficiency conditions are 
given for their existence as well as for their absence for various physical 
systems. In §4.5, we carry out an analysis for counting the eigenvalues (if 
any) of given Hamiltonians. This investigation is then used in §4.6 to derive 
lower bounds to the expectation values of kinetic energy operators for single- 
and many-particle systems. The results obtained in these two sections will 
find important applications to multi-electron atoms in Chapter 13 and to the 
problem of the stability of matter in Chapter 14. The final section (§4.7) deals 
with the role of supersymmetry in solving the eigenvalue problem and in the 
construction of supersymmetric Hamiltonians. In this section, we make use 
of the general properties of supersymmetry transformations obtained in §2.9. 

Before getting into the details of this chapter we note the following. Sup- 
pose that for some vector |y), in the domain of a given Hamiltonian H, 


(b|H|p) <0. (4.1) 


In §4.1, for example, we give sufficiency conditions to be satisfied by the po- 
tential in question such that if the negative spectrum of H is not empty, 
then the latter consists of eigenvalues of finite degeneracy (the discrete spec- 
trum). The condition in (4.1) would then imply that the negative spectrum 
of such a H is not empty consisting of a discrete one, as part of the spec- 
trum. This property of the spectrum follows by noting that by the spectral 
decomposition of H (§1.8, (1.8.15)), (4.1) may be rewritten as 


182 4 Spectra of Hamiltonians 
[ dalPnvrel? <0 (4.2) 


thus admitting negative (A < 0) eigenvalues for H. 


4.1 Hamiltonians with Potentials Vanishing at Infinity 


In this section we are concerned with Hamiltonians 


p? 
H = —+V(x) (4.1.1) 
Qu 
in vy = 3,2,1 dimensional spaces, with potentials satisfying the following 
sufficiency conditions. For any 0 < R < ov, 


/ dx |V(x)|2 < 00 (4.1.2) 
|x|<R 
and 


Such potentials are said to be locally square-integrable with vanishing prop- 
erty at infinity. For v = 3, the Coulomb potential te?/|x|, for example, lies 
in this category. 

Since condition (4.1.3), in particular, implies that for a sufficiently large R, 
we may find a finite positive constant C such that |V(x)| < C for |x| > R, the 
analysis given in class [F] of §3.3 shows that such Hamiltonians are necessarily 
bounded from below. We will denote the lower bound of such a Hamiltonian by 
—y (= Ly in our earlier notation in §3.2), emphasizing its strict negativity 
if it arises. 

For a two-particle system, for example, with a reduced mass pu, when the 
two particles are widely separated, condition (4.1.3) means that the interac- 
tion between them goes to zero, while the relative kinetic energy may take 
on any value from zero up to arbitrarily large positive values. Accordingly, 
the spectrum would include a continuous one on the positive real axis [0, 00). 

Of particular interest is the situation when —y is strictly negative. We 
would then have, in addition to the continuous spectrum discussed above, 
negative eigenvalues falling in the interval [—24,0). What is interesting, as 
the following theorem shows, is that these negative eigenvalues are at most 
finitely degenerate and there is no continuous spectrum on the negative real 
axis. In the introduction to this chapter, we have seen that if one can find 
any vector |~) such that (W|H|w) < 0, then —y is strictly negative. Needless 
to say, as mentioned earlier, in order for the Hamiltonian to have part of its 
spectrum consisting of negative values it is necessary that the negative part 
VO(—V), as defined in (3.3.18), of the potential V is non-vanishing, otherwise 
the Hamiltonian is a positive operator. 
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Theorem 4.1.1 

A Hamiltonian H in (4.1.1) with a potential V(x) satisfying (4.1.2), (4.1.3) is 
bounded from below. It necessarily has a continuous spectrum on the positive 
real axis [0,00) with no continuous spectrum on the negative real axis. It has 
also no eigenvalues of infinite degeneracy. In addition, if the lower bound —ly 
is strictly negative, it has also negative eigenvalues (a discrete spectrum) of at 
most finite degeneracy. [By finite it is meant to be non-infinite thus including 
the possibility of non-degeneracy.] 


The importance of this result in quantum physics is obvious and cannot 
be overemphasized. 
To establish! this result we write 


1 1 ; 1 1 


= V 4.1.4 
HS eo.’ dpe Hake’ Vig 23 ee 

where & is any fixed real negative number such that 
€) < min (0, —@y]. (4.1.5) 


[Actually Proposition 4.1.2 below, alone, implies that —f4 < 0, and hence 
we may take &) < —fy.] As before Hp = p?/2u. 
We first need the following two propositions. 


Proposition 4.1.1 
For any infinite sequence {|fn)} of orthonormal vectors, 


1 1 
V fr 0 for no (4.1.6) 
| [+ |€ol] [Ho + |&ol] 
where &) as given in (4.1.5). 
To prove this we define 
V(x), x] <L 
Vz (x) = (4.1.7) 
0, |x| > L. 
We note that we may explicitly write 
1 
x | Vz —— fn ) = Vi (X) (®x| fn 4.1.8 
(x|Veeg tn) = Vi) alfa) (4.18) 


where we conveniently set 


! The proofs of this theorem and of the following two propositions may be omitted 
at a first reading. 
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d’p e7ip:(x—-y)/h 
®x(y) = D 4.1.9 
(y) i (27h) E +| él (4.1.9) 


and hence 
alfa) = fary © (9) fn). (4.1.10) 
Also 
d’p 1 
I|B,.||? = d’y |®,. (y)|? —= 7 5) 
| Irae [EE + leo 
=I”) <0 (4.1.11) 


for v = 3,2,1, and the latter norm is independent of x. Accordingly 


|(®acl fn)? < [|Gxcll? [[fall? = 2 < 00 (4.1.12) 


and is bounded independently of all x and n. 
Also we note from Bessel’s inequality (1.7.4) that 


So Gx Fi) /? < ]@xl|? = 1 < co. (4.1.13) 
j=l 
The finiteness of the right-hand side of this inequality for all x implies that 
Jim [(®xl fe)? =0 forall x (4.1.14) 
for the converges of the series on the left-hand side of (4.1.13) for n > oo. 
The boundedness of |(®|fn)|’ independently of x and n, as given in 


(4.1.12), and the obvious square-integrability of Vz (see (4.1.2), (4.1.7)), allow 
us to take the limit n — oo inside the integral: 


jim, far VEG0)? (Oxf)? = fa’ [Vi]? (tim, (xl fn))?) = 0 


(4.1.15) 
where we have used (4.1.14). 
Finally we consider the bound 
1 1 1 
V nil < V -YV;, ms 
| [A+ [éoll \ [Ho + [eal <Te Tal i +) Tip + [eal 
1 1 

4 {| (4.1.16 
[ol — eu | Teel 


and the bounds 
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1 1 
|v ") Tig eal <a BE (4.1.17) 
1 : . : 1/2 
Mgrres ena - (| x |Vi (x)?||(@xlfn)| ) (4.1.18) 


to obtain from (4.1.15), upon taking the limit n — oo, that 


. 1 1 
ne | [A+ léoll (Ho + léol 


fn |V(x)|. (4.1.19) 


1 
<r :SOomnx 
[Ifo] (ISo] — 2zr)) IxI>z 
The left-hand side of this inequality is independent of L. Hence finally 
upon taking the limit ZL — oo and using (4.1.3) the result given in (4.1.6) 
follows. 


Proposition 4.1.2 

The spectrum of the operator (H —&)~+ includes a continuous spectrum con- 
sisting of the interval [0,1/| |] and does not contain eigenvalues of infinite 
degeneracy. In particular, the spectrum of H includes a continuous spectrum 
consisting of the positive axis [0,00). 


To establish the validity of the proposition, we use the notations 


1 1 
Hy — &0 


(4.1.20) 


Clearly, the operator B has only a continuous spectrum consisting of the 
interval [0, 1/|&o]. 

As a hypothesis, suppose that some real number Ao belongs to the essential 
spectrum, if not empty, of the operator (H — &))~'. [That is, \o belongs to its 
continuous spectrum or is an eigenvalue of infinite degeneracy.| According, 
to Proposition 1.8.1, there must then exists an infinite sequence {|fn)} of 
orthonormal vectors such that 


(A+ B- Ao) fall +0 for n— co (4.1.22) 


where are have used (4.1.4), (4.1.20), (4.1.21). 
Hence from the inequality 


I|(B— Ao) fall < (A + B— Ao) fall + Afall (4.1.23) 
together with (4.1.6), (4.1.22), we conclude that 


(B= Ao)fall +0 for nm — ov. (4.1.24) 


186 4 Spectra of Hamiltonians 


That is, we may infer from Proposition 1.8.1, and (4.1.22), (4.1.24), in 
particular, that any real number Ap which belongs to the continuous spectrum 
of (A + B) necessarily belongs to the continuous spectrum of B as well, 
recalling that B has no, so-called, eigenvalues of infinite degeneracy. Because 
of the latter property of the spectrum of B we also conclude that (A + B) 
cannot have such eigenvalues of infinite degeneracy. 

Conversely, suppose that some real number Xp belongs to the (continuous) 
spectrum of B. That is, there exists an infinite sequence {| f;,) } of orthonormal 
vectors such that 


(B= o0)fnl| ~ 0 for n—-oo. (4.1.25) 
Hence from the inequality 
(A+ Bo) fall < |Afnll + (B — Ao) fall (4.1.26) 
together with (4.1.6), (4.1.25) we may infer that 
(A+B -o)fn|| -0 for now. (4.1.27) 


That is, any real number Ao in the (continuous) spectrum of B necessarily 
belongs to the continuous spectrum of (A + B) and from this, together with 
the conclusion following (4.1.24), we may infer that the continuous spectrum 
of (A+ B) consists of the interval [0, 1/|& |]. Also that (A+ B) has no eigen- 
values of infinite degeneracy. This completes the proof of the proposition by 
finally noting that the continuous spectrum of H then consists of the positive 
axis [0, co) alone. 

To complete the proof of the theorem, it is instructive to reconsider the 
expression for the resolvent of H. To this end let 


1 


R = —_ 4.1.28 

(60) = Foe (4.1.28) 
with £9 defined to be fixed as before. 

Suppose A is some complex number, then the following expressions are 


easily derived: 


aX 1 
Ro) Ry F > ea 
and 

: =-\-» : : (4.1.30) 

[R(&0) — 5] [H — (fo + A)| 

The equality in (4.1.29) leads to the bound: 
1 1 1 1 

lear |< aretaleee | aa 
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On the other hand, (4.1.30) gives the bound: 


1 


Sable awoexn 


1 
[RG)— a] 

From (4.1.31), (4.1.32), we may then conclude, with now specialized to 
be real and not equal to zero, that [H — (€) +A)]~’ does not exist if and 
only if [R(o) — 1/A]~" does not exist. Also that [H — (€) + \)]~* exists as 
an unbounded operator if and only if [R(€)) — 1/A]~* exists as an unbounded 
operator. 

In particular, the above states that if a real number 1/A belongs to the 
discrete spectrum of R() then the real value (A — |&5|) necessarily belongs 
to the discrete spectrum of H and vice versa. Since the spectrum of H is 
empty in the region (—oo, —é;7), it is necessary that \ > 0. On the other 
hand, the statement in Proposition 4.1.2 does not rule out the possibility of 
having eigenvalues (of at most finite degeneracy) for R(£) for 1/A < 0 or 
for 1/A > 1/|& |. These facts together with the necessary condition > 0, 
just given, show that if —y is strictly negative, i.e., the spectrum of H in 
[—£17,0) is not empty, then H has also a discrete spectrum (eigenvalues of at 
most finite degeneracy) falling in the interval [—f7,0). This completes the 
proof of the theorem. 


s| (4.1.32) 


4.2 On Bound-States 


4.2.1 A Potential Well 


An elementary though important example of a potential falling in the cat- 
egory of Theorem 4.1.1 is the one-dimensional potential well problem defined 
by: 

—h? d2 


H= mn de® + V(z) (4.2.1) 
where 
—Up, ea < L 
V(x) = (4.2.2) 
0, |el>L 
Up > 0. 


According to the theorem, this Hamiltonian has part of its spectrum a 
continuous one on the positive real axis [0,0o). It is instructive to consider 
the discrete spectrum in the light of Proposition 1.8.2. 

To the above end, we set 


2m 


2m 
oe (Uy ~ |B) =K?, SB) = #? (4.2.3) 
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in the Schrédinger equation 


—h? d? 
tis 


V(e)| ole) = Bute) (4.2.4) 


with E < 0, corresponding to a bound-state, to obtain 


Fe + | p(x)=0, || <L (4.2.5) 
d2 
is - | p(z)=0, |x| >L (4.2.6) 


having, respectively, even and odd solutions 


w(x) = Acos(Ka), |x| < L 


(4.2.7) 
p(a)= Bel, jal > L 
and 
v(x) = Csin(Ka), |z| < L 
w(z) = De“*ll, =a > L (4.2.8) 


w(a) = —De-*l#l, x < -L. 


The boundary conditions implied by the continuity of (x) /q(a) at |a| = L 
lead immediately to the equations: 


(4.2.9) 


and 
(4.2.10) 


corresponding, respectively, to the even and odd solutions, where we have set 


WmnUoL? 
aa Oe — or ee (4.2.11) 
The energy levels may be thus written from (4.2.3), (4.2.11) as 
&? 
E=-Up f - 5 (4.2.12) 


We may quite generally write 


-=N+te (4.2.13) 
TT 
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Te eo Se 


Fig. 4.1. The a and lower curves are given, respectively, by f (6) = 
Ja? — & o/s, of = —€/\/a? — &, while the eigenvalues are given by E“ = 
—Up(1—€2/a?), ee €, denoting critical values corresponding to the intersec- 
tions of the former curves (denoted by crosses on the graphs) with the tané 
curves. One may generally write a/7 = N +, where N is a non-negative in- 
teger and 0 < e < 1. For 0 < e < 1, the number of eigenvalues corresponding 
to the upper curve is (N + 1). For 0 < ¢ < 0.5, the number of eigenvalues 
corresponding to the lower curve is N, while for 0.5 < « < 1, itis (N +1). For 
€=0,a=Nz7, N > 1, the largest FE value is at the bottom of the continuous 
spectrum, with 2N eigenvalues falling below it. The Figure is of a qualitative 
nature only and is not based on actual numerical values. 


where N is a non-negative integer and 0 < « < 1. The bound-state energies 
F are obtained as described in Figure 4.1. 

For 0 < « < 0.5, the total number of bound-states are no = (2N + 1), 
while for 0.5 < ¢ < 1, the number of bound-states are ng = (2N + 2) — See 
Figure 4.1. 

From (4.2.11), (4.2.12) we also have the following expression for the 
bound-states 


Pee Ss 
Bp =—Uo+ 55, k= 1,...,00 (4.2.14) 
where £1,...,&n,. are the € values corresponding to the crosses (intersections) 
in Figure 4.1. The eigenvalues Fy,...,£,, are arranged in a non-decreasing 


order. In the notation of Proposition 1.8.2, E, = A,(H), with E,, falling 
below the bottom of the continuous spectrum of H. 

For ¢ = 0, :, = a = No (see the Figure) and E,, = 0 is at the bottom 
of the continuous spectrum of H with no — 1 eigenvalues falling below E,,, 
in conformity with Proposition 1.8.2. 

The limit a — 0 will be considered below. 

Quite generally, we note that for N > 1, for example, 
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ee (4.2.15) 


Wha 


C= 


The above example will be useful in studying the spectrum of Hamiltoni- 
ans with potentials increasing with no bound at infinity in $4.4. 


4.2.2 Limit of the Potential Well 


In reference to (4.2.13), (4.2.11) if 
a 
—=¢, 0<e<05 (4.2.16) 
7 


the analysis given in the figure caption of Figure 4.2, shows that there is only 
one bound-state corresponding to an even solution. 
We consider the limit ¢ — +0, i.e., a — +0, with Up — o0, LD — 0 such 
that 
2UoL = X (4.2.17) 


is a finite non-vanishing positive constant. 


From (4.2.3), (4.2.11) and (4.2.9) we then have from 


2m |E| 


that is in the above limit of Up large and L small 


ay 
, Th = (4.2.19) 
or that 2 
E= oe (4.2.20) 
The eigenstate corresponding to (4.2.20) is from (4.2.7) 
w(a) = Vk e*el (4.2.21) 


where k = \/2m|E|/h. 


4.2.3 The Dirac Delta Potential 


It is instructive to compare the limiting solution (4.2.20), (4.2.21), for 
Up — oo, LE — 0 and X a finite non-zero positive constant as defined in 
(4.2.17), with the one obtained directly from the Dirac delta potential 


V(x) =—Ad(x), A>. (4.2.22) 
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First in reference to the inequality (3.1.8) with v = 1, we set 


g(x) jal (4.2.23) 
hence 
O48 
7 a ame 
0 
= 7 1@(0) - e(-2) 
x 
= 2X 6(zx) (4.2.24) 
where O() is the step function. 
Hence from (3.1.8), with x2?g?(x) = A?, we have 
mAr2 py 
< 2. 
oar < (E Ble) (4.2.25) 


giving the lower bound —m.A?/2h? for the spectrum. 
Now we solve for the bound-state problem corresponding to the equation 


Ae ge 
-s - 4(a)| W(x) = Ev(2) (4.2.26) 
with FE < 0. For |x| > 0, this gives the properly normalized solution 
w(a) = Vk eM (4.2.27) 
where k = \/2m|E|/h. We note that 
jim, ¥(2) = im, ¥@) = vk. (4.2.28) 
On the other hand 
i (a) = Vk eB ee . 7g ; (4.2.29) 
and hence 
Jim, o(a) — Tim "(x)= —2kv/k. (4.2.30) 


Upon integration of (4.2.26) over x from —e to € we obtain 


h2 


2m 


(W(6) — W(-<)) — 90) = Ef ae we) (4.2.31) 


which because of the continuity of ¢b(x) in (4.2.28) and the discontinuity of 
y' (a) as given in (4.2.30) for « — 0 gives 
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ie ( 2kvik) AVE =0, (4.2.32) 


k= Am/h?, or 
pa (4.2.33) 


consistent with (4.2.20) and coincides with the lower bound obtained in 
(4.2.25), with E in (4.2.33) as the only bound-state energy with correspond- 
ing eigenstate given in (4.2.27). 

In the following theorems of this section, we assume, for every Hamil- 
tonian into consideration, that, on the negative real axis, it may have at 
most eigenvalues. In Theorem 4.1.1, for example, we have seen that a Hamil- 
tonian with a potential satisfying the sufficiency conditions (4.1.2), (4.1.3) 
will have such a property. 

According to the introduction to this chapter we may then conclude that, 
in each case, if we can find any vector |) such that 


(b| |v) <0 (4.2.34) 


then the corresponding Hamiltonian 
H = — +V(x) (4.2.35) 
has at least one bound-state. 


4.2.4 Sufficiency Conditions for the Existence of a Bound-State 
forv=1 


Theorem 4.2.1 
If 


—0o < ie da V(a 0 fs da |a||V(ax)| < 00 (4.2.36) 


CO 


then the Hamiltonian in (4.2.35) admits at least one bound-state. 


To establish the validity of theorem, we use the properly normalized trial 
function 


w(a) = C exp(—a|z|/2), a>0 (4.2.37) 
where 
C= 5 (4.2.38) 


It is easily verified that 


(|Z le) = — (4.2.39) 
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and hence 


(b| H |) = +o fe dz e“"!V (x) (4.2.40) 


which may be rewritten as 


(| H |b) = of da V(x +o fe da (c — ale -1) V(x). (4.2.41) 
We use the elementary bound 
le-atel ee 1 < ale] (4.2.42) 


for all |x|, to obtain from (4.2.41) 


(| H |b) < 5 o(F +f da |x| |V (a + fr da V(x ) (4.2.43) 


and the latter is strictly negative for 


(0<)a < |- i. de v(a)| /|E +f da || ve) (4.2.44) 


remembering the conditions stated in the theorem, proving the existence of 
a vector |) satisfying (4.2.34). 
The elementary bound (4.2.42) follows by noting that for y > 0 


y y 
ree = fay’ e 4 (4.2.45) 
0 
and 
y ; y 
[1 —e7¥| = fw’ e ¥ <| dy! = y. (4.2.46) 
0 0 


Potentials satisfying the conditions of Theorem 4.1.1 and (4.2.36) are 
numerous. Such an example is given by the potential 


V(x) = Up (L — |x|) + Ave"! (4.2.47) 


where a > 0, L > 0 and 
2d 
ILU p= SO (4.2.48) 
a 


to ensure that f°. dx V(x) <0. 
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4.2.5 Sufficiency Conditions for the Existence of a Bound-State 
for v = 2 


Theorem 4.2.2 


If 
h2 
~co < [ex Vj z=—— (4.2.49) 
Au 
and 
0< [ex |x| |V(x)| < oo (4.2.50) 


then the Hamiltonian (4.2.35) admits at least one bound-state. 


To establish this result, we choose the trial function 


a 
w(x) = =e? a > 0 (4.2.51) 
V2 
to obtain ‘ Ro? 
a 
CE ) aa, (4.2.52) 
ha? a? poate 
(Hy) = + & fbx ere! ven 
h2 2 2 2 
= aE + = [ex V(x) + = [ex (ea — 1) V(x). 
(4.2.53) 
Hence 


a? [hen . 
(| H |b) < — | — + |] d°x V(x) +a [d°x |x| |V(x)| (4.2.54) 
Qn | 4p 
and the right-hand side is strictly negative if one chooses 


[At + fax V(x] 
Ja?x |x| |V(x)| 


(0<)a< (4.2.55) 
thus establishing the validity of the theorem, remembering the conditions 
(4.2.49), (4.2.50) stated (and hence, in particular, the strict positivity of the 
right-hand side of the inequality in (4.2.55)). 
In the light of this theorem, we may quite generally define a (space) scale 
parameter 
_ 3 faéx |x! VG) 
2 | fd?x V(x)| 


and an energy scale parameter 


(4.2.56) 
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= 3 
On R3 


Uo 


[ex |x| |V(x)]. (4.2.57) 


The sufficiency condition (4.2.49), in particular, for the existence of a bound- 
state may be then rewritten as 


Au 
Fz Uo st (4.2.58) 


with [d?x V(x) <0. 
The parameters R and Up conveniently coincide with the parameters of a 
two dimensional symmetrical well 


—Up, Ix| < R 
V(x) = (4.2.59) 
0, |x| >R 


where Up > 0. 


4.2.6 Sufficiency Conditions for the Existence of a Bound-State 
for v = 3 


Theorem 4.2.3 
df 
—o0 < [ex V(x) <0 (4.2.60) 


0< aan fats |x| |V(x)| < ( fox v9) (4.2.61) 


then the Hamiltonian H in (4.2.85) admits at least one bound-state. 


and 


To establish this result we choose the normalized trial function 


3 
w(x) = = el a 50 (4.2.62) 
TT 
giving 
2 h2a2 
(EE) = 
fn fn 
and 
h2 2 3 3 
(| H\b) = + = fa®x V(x) + [ov (e-o = 1) V(x) (4.2.63) 
Sy 87 87 


and hence 


(b| H |v) < oe [o? A-a B+1] (4.2.64) 
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where 
A= — /a8x |x| |V(x)| (4.2.65) 
hen 


ot 3 


The expression in the square brackets on the right-hand side of (4.2.64) 
may be rewritten as 


(4.2.66) 


A(a—az)(a—-a_) (4.2.67) 


1/2 
1+ (1- a) 
B2 


and the condition (4.2.61) ensures that 4A/B? < 1. 

Accordingly, we may choose a = B/2A, thus making (4.2.67), or the 
right-hand side of (4.2.64), strictly negative. This completes the proof of the 
theorem. 

As in the two-dimensional case, we may, in the light of the above theorem, 
define a space scale parameter by 


where 
= 8 
s° 2A 


(4.2.68) 


_ 4 Ja°x |x| |V(x)| 


oe, 3 | fax V(x)| 


(4.2.69) 


and an energy scale parameter 


1 3 
Up = =a | x |x| |V(x)]. (4.2.70) 


The sufficiency condition in (4.2.61), in particular, for the existence of a 
bound-state then reads 
4 Lt Up R? 
9 fh 


ie (4.2.71) 


with (4.2.60) holding true. 
The parameters R and Up above conveniently coincide, with those of a 
spherical potential well: 


—Up, Ix| < R 
V(x) = (4.2.72) 
0, |x| > R 
with Up > 0. 
As another illustration, consider the Yukawa potential 
e AIx! 
V(x) =—-A rie B>0, A> 0. (4.2.73) 


This gives 
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[ox |x| |V (x)| = = (4.2.74) 
and res 
Jo V(x) = a (4.2.75) 


The sufficiency condition (4.2.61) for the existence of a bound-state then 
reads 
pA 


1 
S 26RR 


(4.2.76) 


or equivalently reads 


ur 
OB soo (4.2.77) 


4.2.7 No-Binding Theorems 


As a by-product of establishing the boundedness of Hamiltonians for a 
special class of potentials studied in class [D] of §3.3, we here first recall the 
following No-Binding Theorem established there. 


Theorem 4.2.4 
Let —v(x) be the negative part of the potential V(x) as defined in (8.3.18), 
4.€., 


u(x) = —V(x) O(-V) (4.2.78) 


and define the Fourier-transform 


= [ox u(x) ep */P, (4.2.79) 


1/2 
= Fini? (3; pr Per) eae 


as a sufficiency condition, is such that 
b<l (4.2.81) 


then the Hamiltonian (4.2.85) admits no bound-states as it is strictly bounded 
below by zero (see (3.3.46), (3.3.47). 


Then if 


By using the integral expression (3.3.25) for 1/|p|, the constant b may be 
also rewritten as 


1/2 
= lt 3y qa 1 
= es ( x d°x’ OC ge we ve) j (4.2.82) 


For the Yukawa potential V(x) = —Ae~9!*!/|x| the condition (4.2.81) implies 
the sufficiency condition V2uA/h? < 3 for the absence of bound-states (see 
bound-states (see (3.3.48)—(3.3.53)). 

As a second theorem of no-binding for v = 3,2,1, we make direct use of 
the inequality (3.1.8) to obtain: 
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Theorem 4.2.5 
Given any real function g(x) such that the potential defined by 


V(x) = as x2g?(x) — sv - (x g(x)) (4.2.83) 


satisfies (4.1.2), (4.1.3), then directly from Theorem 4.1.1, (3.1.8), we have 


0< (= + i) (4.2.84) 


and the Hamiltonian in question has only a continuous spectrum (see Theo- 
rem 4.1.1) consisting of the positive real axis as —Cy is zero. 


The number of potentials that may be constructed from (4.2.83) by ap- 
propriately choosing real functions g(x) are endless (see, e.g., §3.3 class [B]. 
See also §6.1, §7.1). 

Finally, we provide a theorem of no-binding for the radial part of a Hamil- 
tonian, for a given spherically symmetric potential V(r), r = |x|, specified by 
an arbitrary orbital quantum number (§5.1) @=0,1,.... 

The radial part of the Hamiltonian, of a spherically symmetric potential, 
of a given @ is given by (see, e.g., §7.2) 


fh? Oe 0 h2 
ea 2p r2 & >) a Du v2 Copy Vr) (4.2.85) 


As in (4.2.78) define the negative part of the potential V(r) 
V(r) O(-V(r)) = -v(r). (4.2.86) 
We then have the following theorem. 


Theorem 4.2.6 
For 


— | rdrv(r)<1 (4.2.87) 
0 


the radial Hamiltonian (4.2.85) admits no bound-states for all €=0,1,.... 


The proof of this theorem is given at the end of §4.5. 
For the Yukawa potential —v(r) = —Ae~8"/r, r = |x|, A > 0, (4.2.87) 
gives the sufficiency condition 


2Ur 


aa ee (4.2.88) 


for no binding for any = 0, l,.... 
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A further interesting application of Theorem 4.2.6, is the following one 
which includes the spherical well potential —v(r) = —Up@(R— 1), Up > 0, 
as a special case. For any given (space) scale parameter R > 0, define the 
energy scale parameter 


9 Co 
Up = = | r dr v(r) (4.2.89) 


then according to (4.2.87) the radial Hamiltonian (4.2.85) admits no bound- 
states (for all €=0, 1,...) for 


R?U 


m Az <l (4.2.90) 


as a sufficiency condition. 


4.3 Hamiltonians with Potentials Approaching Finite 
Constants at Infinity 


In this section we are interested in a special class of potentials such that 
for any0< R<w,yv=1,2,3, 


i d’x |V(x)|? < 00 (4.3.1) 
|x|<R 


and there exists a real finite constant C’ such that 


|V (x) — C]| ———> 0 (4.3.2) 


|x| — 00 


in reference to a Hamiltonian as given in (4.1.1). 
A specific example of a potential belonging to this class is the potential 


V(x) = tanh? (6|x|) + ae7?l*! (4.3.3) 


with y, @, a, p real constants and p > 0. In this case C = y. 
We note that quite generally for any V(x) 


|V(x) — C/? < 2(|V@x)? + C?) (4.3.4) 
and hence V(x) — C is, from (4.3.1), locally square-integrable, i.e., for any 


0<R<ow, 
/ d’x |V(x) — C/? < oo. (4.3.5) 
| 


x|<R 


We may rewrite the Hamiltonian in question as 


H=H'+C (4.3.6) 


200 4 Spectra of Hamiltonians 


where 5 
p 


~ On 

According to the analysis given in subsection [E] of §3.3, H’ is bounded 
form below, with a lower bound of the spectrum given by, say, —@y4. Hence 
from (4.3.6), we have for |¢) normalized, 


H' 4(VOSC): (4.3.7) 


(| H|b) = C+ (IHW) > (Ctr) 
i.e., a lower bound of the spectrum of H is given by 
— fy =C — lx. (4.3.8) 


Finally, with Ho replaced by 
Hij==—+C (4.3.9) 


and V(x) replaced by V(x) — C, we may refer to the analysis already given 
in §4.1. Instead of (4.1.5), we choose & to be any fixed real number such that 


fo < min[C, —fy]. (4.3.10) 


From Proposition 4.1.2, in particular, we may infer that H has a continuous 
spectrum consisting of the interval (C’, oo) and has no eigenvalues of infinite 
degeneracy, and in particular —€y < C. Also if —€y < C, then H has 
also a discrete spectrum (of at most finite degeneracy) falling in the interval 
[—f7,C). 

We may summarize the above by stating the following. 


Theorem 4.3.1 

A Hamiltonian with the potential V(x) satisfying (4.8.1), (4.8.2), with |C| < 
co, is bounded from below. It has a continuous spectrum consisting of the in- 
terval [C, co) and has no eigenvalues of infinite degeneracy. Also if the lower 
bound —ly, of H is such that —ly < C, then H has also eigenvalues (of at 
most finite degeneracy) falling in the interval [—ly, C). 


4.4 Hamiltonians with Potentials Increasing with No 
Bound at Infinity 


In this section we are concerned with the spectrum of Hamiltonians with 
potentials V(x) belonging to the following important class of potentials: 


V(x) >0, V(x) ———> +00 (4.4.1) 


|x| 00 
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and V(x) is locally square-integrable, v = 1, 2,3. 
The classic example in this category includes the harmonic oscillator po- 
tential 


V (oe) = Spo? bx? (4.4.2) 
The underlying Hamiltonian 
H= Be + V(x) (4.4.3) 
2p 
is obviously positive, and we have the following theorem: 


Theorem 4.4.1 
HT has only a discrete spectrum. 


To establish this theorem, we first note that for any positive constant C, 
we may find a real positive constant R large enough such that for |x| > R 


V(x) >C (4.4.4) 
and for |x| < R, we may obviously simply write 
V(x) > 0. (4.4.5) 


Given any such a positive constant C’, we define the function u(x) of one- 
variable by 
-C/y, |z| < R 
u(x) = (4.4.6) 
0, |z|>R 


where v = 3,2, 1. 
Let x = (%,...,@,). Then if 


oi 1/2 
|x| = (>: “) <R (4.4.7) 


one necessarily has |#;| < R for all 7. From (4.4.6) we may then infer that 


C+ (>: ue) =0, for |x|/<R. (4.4.8) 


i=l 


On the other hand for |x| > R, it is easily seen that 


C+ Ss u(xi) <C. (4.4.9) 
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That is, in all cases, we have from (4.4.4), (4.4.5), (4.4.8), (4.4.9): 


V(x) > C+ S- u(ai) (4.4.10) 
i=1 
for all x, i.e. V(x) cannot be smaller than C + S> u(a;). 
i=1 
We define the Hamiltonian 
Poy 
H!’ == +) u(2i) (4.4.11) 
a as 
then 
(v|H| b) > C + (h|H'|v) (4.4.12) 


for normalized |y)). Also from (4.4.3), (4.4.10), (4.4.11), (1.8.87), (1.8.110) 
and (1.8.111) we have? 


An(H) > An(H") + C. (4.4.13) 


The eigenvalues of the operator H’ may be determined from the one- 
dimensional problem considered in §4.2, with the corresponding Hamiltonian 
defined in (4.2.1), (4.2.2). In the notation of the latter we set Up = C/v, 
L = R. Let (k) stand for v-tuplet of numbers (ki,...,k,) corresponding to 
the eigenvalues in (4.2.14): 


C hn 
Ex, =—-— ae 4.4.14 
ky v + Im R2 gk, ( ) 
In reference to the operator H’ one has 
feeree se (4.4.15) 
(k) = ImR2 ae ky ee 


Let n denote the total number of E(x) values in (4.4.15) which are 
arranged in a non-decreasing order Ey < ... < En, with Ex, = Ax(H"). 


In particular 
2 


2mnR? 
where no corresponds to n for the one-dimensional case as defined in (4.2.14). 
Equation (4.2.15) reads (see also (4.2.11), and note that Up = C/v here) 


2nC R 
(280 #2) ce, a 


? The proof for the spectrum of H follows the treatment by Reed and Simon 
(1978). 


An(H’) = -C+ ve (4.4.16) 
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where obviously C and, correspondingly, R, may be chosen large enough 
so that the left-hand side of this inequality is strictly positive, hence from 


(4.4.16) 
2 
hv 2mC 1 T 
/ 
> \/ : A, 
An(H')+C > oa ( yo a) (4.4.18) 


Also (4.2.13) reads 
1 /2mC R 
=N-+e. 4.4.1 
T vy oh 7 ( 2) 


From Figure 4.1, we know that we always have n > N. Accordingly, the 
limit C — oo, R — oo corresponds from (4.4.19) to N — oo and hence also 
to n — oo. From (4.4.18) and (4.4.13) we then have 


lim An(H) = +00. (4.4.20) 


Using Proposition 1.8.2 we may then conclude that the positive operator 
H has only a discrete spectrum. 


4.5 Counting the Number of Eigenvalues 


4.5.1 General Treatment of the Problem 


We introduce the spectral decomposition of a Hamiltonian (§1.8, (1.8.15)) 
H= f daPa(a). (4.5.1) 


Suppose that for some given real and a specified number €, H may have 
at most eigenvalues < €.3 Then we may write 


a=Df dW 6(X — HB) |d,0(d)) (4,00) 
Ax 


= iz dw’ 6(X — A) [A,0(A)) 70) 
at * 


= YF Pv) Av) OE = A) (4.5.2) 


A,v(A) 


3 In most of this section but not all, we consider negative € values, and, and when 
convenient, we replace € by —€, with € > 0 in the latter. 
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where v(A) specifies the degree of degeneracy of the eigenvalue , |A, v(A)) 
are corresponding eigenvectors, and O(€ — X) is defined here as O(a) = 1 for 
x >1, O(a) =0 fora <0. 

From (4.5.2), we may introduce, a density of states 


(x| Pir(€) bx) = S> davon)’ O(E - 9) (4.5.3) 


A,v(A) 


where 
Wy,v(ay(X) = (x A, ¥(\) ) 
are the eigenstates in the x-description. 
Upon integration of (4.5.3) over x, we obtain 


| “x (x| Pu(€) |x) = S > O(€-A) 
AVA) 
= N(H,6) (4.5.4) 
where 


N(H,€) = Number of states with eigenvalues < € 
= Number of eigenvalues (counting the degree of degeneracy) 
<é. (4.5.5) 
One may also introduce a non-local density of states defined by 
(x| Py (€ =" Wo, va) ( x) Vy. v( (ny) (x x’) O(E — d) (4.5.6) 
dv) 


which will be useful later on. 

Equation (4.5.4) also allows one to obtain an expression for the degree 
of degeneracy of an eigenvalue, say, Ao. Let A1, A2 be eigenvalues of H such 
that Ay < Ao < Ag. Define 


E= min(A2 = Xo, o oan A1) (4.5.7) 
and 
Xo =Aote, Ao =2XA9-€ (4.5.8) 
then from (4.5.4) 
| ”x [(x| Pa (Ao) Ix) — («| Pa (Ao) x)] = $2 [OQ —) - (0 — 9) 


A,v(A) 


I 


yA (4.5.9) 


v(A) 
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for the degree of degeneracy of the eigenvalue of Ao. Later in Chapter 9, we 
will see how the left-hand sides of (4.5.3), (4.5.4), (4.5.9), may be evaluated 
in terms of so-called Green functions. 

Another useful expression that may be obtained from (4.5.4) is the fol- 
lowing. Upon integration (4.5.4) over € as follows from —co to 0, we have 


[ae forx (x| Px (€) |x) = ie df O(€ — 2) 


A,V(A) 


= a dé N(H,£) (4.5.10) 


providing a useful expression for the negative of the sum of the negative 
eigenvalues (if any) of H. 
As another application, we consider for € > 0 


(x| (H —V)Pu(6) |x) = $2 (xl (HV) )A,uQ)) (7) |x) OE - A) 


A,v(A) 
(4.5.11) 
which from the equality 
(|v = farx’ Gal Vx) 
= for’ xv (x’) (x’| 
= V(x) (x| (4.5.12) 
for a local potential: 
(x| V |x’) = V(x’) 6’(x — x’) (4.5.13) 


leads to 


(x| (H —V)Pu(€) bx) = S5 (xl A — V(x) A, 1) (AQ) Ix) OE — A) 


A,v(A) 


= ey Prv( — V(x)] dy vay(x) O(€ — A). 
A,v(A) 
(4.5.14) 


From the first equality in (4.5.14), if H — V(x) = —h?V?/2y defines the 
kinetic energy, then an integration of (4.5.14) over x gives (€ > 0) 


[ox (x| (HV) Pa(@) bx) = > AHI ELA. v)) OE). (4.5.15) 


A,v(A) 
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As we will see later, this expression is important for multi-electron systems 
as used in Chapter 13. For example, if only one particle occupies a given state 
and the states, up to the maximum possible energy < €, are filled then (4.5.15) 
gives the sum of the average kinetic energy of all the particles with the most 
energetic having the maximum possible energy allowed < €. 

The purpose of the remainder of this section is to find an upper bound for 
the number N(H,), in (4.5.4), of the eigenvalues for € < 0, and the negative 
of the sum of the eigenvalues in (4.5.10), by considering in the process the 
€ — 0_ limit, for a given Hamiltonian H. Lower-bound expressions will be 
also obtained for the ground-state energy. 


4.5.2 Counting the Number of Eigenvalues 


We rewrite the Hamiltonian H (g) = Hj +V, where Hp is the free Hamil- 
tonian p?/2y, in the form 


H(g)=Ho+9V (4.5.16) 


by introducing a variable coupling parameter g > 0, with g = 1 corresponding 
to the Hamiltonian in question. 
As in (3.3.18), (3.3.19), we introduce the negative part of the potential 


V O(-V) =-v=-|V|_. (4.5.17) 


Accordingly, 
H(g) > Ho — gv. (4.5.18) 


Let N(H(g), —€) denote (see (4.5.4)) the number of eigenvalues of H(g) < 
—€, with € > 0. The inequality (4.5.18), together with those in (1.8.110), 
(1.8.111) imply that 


An (H (9) > An (Ho — 9 v)- (4.5.19) 


That is, the number of bound-states of Ho — g v cannot be less than that of 


H(g): 
N (Ho — g v,—€) = N (Ho + gV, —€). (4.5.20) 


Similarly for 0 < g’ < g, 
Hjy-gvu>Ho-gv (4.5.21) 


and hence 
N (Hp pu oye N (Boag a6); (4.5.22) 


Finally for ; > 2 > 0, 


N (Ho — g v, —€2) = N (Ho — 9 v, -&1) (4.5.23) 
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obviously holds true recalling the definition in (4.5.5). 

Let A(go) denote an eigenvalue of Hp — gov for some go. A(g), being neg- 
ative, is a decreasing function of g (see (4.5.19) and Proposition 1.8.2) as g 
increases beyond the value go (see Figure 4.2), the curve traced by X(g) 


necessarily cuts every horizontal axis, such as g = gj, g = g,---, with 
go < 9 < 9 <..-.; in the manner shown in Figure 4.2, otherwise the 


Hamiltonian Hy — gov will have more eigenvalues than, say, Ho — gjv in 
contradiction with (4.5.22). 

Also every eigenvalue A(g) of Ho — g v, for which X(g) < —€, for a given 
€, falls on a curve which cuts the vertical axis 1 = —€ (see Figure 4.2) at 
some g-value, say, equal to g’ corresponds to the eigenvalue \ = —€ of the 
Hamiltonian Hy—g'v. Finally, note that the order of eigenvalues (see (1.8.90)) 
is preserved as g increases. 

All told we have the important relation: 


N (Ao — g v, —€) = [Number of g’’s in 0 < g/ < g for which Hy — g’v 
has the eigenvalue A = —é]. (4.5.24) 
According to (4.5.24), we are led to consider the eigenvalue problem 
p? 
(E- av) |v) =-ele), Ill=1 (4.5.25) 
which may be rewritten in the form 


Ald) =] |) (4.5.26) 


where A is the positive operator 


A=yvv —_ vu (4.5.27) 
(+) 
and 
|?) = Vv |). (4.5.28) 


In Figure 4.2, the eigenvalues, say, {A(gj)} of Ho — gv are defined by the 
intersections of curves {A\(g)}, as g varies, with the horizontal axis g = gj. 
These same curves {A(g)} cut the vertical axis 1 = —€ at corresponding 
points {g’}, where the 1/g’ are the eigenvalues of the positive operator A, in 
(4.5.27), with the eigenvalue equation with the eigenvalue equation given in 
(4.5.26). 

We introduce the spectral representation (1.8.15) of A: 


A= fe dPa(7). (4.5.29) 
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A 


Fig. 4.2. X(g), an eigenvalue of Ho — gu, being negative, is a decreasing 
function of g and necessarily cuts all the horizontal axes g = 9,9 = 95---; 
as g increases beyond a given value go. Also every eigenvalue A(g) < —€, of 
Ho — gv, falling on a curve which cuts the vertical axis \ = —E, say, at g = 9’, 
corresponds to the eigenvalue \ = —€ of the Hamiltonian Ho — g’v. 


We will have occasion to use powers of the positive operator A 


Ara f ay Olay = Ady 
0 


= joe dP (7) (4.5.30) 


(see (1.8.14), (1.8.15)), where p is not necessarily an integer. Explicit represen- 
tations of such powers of positive operators are formally given. For example 


9) co 
A-V?2 — —_ } dg exp (—2? A) (4.5.31) 
A 
and for p < —1/2, 
1 f° da 
ae i, Poree exp (—2? A) (4.5.32) 
p 


where 
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° da 
c= f per &XP (—2?). (4.5.33) 


Positive powers of A may be obtained from (4.5.32), e.g., by multiplying the 
latter by appropriate powers of A: 


2 CoO 
AM? = = | da exp (—2*A) A (4.5.34) 
Vr Jo ( ) 
2 Co 
AS? =F [der exp (-2?A) A? (4.5.35) 
vi Jo ( ) 
and more generally 
od 
ao Saat exp(—27A)A™, o <-1/2 (4.5.36) 
with p=a+m,m=0,1,..., in a convenient notation. 


Other representations may be also given. For example, 


Ar =C! fw got a , 0<o<l (4.5.37) 
* Jo (A+ 2) 
with p=o+m,m=0,1,..., and 
,  sin(7o) 
CL= (4.5.38) 
1 


Later on, in (4.5.93), we will encounter the case p = 3/2. 
From (4.5.30), for p > 0, in particular, 


(x| AP |x) > / e d(x] Pa(a) bx) 


> Z| Patra) x) — (x 


(9) om 


recalling that P4(y7) is self-adjoint, thus (4.5.39) involves real numbers only, 
where yo > 1/g, and is otherwise arbitrary. Upon integration over x we obtain 
(see (4.5.4), (4.5.24)) 


1 
| d’x (x|A?|x) > a x {number of all g’ ’s, counting degeneracy, as eigenvalues 
g 


of A in (4.5.26), such that 1/70 < g' < gl]. (4.5.40) 


Since the point yo > 1/g is arbitrary, we may take the limit yo — oo in 
(4.5.39) and obtain from (4.5.24), (4.5.40) 


N(Ho — gv, —€) < g? [ex (x |A?| x) (4.5.41) 


210 4 Spectra of Hamiltonians 


and from (4.5.20) 
N(Ho + V,-&) < Jor (x |A°| x) (4.5.42) 


with g=1. 
In reference to the definition (4.5.17), the following inequality will be also 
needed: 
V O(-V) + €/2 > (V + €/2) O(-V — €/2) (4.5.43) 


where € > 0, whose validity is easily established by considering, in turn, the 
three cases: V > 0, —-€/2<V <0,V < —&/2. 
From (4.5.43), we obtain 


Hy) +V O(-V) 2 Ao4 (v4 5) e( V =) : (4.5.44) 


and hence 


No 4 V O(-¥).-8) <N (i | (v+s)e( V 5): s). 


(4.5.45) 
In (4.5.2), we are interested in Hamiltonians which are bounded from 
below (§§3.2-3.4, §4.1). Accordingly, an important situation arises for which 
the spectrum of a Hamiltonian is empty for energies < —£ for some specific 
value & of €, i.e., for which N(Hp + V,—0) = 0. Clearly, such a value —£ 
would then provide a lower bound to the spectrum of the Hamiltonian Ho+V. 
In what follows, we investigate the nature of N(Ho + V,—€) in space 
dimensions v = 3,2,1. The radial part of the Hamiltonian of a spherically 
symmetric potential is also considered. Subsequently, we examine the expres- 
sion in (4.5.10) for the negative of the sum of the negative eigenvalues of a 
Hamiltonian. In all of these investigations, one encounters existence of in- 
tegrals of the form fd’x v°(x) < oo of the potentials in (4.5.17), for some 
positive powers 3, which are implicitly assumed to be satisfied. 
We will show below that the number of eigenvalues of H = Hj) + V, 
counting degeneracy, with energies < —é, for € > 0, satisfies, in a compact 
form, the inequality 


N(H, -€) < a, (EQ) -4"42)/? Je (v (x)) Or")? (4.5.46) 
for vy = 3,2,1, where 


_— ( MiP) (ea Bu eay ys 
a, = (355) () (4.5.47) 
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The three-dimensional case (v = 3): 


In three dimensions (v = 3), we choose p = 2 on the right-hand side of 
(4.5.42). Thus with the definition of A in (4.5.27), we obtain for the right- 
hand side of (4.5.42): 


Je’ d°x’ u(x) u(x’) (x ied “) (4.5.48) 


where (see the Appendix to this section), with € > 0, 


1 ‘ne 1 |x — x’| 
(x [E+q x ) = Ont 30 exp ( i v8) (4.5.49) 


2 
e+e 


giving from (4.5.42) the so-called Schwinger inequality:+ 
en 2Ix—x' | Vue /h 


u(x’). (4.5.50) 


N(Ho +V,-€) < Go) [ex dx! u(x) 


|x — x’|? 
Since the integrand in (4.5.50) is positive, and the exponential factor is 
bounded above by one, we may further bound the right-hand side of (4.5.50) 


from above by 


2 1 
(45) [ox d?x’ u(x) eK? v(x’). (4.5.51) 
This expression is nothing but the constant b? in (4.2.82), (4.2.80) of Theo- 
rem 4.2.4. In particular, for b < 1, the latter theorem states that the Hamil- 
tonian admits no bound-states (see also (3.3.46), (3.3.47)). This is consistent 
with (4.5.50), showing that N( Hy) + V, —€) = 0 for any € > 0, however small, 
in case b < 1. 
Now we use (4.5.50) to derive a lower bound for the spectrum of the 
Hamiltonian in question as well. To this end, we use Young’s inequality (see 
Appendix II), with p = q = 2, to obtain 


ge te.) e721x—x"| V2 /h ; 
Jc x d’x’ u(x) eae 


Ix —x 


< ( fox (v0)") ( [es — (4.5.52) 


which from (4.5.50) gives 


4 Schwinger (1961b). 
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/ 
N(Ho + V,—-€) < ) : oR dbx v?(x) (4.5.53) 


which coincides with (4.5.46)/(4.5.47) for v = 3. 
Clearly, if for any 6 > 0, we choose 


c= ee (ses) J (fet we y") (4.5.54) 


then the right-hand side of (4.5.53) is necessarily less than one, implying that 
N(Ho + V,—&) = 0 for such a —€, and the spectrum of the Hamiltonian is 
empty for energies < —€. That is, (4.5.54) gives the following lower bound 
for the ground-state energy of the Hamiltonian, 


a ee (ses) J (fet we ye) (4.5.55) 


for any 6 > 0. 
The two-dimensional case (v = 2): 


For v = 2, p = 2, the right-hand side of (4.5.42) is given by 


[ox (x |A?|x) = forx d?x' u(x) (x ea “) u(x’) (4.5.56) 


where (see the appendix to this section) 


(eta) <sere el vm) asin 


and Ko(x) is a modified Bessel function with asymptotics 


Ko(x) —> —Inaz (4.5.58) 


Ko(e) —— ,/—— e |, (4.5.59) 


From (4.5.42), we obtain 
= PNP fe ti Ix — x'| mes 
N(Ho + V,—-€) < () d°x d°x’ u(x) | Ko i V 2uE u(x’). 


Using the integral, 
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2 2 
; |x| th 
fa x CG ( ve) ) =e (4.5.61) 
and Young’s inequality, as done in (4.5.52), we obtain from (4.5.60), (4.5.61), 
Me 1 Bie cd 
< .O. 
N(Hp + V,—€) (5) az fx vs) (4.5.62) 


which coincides with (4.5.46)/(4.5.47) for v = 2. 
Accordingly, if for any 6 > 0, we choose 


eve) (5) [ox v(x) (4.5.63) 


T 


then the spectrum of the Hamiltonian is empty for energies < —€, as the 
right-hand side of (4.5.62) will be less than one. This gives the following 
lower bound for the Hamiltonian 


aM + §) (5) [ex v2(x) (4.5.64) 


for any 6 > 0. 
The one-dimensional case (v = 1): 


For the one dimensional case we choose p = 1 in (4.5.42), to obtain 


[i (a |A| a) = [i v(2) (= Esq “) (4.5.65) 


where 
1 ant dp 1 
( [ze +¢| ) sie anh [ee +6] 
2 Jz 5 (4.5.66) 
and hence 


N(Hy + V,-8 < Jz ; / de v0) (4.5.67) 


which coincides with (4.5.46)/(4.5.47) for v = 1. 
Accordingly, by choosing 


ga (148) as ( [ae ve) (4.5.68) 
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for any 6 > 0, we have the following lower bound for the ground-state energy 


— (148) 3h (| ae vte)) (4.5.69) 


—oco 


The radial part for a spherically symmetric potential: 


The radial part of the Hamiltonian of a spherically symmetric potential 
energy V(r), where r = |x|, v = 3, is given by (see, e.g., 87.2) 


Re Ae fia: oO h? ee+1) 
Hy = tV 4.5.70 
: Qy r? & >) Qu or? (") ( ) 
where ¢ = 0,1,... define so-called orbital angular momentum quantum num- 


bers (§5.1). 
The Dirac delta in spherical coordinates is given by 


d(r—r’) 6(0— 6’) 


1\ _ 53 5 ee. / 
(x| x’) = 0° (x — x’) 72 and d(@-—¢’) (4.5.71) 
so that ; 
| r? ar | sind do | dd 6°(x—x’) =1. (4.5.72) 
0 0 0 
Accordingly, for the radial part only, we may write 
d(r —7’) 
Upon defining the “free part” of (4.5.70) by 
Be of O30 h? €(€+1) 
Hoe = Qy 1? & =) Qu or? e004) 
and choosing p = 1, € = 0 in (4.5.42) we obtain 
Nz, = N(Hoe + V,0) < | r? dr (r|Alr) (4.5.75) 
0 
where ; 
A= Vv(r) — Vo(r) (4.5.76) 
Hoe 
u(r) = —V(r) O(-V(r)). (4.5.77) 


We explicitly have 


(rlAlr) = 0(r) (re 


: 
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= oe fa [5 (r" (r +0)) 4 5(r’ (r —0)) x re r’) 


(4.5.78) 


with asymmetric average taken over r in performing the trace operation, and 
where we have used (4.5.73). 
It is easily verified that 


o(r—r’) 
= ge (4.5.79) 
Since 
— O(r—r) = d(r—1') (4.5.80) 
this gives from (4.5.78) 
Qu u(r) 
Alr) = = —— 4.5.81 
rai h? r(20+1) aoe) 
which leads finally to the so-called Bargmann inequality° 
Qh 1 
Nex =z d 4.5.82 
(<2 open f rare) (4.5.82) 


as an upper bound for the number of bound-states corresponding to an orbital 
quantum number . 

An application of (4.5.82) was given at the end of §4.2, here it is spelled out 
in more details and includes the spherical well potential —v(r) = —Up9O(R—-r) 
as a special case. For u(r) any function of r such that® 


0< | dr u(r) << 00, 0< i rdr u(r) < o. (4.5.83) 
0 0 
These allow us to introduce a (space) scale parameter 
rd 
ee ean) (4.5.84) 
5 dr v(r) 
and an energy scale parameter 
U = | aaes (4.5.85) 
=—/ rdrv(r). ib: 
0= Re J, 


° Bargmann (1952). 
® Note that the finiteness (< 00) of any one of the integrals in (4.5.83) does not 
necessarily imply the finiteness (< oo) of the other. 
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Upon substitution of (4.5.85) in (4.5.82) we obtain 
RU 


<=. 10. 
NeS ers 1 (4.5.86) 
Hence for ‘ 
LRU 
= See (4.5.87) 


Hy admits no bound-states for any given @. 


4.5.3 The Sum of the Negative Eigenvalues 


Now we use the expression on the right-hand side of (4.5.10) to derive 
upper bounds for the negative of the sum of the negative eigenvalues (if any), 
counting degeneracy, of a Hamiltonian H. These will find useful applications, 
for example, to multi-particle systems in §4.6 and in Chapter 14. 

The expression for the negative of the sum of the negative energies of H 
is obtained by integrating N(H,&), in (4.5.4), (4.5.5), over € from —oo to 0, 
as shown in (4.5.10). With the substitution € — —&, it will be shown below 
that this sum, in vy = 3,2,1 dimensions, satisfies, in a compact form, the 
inequality,” 


[oe wct-9 eC, forx coc yy eareia2 (4.5.88) 


where —v(x), as before, is the negative part of the potential V in H = Hyp +V 
defined in (4.5.17), and 


v/2 (3-—v)(v—-1) (v—1)(v—2)/2 
Cai! 2 s (4.5.89) 
h? v(vy+2) \4 T 


vy =3,2,1. 
The inequality in (4.5.88) is established below. For the clarity of the pre- 
sentations, each of the respective dimensions are treated separately. 


The three-dimensional case: 


The expression in (4.5.53), is not suitable for the integration to be carried 
out over € when applying (4.5.10). We may, however, use the upper bound 
expression of the inequality in (4.5.45) as long as we replace —u(x) by —v(x)+ 
€/2 and € by €/2 in (4.5.53). Due to the step function restriction on the right- 
hand side of the inequality (4.5.45) we now have the constraint 0 < €/2 < 
u(x). Accordingly, 


[aenen+v-9< (By fare (v8) ve 


” See also Lieb and Thirring (1976). 
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[o-e) 3/2 
[ent PO) = (3 +) [axe as (4.5.91) 
0 


which coincides with the inequality in (4.5.88) for v = 3, and is referred 
to as a Lieb-Thirring bound,® providing an upper bound for the negative 
of the sum of the negative eigenvalues (if any), counting degeneracy, of the 
Hamiltonian in question. 

Needless to say, since the ground-state energy cannot be less than the 
sum of the negative eigenvalues, (4.5.91) gives the following lower bound for 
the ground-state energy 


eg (3) [ex (u(x))9/? (4.5.92) 


which is to be compared with (4.5.55). Which one provides a more optimal 
bound depends on the potential v. 


The two-dimensional case (v = 2): 


This lower dimensional cases is a bit more difficult to handle. For p = 1, 
the trace on the right-hand side of (4.5.42) is infinite (see Problem 4.4) and 
hence not useful. For p = 2, the right-hand side of (4.5.62) is not integrable 
over €. On the other hand, for p = 3/2 all the relevant integrals are conver- 
gent. 

To the above end, we use the inequality (see Problem 4.5) 


Ps [4° | x 2x ( x|v? : —oa|*} (4.5.93) 
Jc (x| ) < farx (x |” ari 4.5.93 


E 


where A is defined in (4.5.27). 
Upon using the integral 


ne Seat (4.5.94) 


we obtain from (4.5.93) the bound 


8 Lieb and Thirring (1975). 
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N(Hp + V,-€) < 7 d2x v3/?(x). (4.5.95) 


Finally using the upper bound in (4.5.45), as done in writing (4.5.90), 
together with (4.5.20), the above inequality leads to 


[scsi BE fore f° (00 -§)” 


— 2u T(1/2) [(5/2) 2 5 
= TG) Jc x (u(x)) (4.5.96) 


or the inequality which coincides with the one in (4.5.88) for v = 2: 


2 
[ee Neto + v -< Te [atx (w (4.5.97) 


Needless to say, minus times the expression on the right-hand side of 
(4.5.97) provides a lower bound to the ground-state energy. The expression 
obtained in (4.5.64), however, gives a more optimal one for sufficiently small 
5 > 0. Finally, (4.5.95) also leads to another lower bound for the ground-state 
energy given by 


~ (146) we ( Jes (oxy?) (4.5.98) 


for any 6 > 0. Which one of the expressions in (4.5.64) or (4.5.98) provides a 
better bound, depends on the potential u(x). 


The one-dimensional case (v = 1): 


Upon using (4.5.45), as done in (4.5.90), (4.5.96) leads to 
2vu(2x) 
7 -O< te ef wl )-§) (4.5.99) 


[cenit V,-£) < ve da (v(x))3/? (4.5.100) 
0 


which coincides with the inequality in (4.5.88) for v = 1. 
This provides the following lower bound for the ground-state energy 


4/2 2 
Sere da (v(x))°/?. (4.5.101) 


—oco 


As before which one of (4.5.69) or (4.5.101) is better depends on the potential 
Vv. 
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We rewrite the lower bound for a ground-state energy for a potential, 
whose negative part is denoted by —v as defined in (4.5.17), for v = 3,2,1, 
and derived in (4.5.55), (4.5.64), (4.5.69), in a compact form 


1/q(v) 
HS ay ( forx (wearer) (4.5.102) 


where 
1/2,v=3 
Aipty? E19 
q(v) = —— = 1, v=2 (4.5.103) 
1/2,v=1 
1\ 71 we \2/2-a(v))+1 
bee) (=) (5) (4.5.104) 
for any 6 > 0. 


Similarly, for the lower bound of the ground-state energy obtained from 
the sum of the negative eigenvalues, as derived in (4.5.91), (4.5.97), (4.5.100), 
we have in a compact form,® 


Bes hi, Jor (u(x))it¥/? (4.5.105) 


where 


v/2 (3—v)(v—1) (v—1)(v—2)/2 
eee ss 2 f . (4.5.16) 
h? viv +2) \4 1 


Needless to say, for vy = 1,2,3, the existence of the integrals in (4.5.102), 
(4.5.105) is assumed. 


Appendix to §4.5: Evaluation of Certain Integrals 


We evaluate the expression on the left-hand side of (4.5.49). To this end 
we are led to consider the integral 


(x _ i “hm | dp elp(x-x')/h 
[B+ é| Gan? Tee 


The angular integration is readily evaluated yielding for the latter the integral 
UL 1 oo ‘ einp/h 
niin | 2 Pea 


° See also Lieb (2000), for similar and other inequalities and for improvements of 
coefficients such as av, by. 
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integrating symmetrically over p, where 7 = |x — x’|. In the complex p-plane, 
the integrand has simple poles at p+ iV/2ué€. We may close the contour in 
the upper complex p-plane since the infinite semi-circle contour in the upper 
plane will not contribute. This gives immediately by the residue theorem at 
the pole p = iv/2u the result given in (4.5.49). 

For the corresponding two dimensional case given in (4.5.57), we note that 
for the integral 


/ dp ip: (x—x')/h 
2 2 


the angular part is given by 


20 
| d6 eipncos O/h — 27 Jo(pn/h) 
0 


where Jo() is the Bessel function of order zero. On the other hand, 
is ES ene (A-4.5.1) 
, (x2 + a) QlOr) = Nola -4.0. 


is a well known modified Bessel function of order zero, thus obtaining (4.5.57). 
[See also (4.5.58), (4.5.59).| 


4.6 Lower Bounds to the Expectation Value of the 
Kinetic Energy: An Application of Counting Eigenvalues 


The results obtained in 84.5 will be used to obtain lower bounds for the 
expectation value of the kinetic energy operator as a useful application. The 
latter will be important, in particular, for studying stability problems of 
multi-particle systems as given in Chapter 14. 


4.6.1 One-Particle Systems 


We first consider a particle in three dimensions, introduce the probability 
density 
2 
p(x) = |b(x)| (4.6.1) 
and define the positive function 


f(x) = esas = (4.6.2) 


where 


r=(¢ Ea ) (4.6.3) 
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and the parameters are positive and will be defined shortly. 
We note that explicitly 


(o]E - #8) SS yair (4.6.4) 


In reference to the bound in (4.5.92), for example, one formally!? has 


7 3/2 
(ir ![8) 2 age GE) feo” 
3/2 o 
4 @ ae pear (4.6.5) 
x p@ 


where we have used (4.6.2) in writing the last equality. This suggests to choose 
5a/2=a+1, or a = 2/3. Hence with y > 1, (4.6.4), (4.6.5) give 


2/3 2/3 792 
y-1 15a h 
(3) ( ; ) Dp dx (oe) Pp? of. (4.6.6) 


Optimizing over y, gives y = 5/3, or! 


: (3)" i i dx (o(x))°8 < T. (4.6.7) 


Needless to say, f(x) in (4.6.2) is not the potential energy for any given 
physical Hamiltonian. It is just introduced in order to be able to obtain 
a lower bound for T. Also all the integrals in (4.6.2)—(4.6.7) are implicitly 
assumed to be finite. 

Similarly, by choosing a = 2/v, y = (2 +v)/v, we have, in reference to 
the bound in (4.5.105), 


h 
Avs, | *x (p(x))CM” < T (4.6.8) 


with 


Waly fp \ (4X 28-MO=D/o 
= (v—1)(v—2)/v 
ay is) (55) (5) (7) ee) 


for v = 1,2,3 (see Problem 4.6). It is easily checked that (4.6.8) coincides 
with (4.6.7) for v = 3. 


‘0 For related technical details and subtleties see also, Lieb and Thirring (1976), 
p. 273. 

11 This inequality is referred to as a Lieb-Thirring kinetic energy inequality: Lieb 
and Thirring (1975). 
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On the other hand, by choosing a = 1, using the normalization condition, 


Jo p(x) =1 (4.6.10) 


and optimizing over y, as in (4.6.6), we obtain in reference to the bound 
(4.5.102), the following inequality for the expectation value of the kinetic 
energy, 


vy pm\(v—(4-v)/v R2 aie 
= a=) pose pe <P 4.6.11 
ie) 2p (| x s(x) (0-11) 


for v = 1,2,3 (see Problem 4.7), for any € > 0. 
In particular, for v = 3, (4.6.11) gives 


30 pm\2/3 Re aA 
— (5) oe ( fer Px) ra (4.6.12) 


for any € > 0, which is to be compared with (4.6.7). 
To the above end, we may write 


p°/3 = pA/3 1/3 (4.6.13) 


and use Hélder’s inequality (see Appendix II), with p = 3/2, gq = 3 to obtain 


[2x0 < ( Jax fo) (4.6.14) 


where we have used the normalization condition in (4.6.10), with v = 3. 
Accordingly, the left-hand side of the inequality (4.6.12) is bounded below 


by 
1/3 2/3 49 
(3) (F) sf ix oC) (4.6.15) 
LU 


l+e 2 


Since for sufficiently small ¢ > 0, 3'/3/(1+.¢) > 3/5, (4.6.12) provides a 
better bound than the one in (4.6.7) for ¢ > 0 small enough. We leave it as 
an exercise to the reader for the comparisons of (4.6.8), (4.6.11) for v = 2,1, 
in a similar fashion. 


4.6.2 Multi-Particle States: Fermions 


We consider N identical fermions, each of mass m, and introduce the 
particle number density in three dimensions: 


p(x) = N S- ie d?x3...d°xy |¢)(xo1,X202,...,xvow)|” (4.6.16) 
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where o,...,0n specify spin projection values taking each (2s + 1) values 
for a particle of spin s (85.4). 

The total number of particles N is obtained from the normalization con- 
dition 


Jo p(x) = N. (4.6.17) 


The (normalized) wavefunctions ~(x101,...,XNaN) are assumed to satisfy 
the appropriate statistics which in this case are anti-symmetric in the ex- 
change of any two particles which amounts to the interchange of the position- 
spin labellings: (x;o;) = (x;0;). 

As in (4.6.2), we introduce the positive function 


5 pr/9(x) 


= op 4.6.1 
where 
T= : 4.6.1 
(» d a | (4.6.19) 
then it is easily verified that 
= 5 
(° SS” f(x) «| =,f. (4.6.20) 
i=1 
We consider the operator 
N 2 
Pp; 
| - i) (4.6.21) 
rr 2m 


defining a hypothetical Hamiltonian of N non-interacting fermions which, 
however, interact with the external “potential” — f(x). 

To obtain a lower bound to the spectrum of the “Hamiltonian” in (4.6.21), 
we note that, allowing for multiplicity and spin degeneracy, we can put the 
N fermions in the lowest energy of levels of the “Hamiltonian” in conformity 
with Pauli’s exclusion principle, if N < number of such levels. If N is larger 
than this number of levels, the remaining free fermions may be chosen to 
have arbitrary small (— 0) kinetic energies, and be infinitely separated, to 
define the lowest energy of the Hamiltonian in (4.6.21). That is, in all cases, 
the Hamiltonian (4.6.21) is bounded below by (2s +1) times the sum of the 
negative energy levels of the Hamiltonian [p?/m — f(x)], allowing, in the 
sum, for multiplicity but not for spin degeneracy. A bound to the latter sum 
has been already determined in (4.5.91), and with u(x) > f(x), uw — m, we 
obtain 


(° 


3 EE 2 i) | +) 2 —(28+ N= (jn) [ox (ieee. 
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Hence from (4.6.18)—(4.6.22), we have 


9 4 (2m\3!2 /5\ 8/2 —3/2 
—3f 2 (s+ & (5) i he ( fox 0°00) : 
(4.6.23) 


ct Ge ae a ee 
[3 < 
F (5) om fa x p'’?(x) < T. (4.6.24) 


Again, we note that f(x) is just introduced in order to derive a lower 
bound for T and is not the potential for any given physical Hamiltonian. 

We leave it as an exercise to the reader to consider, formally, the vy = 2,1 
cases (see Problem 4.8). 


4.6.3 Multi-Particle States: Bosons 


For simplicity of the notation, we consider (identical) bosons of spin 0. 
We introduce the particle number density: 


p(x) = N | d3x2...d®xy |u(x,x2,...,xN)|’- (4.6.25) 


A very conservative lower bound for the bosonic case may be directly obtained 
from that of the fermionic one in (4.6.24) without any further work by simply 
replacing (2s + 1) in the latter by N since in the present case one can put 
the N bosons in the lowest energy level of [p?/2m— f(x)], where f(x) is 
similarly defined as in (4.6.18). This leads to the inequality 


3 (30 \7> we 


for bosons, where the N~?/3 factor on the left-hand side should be noted. 
An improvement (i.e., a larger value) to the numerical (3/5)(37/2)?/3 may 
be obtained but we will not go into it here. 

A similar analysis may be carried out in vy = 2,1 dimensions (see Prob- 


lem 4.8). 


4.7 The Eigenvalue Problem and Supersymmetry 


4.7.1 General Aspects 


The purpose of this section is to construct supersymmetric Hamiltonians, 
and show how one may study the eigenvalue problem of a given class of 
Hamiltonians using supersymmetry as a tool for doing so. 


12 This inequality is referred to as a Lieb-Thirring kinetic energy inequality: Lieb 
and Thirring (1975). 
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In §2.9, we have defined supersymmetric transformations and, by consid- 
ering successive infinitesimal transformations in a closed path, have derived 
the following commutation/anti-commutation relations involving the super- 
symmetry generators Q, Qt and the Hamiltonian H: 


[H,Q])=0, [H,Q'] =0 (4.7.1) 


{Q,Q}=0, {Q',Qr} =0 (4.7.2) 


and, in turn, obtained the explicit expression for H in terms of Q, Q?: 


1 
i= {Q,Q"}. (4.7.3) 


The unitary operator for infinitesimal supersymmetry transformations, 
specified by infinitesimal anti-commuting c-numbers 8£, 5€* (see (2.9.2), 
(2.9.3)) is given by (see (2.9.21), (2.9.22), (2.9.20)) 


U = 14+ (i5e" Q-iQt8¢). (4.7.4) 


For any state |¢) in the domain of Q, (4.7.3) implies that 
ae i Tato! 
(61H14) = = (6|QQ'|4) + & (6latal¢) 


il 2. fol 2 
= 5, 1'4| + = llegll” > 0. (4.7.5) 
That is, the spectrum of H satisfying (4.7.3) is necessarily non-negative. 

A state for which 


Q |b) =0 (4.7.6) 
and 
Q* |) =0 (4.7.7) 


is called a supersymmetric state as it remains invariant under the transforma- 
tion implemented by the transformation in (4.7.4). Given generators Q, Q’, 
the actual construction of supersymmetric states will be spelled out below by 
solving the equations (4.7.6), (4.7.7). Clearly, from (4.7.5), a supersymmetric 
state defines the ground-state of the Hamiltonian H corresponding to the 
lowest point of its spectrum of zero energy. 

A theory for which the supersymmetry generators commute with H, as 
given in (4.7.1), and the ground-state is supersymmetric, is said to be super- 
symmetric. A special class of theories is for which the commutation relations 
in (4.7.1) hold true but for a ground-state 


Q |b) #0 (4.7.8) 
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Q* |v) =0 (4.7.9) 


(or Qt\wv) 4 0, Q|w) = 0). These theories are said to be spontaneously 
broken theories. For such a theory the ground-state energy is strictly positive 
as follows directly from (4.7.5). The latter is also necessarily degenerate. This 
is easily shown by noting that the states Q |W), |W), are orthogonal 


(HIQ| ¥) = (Qhy|y) =0 (4.7.10) 


where we have used the equality in (4.7.9), signaling the importance of this 
equality, and 


H |p) = Ely) (4.7.11) 
implies from (4.7.1), upon multiplying (4.7.11) by Q, that 
A(Q|¥)) = E(Q|y)). (4.7.12) 


Here we are interested, however, in the role of supersymmetry in studying 
the eigenvalue problem of quantum physics. 

We first develop a method to construct supersymmetric Hamiltonians. 
This is then followed by showing how supersymmetry, seemingly unrelated 
to the problem at hand, may be used, as mentioned above, to study the 
eigenvalue problem in quantum physics for a large class of Hamiltonians. Su- 
persymmetry will be also applied later in various chapters, notably in Chap- 
ter 6 in studying the Bose-Fermi oscillator, in Chapter 8 in reference to the 
Pauli Hamiltonian for a charged spin-1/2 particle in an external electromag- 
netic field, also discussed here. Several examples are also given in the present 
chapter. 


4.7.2 Construction of Supersymmetric Hamiltonians 


We first consider the one-dimensional case and define the following oper- 


ators: 
OS (2 [p —iw(x)] Ut (4.7.13) 


gt = j= [p +iw(ax)] U (4.7.14) 
where [2, p] = if, 
{W,Wi}=1, {¥,v}=0, {UT wi}=0 (4.7.15) 


and w(x) is a real function of . The operators Q, Q' were chosen to be linear 
in p to ensure, in particular, that H as given in (4.7.3) includes a kinetic 
energy term p?/2. So-called Fermi operators satisfying anti-commutation 
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relations in (4.7.15), were introduced to satisfy the conditions in (4.7.2). The 
operators V, Ut are assumed to commute with p and w(2). 

We specify the range of x to be a < « < b. Implicit in the definition of 
Q', given in (4.7.14) as the adjoint of Q, is that p is self-adjoint. That is, it 
is necessary that 


(g|pf) = (pg | f)- (4.7.16) 
In detail, (4.7.16) implies from 
(g|pf) = (pg| f) — ih lg" (b)F() — 9° (a) F(a) (4.7.17) 
that is, we must have 
g*(b) f(b) — 9*(a) f(a) = 0. (4.7.18) 


Thus in defining a self-adjoint operator associated with the momentum of 
a particle, one has to impose restrictions on the functions f, g satisfying 
(4.7.18) and, in turn, define the associated domain of the self-adjoint operator 
in question introduced. 

Of particular interest in applications are the cases where (a > —oo, b > 
oo), (a = 0, b > cw). With r = |x|, for example, the latter case may be 
applied to the radial part of the Schrédinger equation (assuming that it may 
separated into a radial part) with the usual boundary conditions of f(r) 
vanishing at r = 0 and r — oo. In many cases encountered in practice are 
those with vanishing boundary conditions at a and b. 

From (4.7.13)—(4.7.15), we readily obtain the explicit expression for H to 
be 


2m 2m 2m 


om (Z + so) 14 wi(2) [vt vy] (4.7.19) 


where we have used (4.7.3). 
The anti-commutation relations in (4.7.15) for the fermi operators U, Wt 
allow us to introduce the convenient representations for the latter 


Wi=qt= (; 5) (4.7.20) 


__ [00 
W=o = (; 5) (4.7.21) 
as obtained from the Pauli-matrices, with 


[wi Ww] =03 (4.7.22) 


leading to the supersymmetric Hamiltonian!? 


3 Throughout, it is understood that the first term in (4.7.23) is multiplied by the 
identity matrix. 


228 4 Spectra of Hamiltonians 


= - (p? + w*(a)) + wi (0)os. (4.7.23) 


For a supersymmetric theory, the ground-state |) is defined by the con- 
ditions 


Q\¥o) =0, Q* |v) = 0 (4.7.24) 
(see (4.7.6), (4.7.7)). That is, it is the solution of the differential equations 
a ee a 
—ih— —iw(x)| oT Yo(x) =0 (4.7.25) 
daz 
a: See = 
[ine + inte) a wvo(x) = 0. (4.7.26) 


Upon multiplying (4.7.25) by —o~, and (4.7.26) by +o* and adding lead 
to 


A ola) =  asdo(a. (4.7.27) 


The general solution of (4.7.27) is given by 


eee a F(z) +8 (?) Aw (4.7.28) 


where a, 3 are some constants and 


F(x) = exp € if ae w(e)) (4.7.29) 


The following elementary consideration shows that for an infinite interval, 
ie., a > —co and/or b > o~, if F(x) is square-integrable then 1/F'(«) is not 
and vice versa. To see this, note that for a function f(x) such that 


b 
0< ow ewe (4.7.30) 


we may use the Cauchy-Schwarz inequality to write 
=| f dx f ral <( re da | f(x yP) | fa 


and derive the bound 

b—a)? . 1 

eee < [ow = (4.7.32) 
Jade |f(a) Ja | F(@)| 

The latter shows that if (4.7.30) is true for a — —oo and/or b > ov, then, 


upon taking the corresponding limits in (4.7.32), that 1/f(x) is not square- 
integrable on such infinite or semi-infinite intervals. On the other hand, for 


(4.7.31) 


x)|? 
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finite intervals (a < b, |a| < oo, |b] < oo) the square-integrability of a func- 
tion f(a) and of 1/f(a) are not ruled out. 

We may use the inequality in (4.7.32) to conclude that for an infinite 
interval if F(a) is square-integrable, then for the square integrability of ¢o(x) 
we have to choose 3 = 0 (and vice-versa, vis-a-vis 1/F(x), to choose a = 0). 

We consider the construction of supersymmetric Hamiltonians in higher 
dimensions (see also Problem 4.14). To this end we provide examples of the 
interaction of a charged particle of charge e with an external magnetic field 
B = (0,0, B). Upon setting 7 = p—e A/c, with B = V x A, we have the 
following commutation relations 

[75,7] = 1 cianBn (4.7.33) 


as is readily checked. In two dimensions we may define the generators 


Q= [im +imjot, Qt= [im —im)o~ (4.7.34) 
m m 


which verify the relations Q? = 0, (Qt)? = 0, and from the relations 
{fot,o-} = 1, [o*,07] = o3, and (4.7.33) for 7 = 1, k = 2, we immedi- 
ately obtain the supersymmetric Hamiltonian 

2 2 h 
Ta, Lt Be (4.7.38) 


2m 4mc 


ae! i 


with g = 2, which is the celebrated Pauli Hamiltonian of spin —1/2 studied 
in Chapter 8, with the so-called g-factor restricted to the value of 2.'4:15 The 
value of g = 2 is only approximate for the electron (see 88.5) and such a 
departure may be formally interpreted as a breaking of supersymmetry. 

In three dimensions, we introduce Fermi operators W,, Wg as 4 x 4 matri- 
ces: 


_fo 90 _ (00 + _ {03 0 
n=(4 Rae He= (79): (wim) = (G2) 


{VW1,W.}=0, {W],wh}=0, {W,,vl}=16, 


(4.7.36) 


with obvious dimensionalities of the unit matrices. 


We define the generators Q, Q? 
f/m ; jm é 
i Q => (19 +im)v! +730, h Qi = (19 —im)Vy +73WVo (4.7.37) 


4 Here and in (4.7.38), the so-called scalar potential U is zero. 
' See also Khare and Maharana (1984), Cooper et al. (1995) and de Crombrugghe 
and Rittenberg (1983). 


230 4 Spectra of Hamiltonians 


where recalling that B = (0,0,B), give from (4.7.33) that [72,73] = 0, 
[71,73] = 0, i.e., 73 commutes with 71, 72. Thus we immediately verify that 
Q? =0, (Qi * — 0. The supersymmetric Hamiltonian is then readily worked 
out from (4.7.36), (4.7.37) and from the latter commutativity property of 73 
with 71, 72 to be 


1 ehg 
H = — (nj+73 +73) -——B 4.7, 
a (x? +75 +73) moe’? (4.7.38) 


arising from two identical copies, each as a 2 x 2 matrix. This is again the 
Pauli-Hamiltonian, now in three dimensions, with g-factor equal to two, and 
a magnetic field B along the x3-axis. 


4.7.3 The Eigenvalue Problem 


We now use supersymmetry as a tool to study the eigenvalue problem of 
the discrete spectrum of a large class of Hamiltonians (to be defined below), 
iLe., in particular, corresponding to normalizable eigenvectors (see §1.8) in the 
underlying Hilbert space. We restrict the study to one-dimensional cases, and 
as we will see below, through examples, that these cases are rich enough in 
applications. The higher dimensional cases in (4.7.35), (4.7.38) will be dealt 
with in Chapter 8. 

In detail, we may rewrite the supersymmetric Hamiltonian H in (4.7.23) 


H= cn ) (4.7.39) 
where 
H, = Ee + Vi (a) (4.7.40) 
Vi(r) = - (w? (x) + hw’ (x)) (4.7.41) 
and 
H_= ss + V_(x) (4.7.42) 
ac > (w?(w) — Faw!(2)) (4.7.43) 
Also we note that 
H_=H,- F wi(a), (4.7.44) 


It is convenient to introduce the operators 


fe aml Pie) (4.7.45) 
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Ai = at —iw(z)) (4.7.46) 

which allow us to rewrite formally 
H,=AtA (4.7.47) 
H_=AAt (4.7.48) 


and are referred to as supersymmetric partner Hamiltonians, and V;, V_ as 
supersymmetric partner potentials. The function w(x) is generally referred 
to as the superpotential. 

We assume that F(x), in (4.7.29), (4.7.28), is square-integrable on its 
domain of definition. From (4.7.26), (4.7.28), with a 4 0, (4.7.45) (4.7.47) 
imply that 


A F(x) =0 (4.7.49) 
H,F(x) =0. (4.7.50) 


That is, F(a) corresponds to an eigenstate of H, with zero eigenvalue. 
We are interested in the discrete spectrum of H belonging to a special class 
of Hamiltonians to be defined below. From (4.7.47), such a discrete spectrum 
(assumed non-empty) would necessarily fall on the positive real axis with 
zero as the lowest point of this spectrum. 

For simplicity of the notation only, we consider dimensionless variables in 
the remaining part of this section, and also divide, in the process, the Hamil- 
tonians by suitable conversion energy scales thus defining their dimensionless 
counterparts. 

Before treating the method in the remaining part of this section on how 
supersymmetry may be used to study the eigenvalue problem of a given class 
of Hamiltonians, we consider the following preparatory examples. 


1. Suppose that 
w(z) = -(z-§£)=w(z,8), -o<z<0o (4.7.51) 


where € is an arbitrary parameter. This leads to the Hamiltonians: 


Hy(6) = -3 5 + Valn8) (4.7.52) 
with 
Valas€) = 5(2- 8)? - 5 (4.7.53) 
and , 
Din ae (4.7.54) 
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with 
V_(z,6é) =Vi(z,6)4+1 (4.7.55) 


where we have made the dependence on the parameter € evident. 
In particular (see (4.7.29)), up to a proportionality constant, 


F(z) = exp (-3¢ - 6) (4.7.56) 


The simplicity of the relationship between the potentials V;(z,&), V_(z,&), 
in (4.7.55) is to be noted. 


2. As another example, consider the function 


(€+1) 1 
Zz (€+ 1) 


w(z) = =w(z,l), 0<z<aw (4.7.57) 


where = 0,1,2,.... Here it is readily shown that 


“Weos (ry . + zs 7 (4.7.58) 
and ae 
V6 =ViG04 HH = ees (4.7.59) 
Also, 
F(z) = (2) exp(—z/(€+1)), 0<z<00 (4.7.60) 


with vanishing boundary conditions at z — 0, z — oo. Again the interesting 
relationship between the potentials Vi(z, 2) is to be noted. 


3. Finally, consider the function 
w(z) = —tanhz=w(z,é), —co<z<0o (4.7.61) 


where the parameter £ > 0. This leads to the potentials 


Vi.(z,€) = = (€? — €(€ +1) sech?z) (4.7.62) 


1 

2 
1 

V-(26) =Valeé-1)+ (€- 5) (4.7.63) 

An interesting and simple relationship between the potentials V,(z,€) 


emerges again. 
For F(z) we have, 
F(z) = exp (-< / dz tanh :) 
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= exp (€ In(sech z)) 


or 
F(z) = (sech z)é (4.7.64) 
and for € > 0, F(z) is square-integrable over —o0 < z < ov. 


The last three examples given above and, in particular, equations (4.7.55), 
(4.7.59), (4.7.63), for the supersymmetric partner potentials, suggest to con- 
sider the following class of supersymmetric partners H,(&€), H_(€) depending 
on some parameter € such that 


V_(z,§) = Vi(z,&1) + R(Z1) (4.7.65) 
where R(£,) is independent of z, 


R(&i) £0 (4.7.66) 


and there exists some function f such that 


f(§) =&. (4.7.67) 


For the class of so-called supersymmetric partner potentials Vi(z,€) 
which satisfy equations of the structure in (4.7.65), we note that apart from 
the additive z-independent term R(£1), such pairs of potentials have similar 
shapes!® when parameterized with, in general, different parameters. 

For example, in (4.7.55), we have the identity transformation 


f(Q) =€ (4.7.68) 
and 
R(é) =1. (4.7.69) 
In (4.7.59), we have with ¢ = €, 
F()=€+1 (4.7.70) 
and (2€ +3) 
RE+)) = sea ra (4.7.71) 
Finally in (4.7.63) 
f(Q)=€-1 (4.7.72) 
and 1 
R(E—- 1) =€— 5. (4.7.73) 


'® Because of this property, partner potentials V+(z, €) satisfying (4.7.65) have been 
referred to as being shape invariant in Gendenshtein (1983). 
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For the class of supersymmetric partner potentials satisfying (4.7.65), we 
may write for the corresponding Hamiltonians 


H_(€) = Hy(&) + R(&i). (4.7.74) 
Also by setting 
f(k) =€r+1, &=0,1,... (4.7.75) 
with 
f= é (4.7.76) 


we may also write a corresponding expression to (4.7.74) the following one 
by replacing € — &%, 1 — &41: 


A_(&) = Hs (En41) + Rlén41): (4.7.77) 


By definition of the H,(&) considered, (4.7.49), (4.7.50) admit a square- 
integrable solution on its domain of definition. That is Wo(z,€) = C(€) F(z, €) 
satisfies 


A(E)¥0(2,€) =9,  [Ieo(-s EI] = 1 (4.7.78) 


H4(€)vol(z, €) = 0. (4.7.79) 


For the present analysis, we suppose, as an induction hypothesis, that for 
allk =1,...,K for some K, that 


u(z,€) = AN(E)AN(E)... AT (Ex-1)Wo(z, &) (4.7.80) 


are square-integrable, on the domain of definition of the variable z, and when 
normalized, satisfy the boundary conditions imposed in the problem at hand, 
and then generalize the result by induction. 

Consider the state AT |yo(€1)), and hence from (4.7.47), (4.7.48) we may 


write 


HF, (€)A'(€) |o(E1)) = At (E)A_(€) |bo(E1)) 
= Al (€) [H4(£1) + R(&)] |Yo(&)) 


= R(&:)At(€) [Yo(Es)) (4.7.81) 


where we have used (4.7.77) and (4.7.79) with € — & in the latter. 
Accordingly we have the properly normalized state 


(i) 
Ex (§) 


where the (i) factor is chosen for convenience, and 


lwi(€)) = At (E) |Wo(&1)) (4.7.82) 
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Fy (€) = R(&) £0 (4.7.83) 


(see (4.7.66)) implicit in the normalization that has to be done of the ¢%(z, ) 
in (4.7.80), 
Fs(§) |i (€)) = Ex(€) |i (6)) - (4.7.84) 


Similarly, from (4.7.77), (4.7.84) 
H(E)AT(€) [a (E1)) = AT(E)H_(E) [1 (€1)) 
= AN(€) [H4(E1) + R(&)] [Ya (€1)) 
= [Ei(f1) + R(&:)] At (€) [Y1(1)) 
= [R(f2) + R(Ex)] AN) [y(&1)) (4.7.85) 


where we have finally used (4.7.75) with k = 1. Thus we may introduce the 
properly normalized state 


= (i) At (E4 
2(€)) AG (€) |i (€1)) 
be oo se At(ey\At(é, 9 4.7.86 
PlO\EG) (€)A™(£1) [Wo(£2)) ( ) 
where 
E2() = R(é1) + R(E2) (4.7.87) 


and in writing (4.7.86) we have invoked the definition (4.7.82) with € — &, 
and hence also with |w9(1)) — |Wo(&2)). By hypothesis of normalizability, 
Bo(€) Ei (é1) £0. 

By induction, we obviously obtain from (4.7.82), (4.7.83), (4.7.86), 
(4.7.87) the state 


le (€)) = Cy(€)At(E) At (Ex)... Ab(Ex—1) |[o(Ex)) = Cr(€) [ba ()) (4.7.88) 


(see (4.7.80)), where the normalization constant C;,(€) is given by 


Cy (€) = (i)* (Bx (€)Ex—1(&1) .-. Ei(Ee—1)) 1? (4.7.89) 
and 
Fy |e (€)) = Ex(€) lve (€)) 
(4.7.90) 
Fs |Wo(§)) =0 
with % 
Ex() = DRI). (4.7.91) 
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The expression of the normalization constant C;,(€) may be readily ex- 
pressed as a function of € as follows. 
For different k, we have from (4.7.91): 


k—m+1 
Ex—mi() = RU) + D7 REE) (4.7.92) 
k—m-+1 
By in(€i) = yy R(E;) (4.7.93) 


where in writing the latter equation we have made use of (4.7.76). Hence 


Ex—m(&1) = Ex—(m—1)(€) — R(E1)- (4.7.94) 


Upon carrying out the transformation € — &,-1, and hence €; — &m, 
(4.7.94) reads 


Ex—m(&m) = Epes) ~~ R(Em) 


= Ep nay Ca=3) a R(Em-1) > R(Em) 


= Ex(§) — [R(&1) +--+ + R(Em)| 
= Ex(§) — Em(§). (4.7.95) 


By choosing, in turn, m = 1,2,...,k —1, in this equality, we note that the 
normalization constant C;(&) in (4.7.89), (4.7.88) may be rewritten as 


Cy (€) = (i)* (Ex (€) [Ex (€) — Ex(8)] + (Ex (€) — Ex-1(Q)) 7. (4.7.96) 


We may summarize the main result of this subsection thus far, as obtained 
from supersymmetry theory, as follows. 


> Summary: “Suppose we are given a Hamiltonian of interest H+(£), de- 
pending on some parameter €, defined by 


—~ + Vi (z,€) (4.7.97) 


(w?(z, €) + w'(z,)) (4.7.98) 


with w’(z,€) = Ow(z,€)/dz. 
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Suppose that the following Hamiltonian H_(€): 


2 
H_() = — + V_(z,€) (4.7.99) 
with i 
V_(z,€) = 5 (w?(z,€) — w'(z, €)) (4.7.100) 
is such that 
V_(z,€) = V+(z, 61) + R(é1) (4.7.101) 


i.e., the shape invariant property of the supersymmetric partner potentials 
holds true, where R(£1) is independent of z, R(€:) 4 0, and there exists some 
function f such that 


fQ=& (4.7.102) 
and in general 
f (Ek) = €&+1 (4.7.103) 


fork = 0,1,...,K (for some K, which may be finite or infinite), &) = €. Then 
with the square-integrability of the w,(z, €) in (4.7.88), (4.7.89), (4.7.96) (up 
to some K) implicit, on their domain of definition, and consistent with the 
boundary conditions imposed in the problem at hand, the w,(z,&) satisfy 
the eigenvalue equations in (4.7.90) with eigenvalues given in (4.7.91) for the 
Hamiltonian H,. Also implicit is that wWo(z,&) is square-integrable (normal- 
ized) as a solution of the equation 


A(z, &)vo(z,€) =0 (4.7.104) 


consistent with the boundary conditions in the problem, and defines the 
ground-state, where 


Ao: |-is ES iu(e.6) : (4.7.105) 


The function wWo(z,&,) in (4.7.88) is defined by making the formal replace- 
ment € > £4.” < 


It is important to point out that the above construction for eigenvalues 
and eigenvectors just uses supersymmetry as a method of application to a 
given Hamiltonian H,(€) of interest satisfying the properties spelled out 
above. 

We apply the method just developed to study the eigenvalue problem of 
two of the three examples mentioned through (4.7.51)—(4.7.64). The last one 
is left as an exercise to the reader (see Problem 4.15). 


1. For w(z,&) defined in (4.7.51), —oo < z < co 


AL(§)=A()+1, f(E)=E& RE) =1 (4.7.106) 
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(see (4.7.68), (4.7.69)). Hence from (4.7.90), (4.7.91), the eigenvalues of 


ges 1 
Hy(é) = z £7 
+(6)= saat 5% — 8)" — 5, (4.7.107) 
are 0, and 
k 
tiga Shy BS lan (4.7.108) 
i=l 
From (4.7.104), the ground-state is given by 
1\1/4 
vol28)= (4) ew (-@-8) 2 (4.7.109) 


and from (4.7.88), (4.7.96), the eigenvectors corresponding to the eigenvalues 
€, = € are 


k 
bel2,€) = oe | in tite 0| bo(%.€) (4.7.110) 
for k = 1,2,..., respectively, since €j = €, 
Cu) = F(R? (4.7.11) 


and is independent of €. Due to the exponential factor in (4.7.109), w,(z,) 
is obviously square-integrable for all k = 0,1,.... The connection of H,(§), 
for € = 0, with the elementary harmonic oscillator problem is obvious (86.1). 
The i* factor in C;,(0) leads to the conventional normalization of the harmo- 
nic oscillator wavefunctions (see §6.1) up to an overall minus sign. 


2. For w(z, @) defined in (4.7.57), 0 < z < o, 


(20 +3) 
2(E+ 1)2(€+ 2)2 


H_(@) = Hy(@+1) 


(elt: 4 1 
= Hy(€+1) +5 laa iy mE | (4.7.112) 


(see (4.7.71)), 


R(€+1) = ; se se (4.7.13) 


f=, & =f+k, k=1,2,.... (4.7.114) 
Therefore the eigenvalues of 


1d 1 &e4+1) 1 
+ + 
2dz2 z 227 2(€4+ 1)? 


H,(Q) = (4.7.115) 
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are from (4.7.91), 


1 1 1 
Eh) = om Fea tie 


j=l 


ie, St 1 
=5 = i aes a (4.7.116) 


and for k — 0, this reduces to the eigenvalue 0 for the ground-state. We note 
from (4.7.104), (4.7.105), that 


— ifd &+1 1 eer 
Al(é;) = a E esi: & =l4+5 (4.7.117) 
tho(z, £) = ce(z)"** exp (—z/(€+ 1) (4.7.118) 


and more generally 
tho(%,&j) = ce; (2) *? exp (—2/(& + 1)) (4.7.119) 


with the eigenvectors |7;,(@)), for k = 1,2,... defined in (4.7.88), (4.7.96). 
The connection of H+(£) to the (one-dimensional) radial part of the 
Hamiltonian of the hydrogen atom (Chapter 7) 


1d 1 &e+1) 


A 2dz2 z 222 


(4.7.120) 


with ¢ as non-negative integers, should be noted. Here H(£) is expressed in 
terms of the dimensionless variable 


z=rpe/h (4.7.121) 


where jz is the reduced mass of the atom, and in energy conversion units of 
pie’ /h?. It follows from (4.7.115), (4.7.116) that the eigenvalues of H(£) are 


—- eee (4.7.122) 

Since = 0,1,2,..., one may introduce a natural number n by setting 
€+k+1l=n. (4.7.123) 
Therefore for a fixed n value, k = 0,1,...,n — @—1. Thus upon eliminating 


k in the states |y,(¢)) in favor of n and @, the allowed values of £ become: 
€=0,1,...,n—1 for a fixed n, where n specifies the eigenvalues in (4.7.122). 
In detail, we then have from (4.7.88), (4.7.96), (4.7.119) with 


f&=l+k=n-1 (4.7.124) 
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that 
Yn—e-1 (28) = Cn—1Cn—e-1 (CAT (Q)AN(E +1) + AN(n — 2)2"e 7!" 
=dne(z) for €=0,...,.n-2; n=2,3,... (4.7.125) 
and from (4.7.118) 
Wo(z,n—1) = Gazette w/" 
=dnn-i(z), for €=n-1; n=1,2,.... (4.7.126) 


The normalization constant factor Cy—¢_i(@) in (4.7.125) is defined in 
(4.7.96), with (see (4.7.91), (4.7.113)) 


Li a 1 
B,(@) = 5 io a (4.7.127) 


where now 7 = 1,...,n—@—-1. 

The wavefunctions @y,0(z), for €=0,...,n-1; n=1,2,..., in (4.7.125), 
(4.7.126), are square-integrable on 0 < z < oo, as one-dimensional integrals, 
and vanish at z = 0, z — oo. 

For example, for n = 2, c, = 1/V24 = (1/2)3/2//3, and for @ = 1, 
(4.7.126) gives 


1 ay? 2 
2,1(z) = (5) ae (4.7.128) 
For ¢ = 0, (ie., k = 1), E,(0) = 3/8 in (4.7.127), hence 
_ |8 
C1(0) =iy/ 3 (4.7.129) 


and 


w= EB [Eo soe 


= Gs z(2— zje"*/?, (4.7.130) 


The conversion factor z — r, is given in (4.7.121). 


If the property of shape invariance of a pair of supersymmetric partner 
potentials, as defined in (4.7.65), holds, that is 


H_(€) = 44(&) + RG) (4.7.131) 


with (4.7.66), (4.7.67) satisfied, we may infer from (4.7.91) that 
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k 
S> R(Ei41) + R(E1) = Eegi() (4.7.132) 
i=1 
with k = 0,1,... (up to some K which may be infinite) are eigenvalues of 


H_(€) and note, in the process, that the eigenvalue 0 of H+(&1) is missing 
for H_(€) with E,(&) representing the ground-state of the latter. Such a shift 
of the eigenvalues for a supersymmetric partner Hamiltonians is typical. The 
eigenvectors of H_(&) in (4.7.131) corresponding to the eigenvalues E;,+1(€) 
in (4.7.132) are from (4.7.88) given by 


Lxm41(€)) = Ce(E1) At(E1) At (E2)-+- AT (Ex) [to (Ex41)) (4.7.133) 


for k = 1,2,... (up to some K), corresponding to eigenvalues F(&), 
E3(€),..., respectively, and 


Ix1(€)) = |vo(1)) (4.7.134) 


corresponding to £;(&). Implicit in this is that |¢zo(€)) and the states in 
(4.7.88) (up to some K’) are square-integrable and consistent with the bound- 
ary conditions imposed in the problem at hand. Implicit also, is that the latter 
hold true for € — &). 

Irrespective of any shape invariance properties of a given pair of supersym- 
metric potential partners, the following question arises. Does an eigenvalue 
E, say, of a supersymmetric partner H,, i.e., corresponding, in particular, 
to a square-integrable wavefunction, is also an eigenvalue of the partner H_? 
The situation is different whether E = 0 (i.e., the ground-state energy of H4, 
assuming it exists) or E # 0. 

Suppose £ = 0 is the ground-state energy of H,. That is, F in (4.7.29) 
is square-integrable, as the solution of (see (4.7.49)) 


AF=0 (4.7.135) 


and satisfies 
Al AF=0. (4.7.136) 


From (4.7.25), (4.7.28), (4.7.29), (4.7.46), (4.7.48), we are formally led to 
consider the pair of equations 


At (5) =0 (4.7.137) 
AAl (=) = (4.7.138) 


and at least for problems defined on infinite or semi-infinite intervals, we 
may be conclude from (4.7.30)—(4.7.32) that 1/F is not square-integrable 
and hence cannot correspond to any eigenvalue (discrete spectrum). 
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On the other hand if EF 4 0 is an eigenvalue of H,, then 
Hy |b) = ATA) = Ely) (4.7.139) 
where ||w|| < oo. Upon multiplying the above by A from the left, we obtain 
A Al (A|d)) = H_(A|d)) = E(Alv)) (4.7.140) 


where we have used (4.7.48), and rather formally infer that E also belongs 
to the discrete spectrum of H_. 
In the light of the above discussion, we consider a couple of examples. 
Consider the superpotential 


w(z) =—tanhz, -co<z<o (4.7.141) 


corresponding to € = 1 in (4.7.61). Then 


Hy.= SG 2 sech?z) (4.7.142) 
fe eee Tg sech” z we 
and 2 
1 1 


(see (4.7.62), (4.7.63)). Clearly 0 cannot be an eigenvalue of H_. 
The ground-state wavefunction F(z) of H, is given from (4.7.64) to be?” 


F(z) = sech z (4.7.144) 


(up to a normalization constant) which is square-integrable and vanishes 
for |z| — co. On the other hand, 1/F(z) = coshz, apart from its notable 
properties for |z| — oo, is not square-integrable. A second formal solution to 
this, as obtained from Problem 4.9 (iii), gives the function sinh z and hence 
is not acceptable either. That is, 0 is not an eigenvalue of H_, with the latter 
having an empty discrete spectrum, as expected. 

As another example, consider the superpotential 


w(z)=mcotmz, 0<2z<1. (4.7.145) 
Hence 
1 d? 1? 
A, = --— -— 4.7.14 
+ 2dz2 2 ea 
1 d? 2 2 1 


'” Note that a second formal solution of, H+g = 0 as obtained from Problem 4.9 (iii) 
is given by g(z) = [sinh z + zsech z] and is not acceptable because it is not square- 
integrable and its related bad behavior for |z| — oo. 
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where we note that V_(z) — oo for z > 0,1. 
The ground-state of H, is given by 


rte)moo( f" ras extx2) 


= sin 7z (4.7.148) 


(up to a normalization constant), which is square-integrable on [0,1] and 
vanishes for z > 0,1. The function 1/F'(z) = csemz, apart from its notable 
properties for z — 0,1, is not square-integrable, neither is a second formal 
solution of Hig = 0, given by 


ee E = = (4.7.149) 
sin 7 Z 27 
as obtained from Problem 4.9 (iii). [Note that g(z) in (4.7.149) was chosen 
such that g(0) = 0.] That is 0 is not an eigenvalue (i.e., not in the discrete 
spectrum) of H_. 
Now consider the eigenvalue E = 3m7/2 of Hy in (4.7.146) with eigen- 


vector represented by ~(z) = sin27z (up to a normalization factor). From 
(4.7.140), (4.7.145), we are then led to consider the function 


i [fd . 
A(z) = 5 E — 7 cot r,| sin 27z 


= inv2sin? rz (4.7.150) 


(vanishing at z = 0,1) which is square-integrable, hence the non-zero eigen- 
value E = 3n7/2 is also in the discrete spectrum of H_. 

The formal rules spelled out above about a zero eigenvalue versus a non- 
zero & # 0 eigenvalue are, in general, true modulo some mathematical sub- 
tleties. A careful analysis of this involving all subtleties, with the boundary 
conditions at hand, for all general cases is quite tedious, and it is relatively far 
easier to examine a pair of supersymmetric Hamiltonian partners at a time. 
The moral of the formal analysis at the end of this section given through 
(4.7.135)—-(4.7.140) is this. By studying the eigenvalue problem of a simple 
Hamiltonian, as in (4.7.146), one obtains information about the eigenvalues 
of a relatively much more complicated Hamiltonian, as in (4.7.147), with little 
extra work! 

Much work has been done on the role of supersymmetry in quantum 
physics and not just on the eigenvalue problem and we refer the reader to 
the rapidly developing literature on this subject in physics journals. We will 
have ample opportunities to apply notions of supersymmetry in various other 
chapters as well. 
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Problems 


4.1. 


4.2. 


4.3. 


4.4. 


4.5. 


4.6. 


4.7. 
4.8. 


Consider the spherically symmetric potential V(r) = —Ae~"/?, \ > 
0, R > 0 providing an elementary and rather formal model for the 

deuteron, as a loosely bound-state of a proton and a neutron with a 

binding energy FE ~ —2.23 MeV, with @=0. 

(i) By choosing No = 1 for @ = 0, in (4.5.86) providing exactly one 
bound-state, obtain a lower bound for as a function of h?/wR?. 

(ii) Find the exact solution for the ground-state (¢ = 0). 

(iii) Taking, as input data, R = 2.180 fm (fm = 10~1° m), FE = —2.23 
MeV with the reduced mass of the deuteron pp = MpmMy/(Mp+Mn), 
obtain a value for A (and the potential energy “depth” Up in 
(4.2.89)) admitting only one bound state. Is your estimate con- 
sistent with the lower bound in (i)? 

For the Yukawa potential V(r) = —\e~"/"/(r/R), in three dimen- 

sions, with A > 0,R > 0, choose 2uR?/h? = 3/X. Using a radial trial 

wavefunction u(r) = Ae~’/”o for ¢ = 0, with respect to the measure 
r2dr, derive an upper bound for the ground-state energy (€ = 0) asa 
function of A, by optimizing over the parameter R/ Rp. What can you 
say about the maximum number of bound-states this potential can 
ever have for = 0? 
For the Hamiltonian 
1d? ae 

Be eg it cot“(7z), O<z<1l 
in one dimension, with a confining potential, expressed in terms of 
a dimensionless variable z, and in some suitable energy unit, find the 
eigenvalues and the eigenvectors for this Hamiltonian. [The reader will 
be asked to reconsider this Hamiltonian in Problem 4.16 in the light of 
supersymmetry in reference to the pair of Hamiltonians in (4.7.146), 
(4.7.147) for direct comparison.] You may express the solutions for the 
eigenvectors in terms of hypergeometric functions (see, e.g., Arfken 
and Weber (1995) for the properties of these functions.) 
Show that in two dimensions, the trace [d?x (x|A?|x) in (4.5.42) 
for p = 1, where A is given in (4.5.27), is infinite which necessitates 
another choice for p, e.g., o = 3/2 as in (4.5.93). 
Prove the inequality in (4.5.93), where A is given in (4.5.27). [This is 
a hard nut to crack.| 
Derive the general expression in (4.6.8) for the expectation value of 
the kinetic energy T which is also valid for v = 2,1. 
As in Problem 4.6, derive (4.6.11) which is valid for v = 3, 2,1. 
Extend the analyses in deriving a lower bound to T in 84.6 for multi- 
particle systems in vy = 2,1 dimensions as done there for v = 3. Can 
you extend the analysis also to higher dimensions? 


4.9. 
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Consider two possible functions f(z), g(z), satisfying formally the 
time-independent Schrédinger equation in one-dimension 


3a ae via) y(z) = By(2) 


expressed in terms of a dimensionless variable, where F is a real num- 
ber (the eigenvalue). 
(i) Show that this equation leads to the constraint 


9 (2) f(z) - f'(2) gl) = 4 


where a@ is some constant which depends on the boundary condi- 
tions to be imposed on the functions. [Note that this property is 
consistent with the Hermiticity condition of the Hamiltonian in 
question. | 
(ii) Show that for f’(z)/f(z) = —w(z), where w(z) is some given func- 
tion, g(z) satisfies the differential equation 
a 


g'(z) + w(z)9(2) = Ft) 


and hence the homogeneous solution of this equation coincides 
with the one satisfied by f(z). 
(iii) Show that the general solution for g(z) in part (ii) is given by 
(= afte) | + 6402) 
g(z) = af (z = z 
f?(z) 

where (3 is a constant. That is, given a formal solution f(z) of 
the Schrédinger equation above, then any other formal solution is 
related to f(z) as given above. Also for the boundary conditions 
which specify that a = 0, the above shows that the formal solution 
f(z) is unique. 


4.10. Show that any Hamiltonian H = p?/2m + V(x), which has a finite 


4.11. 


ground-state energy Eo with ground-state wavefunction 7o(x), may 
be rewritten as H = A'- A+ Ep, where 


[Note that, in one-dimension fi (O%9/0x) /o = w(x) in (4.7.45), for 
Wo(x) square-integrable. | 

The Virial Theorem. Consider the Hamiltonian of a single particle 
H = Hy) + V(x), where Hp = p?/2m, and the eigenvalue equation 
Hy = Ew. Suppose that (x) is non-vanishing in a region R, of 
volume v, only. R may be of finite or of infinite extension. For example, 
for a particle confined to a rigid sphere of finite radius, FR is of finite 
extension. 
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(i) Show first that for a given function $(x) 


| d?x Yi"Hod = [a2x (How")¢ 


he? * * 
— 5 fas. [uo - (Wu) 
m s 
where the volume v is bounded by the closed surface S’,, and that 


(x-V Ho v) =—2 How + Ho(x- Vv). 


(ii) Upon multiplying the eigenvalue equation [Hp + V(x)] w~(x) = 
Ew(x) from the left by ~*(x-V), and using the results in (i) 
show that 


h2 
2 (Ho), VV), = 5 faSe[(Vur)(x- Vv) — wr Vix Vv] 
where (A), = f,, d°x p*A y. 
(iii) For a multi-particle system of N particles, with Hamiltonian 


N 
H= Spi /2m; + V(x1,...,XN) 


i=l 


and eigenvalue equation 


Ay(x1e1,...,Xnven) = Ev(x1e1,...,XNEN), 
where (€1,...,€) = € are, in general, extra labellings needed to 
specify the particles, such as spin, charge, ..., show that the virial 


theorem in (ii) is readily generalized to 


N N 
2 (x ei/2m,) = (Ses °V;5 ”) 


j=l 


— S- S- a [ex fares d?x,_4 dPxp41 oe dex 
€ k=1 2m 


N N 
x pasy: (Vi b") (Sox Vid | —v'Ve | Sx) Vib 
j=l j=l 


where, in an obvious notation, the integrations are over the con- 
figurational coordinates. 
[Ref: Mare and McMillan (1985).] 


4.12. 


4.13. 


4.14. 


4.15. 


4.16. 


4.17. 
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Verify the commutation relation (4.7.33), and show that the generators 
in (4.7.34) lead from (4.7.3) to the Hamiltonian in (4.7.35). 

For the Fermi operators in (4.7.36), verify the underlying anti- 
commutation relationships given there. Then show that the generators 
in (4.7.37) lead to the Hamiltonian in (4.7.38). 

Show that, as direct generalization of the two Fermi operators defined 
in (4.7.36) as 4x 4 matrices, one may introduce three Fermi operators, 
as 8 x 8 matrices, 


_({% 0 _ (WV. 0 _ (00 
MA 0 —wv, )° X2 = 0 —w, )° X3 = 10 


where W,, V2 are given in (4.7.36), and satisfy the anti-commutation 
relations 


{xis Xj} = 9, {xt.x}} =o, {xinxt} = 164. 


Use these operators to generate supersymmetric Hamiltonians in 
higher dimensions. 

Study the eigenvalue problem for the Hamiltonian H,(€) with super- 
potential w(z,€) in (4.7.61) with shape invariant partner potentials 
satisfying (4.7.63). [Determine, in particular, the number of the eigen- 
values. | 
From the elementary eigenvalue problem associated with the Hamil- 
tonian H, in (4.7.146) for a particle confined to a one dimensional 
box with constant potential —7?/2, use supersymmetry to generate 
the eigenvalues and eigenvectors for the far more complicated Hamil- 
tonian H_ in (4.7.147). Compare your results with the ones obtained 
in Problem 4.3. 

Consider the superpotential 


w(z) =< — 2 0<z<m. 

(i) Derive the expressions for the supersymmetric partner potentials 
V4, VL. 

(ii) Find the ground-state of H,, corresponding to the eigenvalue 0. 
Is the value 0 also the ground-state energy of H_? Show your 
analysis in detail. 

(iii) Refer to Chapter 6, if needed, to find the eigenvalues and eigen- 
vectors of H, with the latter vanishing for z — 0,z — oo. Do all 
the eigenvalues E 4 0 of Hx also belong to H_? 


5 


Angular Momentum Gymnastics 


Under a coordinate rotation (§2.1) by an angle y about a unit vector n, 
a state |¢) obeys the transformation law (see (2.3.1), (2.3.43)) 


1!) = exp (Fen-3) Wy) (5.1) 


where J is the angular momentum operator (§2.7), satisfying the commuta- 
tion relations 


[ee |W inet G2) 
i,j =1,2,3. 
By defining 
JS (PF) + PP + (PY (5.3) 
we note that 
Le |) 30, eS 2: (5.4) 


Since the different components J* do not commute, but they all commute 
with J, we may find simultaneous eigenstates of J* and one, and only one, 
of the components J‘ which is traditionally taken to be J°. 

We denote the simultaneous eigenstates of J?, J? by |a, 3): 


J? |a, 3) = ala, 8) (5.5) 
J* |a, B) = Bla, 8). (5.6) 


In a given physical problem, there may be, in general, in addition to 
J”, J®, more observables constituting together a complete set of compatible 
observables. In such cases, the simultaneous eigenstates of the corresponding 
complete set of commuting operators, will depend on additional parameters to 
qa and @, which for simplicity of notation will be suppressed. We suppose that 
these additional operators, in the complete set of operators, each commutes 
with the angular momentum operator J. The latter means that if we denote 
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the simultaneous eigenstates of the complete set of observables, including J a 
J?, by |a, 3, «), with « denoting collectively the additional quantum numbers 
needed to specify the eigenstates, then 


(a’, 3’, 6 |Jla,B,«6) =0 unless a’ =a and k’ =k (5.7) 


where we recall that « is the eigenvalue of J? (see (5.5)). 
To study the eigenvalue problem (5.5), (5.6), the introduction of the so- 
called ladder operators 


Iga P Li (5.8) 
turns out to be very useful. They satisfy, in particular, the following relations 
[JP oss] 0 (5.9) 
ae eee ae (5.10) 
[Jey J | As? (5.11) 
Bl Seo ae OS fad ale 9 (5.12) 
Jo=JSt 

and from (5.7) 

(a’, 3’, 6 |Jzla, 8,6) =0 unless ao! =a, kb’ =k. (5.13) 


Finally we summarize pertinent properties established in §2.7, §2.8 for 
arbitrary spins. 
The general structure of the angular momentum J is given in (2.7.6) to 
be 
J=XxP+S (5.14) 


where X denotes the position operator associated with a particle, or of the 
center of mass position of a system of particles, and, the spin, S is the trans- 
lational independent contribution to J, 


[Pest] 0 (5.15) 

and 
[X*, $7] =0 (5.16) 
[S*,.9] = ine**.9* (5.17) 


[Se S"|=0, (5.18) 
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Arbitrary spins s > 1/2, may be described by a completely symmetric 
spinor w%--%2s (§2.8) and, under a coordinate rotation by an angle y about 
a unit vector n, it transforms as given in (2.8.1). For s = 0, we have 


v(x’) = o(x) (5.19) 


where x’ = Rx. 
Under a coordinate rotation by an angle 27, about any unit vector n, one 
has the profound relation (2.8.68): 


yt 428 (x) = (—1)?8qp2t-028 (x), (5.20) 


for all s = 0,1/2,1,3/2,.... This relation emphasizes the odd nature charac- 
ter of half-odd integral spins. Only a rotation by 47 (or integer multiples of 
it) restores a (+1) value for the corresponding transformed spinors for such 
spin values. The phase change of (—1) occurring for half-odd spin values is 
observable and will be discussed in §8.12. 

The unitarity relation (2.8.24), (2.8.25), between spinors and wavefunc- 
tions should be noted. The matrix elements of the operator exp (;yn : S) 
with respect to the simultaneous eigenstates of $3, S* (see (2.8.45), (2.8.46)) 
are given in (2.8.51). 

This chapter covers in fairly details most of the aspects dealing with 
the intricacies of angular momentum and is quite complete. The eigenvalue 
problem (5.5), (5.6) is discussed in §5.1 followed by determining the matrix 
elements of finite arbitrary rotations in §5.2. Orbital angular momentum and 
spin are covered, respectively, in §5.3 and 85.4. The vectorial addition of two 
independent angular momenta and the respective rules are worked out in 
85.5, 85.6. So called vector and, more generally, tensor operators are treated 
in 85.7, §5.8 which lie in the heart of the theory of angular momentum. In §5.9, 
we consider the problem of combining several independent angular momenta. 
The final section, §5.10, deals with the definition and construction of single 
and two particle states of arbitrary spins in terms of angular momentum 
components which have become quite useful over the years. 

The association of the word “gymnastics” with a topic such as angular 
momentum involving “infinite” details is due to A. S. Wightman and we find 
it quite appropriate to include it in the heading of this chapter. This in no 
way, however, is meant to diminish the importance of the subject of angular 
momentum. 


5.1 The Eigenvalue Problem 
We consider the eigenvalue problem 


J? |a, 8) = ala, 8) (5.1.1) 


252 5 Angular Momentum Gymnastics 
F° |a, 8) = Bla, 8) (5.1.2) 
which on account of J? > (38)’, imply that 
asp. (5:1,3) 


Since J’, J’ satisfy (see (5.2), (5.4)) exactly the same commutation rela- 
tions as S”, 9% (see (2.8.46), (2.8.47)), the solution of the eigenvalue problem 
in (5.1.1), (5.1.2) may be formally inferred from (2.8.46), (2.8.47). The fol- 
lowing independent and general derivation is, however, illuminating. 

Let 3 denote the largest possible value for @ satisfying (5.1.3). Then from 
(5.10) 

PSs. 


a, 3) = (8B +h) Jy la, B) (5.1.4) 


which is consistent with the fact that B is the largest possible value for 3 
satisfying (5.1.3) only if Jy la, B) is the zero vector. The latter, in particular, 
implies that 


JI p10: B= 0 (521.5) 
which from (5.11), (5.1.1), (5.1.2) gives 


(a-B — 1B) a, B) =0 (5.1.6) 


and hence 


a=B(B+h) (5.1.7) 


since |x, 2) is not the zero vector. 
Let @ denote the smallest possible value for 3 satisfying (5.1.3). Then 
from (5.10), 


J? J_|a, 8) = (@ —h) J_ |a, 8) (5.1.8) 
leading, in analogy to (5.1.7), to 
a= B(B—h). (5.1.9) 


Upon adding 8 to both sides of (5.1.9), (5.1.7) and comparing these equa- 
tions, we obtain 


(8+ 6) (@-6+h) =0. (5.1.10) 
Since 8 < B, this gives the unique solution 
o=-B. (5.1.11) 
By repeated applications of (5.10), with the upper sign, k times lead to 
J® (J4)* Ja, BY = (8+ Ak) (J4)* |e, B) (5.1.12) 


which imply that there exists a largest non-negative integer k such that 
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B=B+hk (5.1.13) 

or from (5.1.11) to 
B= — =hj, B= hj (5.1.14) 


where we have denoted the latter by Aj. Accordingly, from (5.1.7), we may 
write 


a =h?j3(j +1) (5.1.15) 


and due to the very nature of k as a non-negative integer, the possible values 
of 7 are restricted to 


j=, . 1, os (5.1.16) 
On the other hand, from (5.1.12), (5.1.13) we may write 
B= B+hk = —hj + hk (5.1.17) 
and since k = 0,1,...,k = 2j, we have for the possible values of (3: 
—hj,-hjg +h,...,hg — fh, hj. (5.1.18) 


Upon setting 6 = hm, and using the standard labelling for the eigen- 
states of J*, J? by |j,m), instead of |a, 3), we may summarize the eigenvalue 
problem through 


J? |j,m) = R75 (5 + 1) |3,m) (5.1.19) 
J* |j,m) = him |j,m) (5.1.20) 
(j',m! | j,m) =. 5577 Om'm (5.1.21) 


where the possible values of j are given in (5.1.16), and, from (5.1.18), the 
allowed m values are 


With the assumption that the additional operators, to J”, J°, in the 
complete set of operators, commute with the angular momentum operator J, 
and hence also with Ji, we may identify, by using in the process (5.7), 


J+. |j,m) = Cz (Jj, m) |j, m+ 1) (5.1.23) 


and obtain from (5.11), (5.12), 


(j,m|JzJa\j,m) = fh? (fj =m) (j+m+1) (5.1.24) 


leading, with a standard phase convention, to 
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Jz. |j,m) =h/G Fm) (Gtm+1)|j,mH+l1). (5.1.25) 


By repeated application of J_ to |j,m) k& times, we obtain the useful 
expression 


- (he Lm k),k=0,1,...,j +m 


0, k=jt+m+1jt+m+2,.... 
(5.1.26) 
That is, k is restricted to non-negative integers such that the arguments of 
the factorials are non-negative. 
Similarly, 


(J)* bism) 00 Gem ERT UME R =O lam 
+) |J,™) = 


0, k=j—-—m+1,j—m+2,.... 


(5.1.27) 
For further reference, we also note from the definitions in (5.8) and the 
relations in (5.1.25) that 


Fm) = SVG m) GF mF I) lit 1) 


+ 3 VGFm)G— mF 1) [i,m —1) (5.1.28) 


P jm) = VG—m) GF mF Dim tI) 


+ig VGFm)G— mt 1) \j,m—1). (5.1.29) 


5.2 Matrix Elements of Finite Rotations 


Since the angular momentum components J!, J?, J? satisfy the same com- 
mutation relations as the spin components $!, $?, $%, and the rules for the 
action of the operators J’ on the states |j,m) (see (5.1.20), (5.1.28), (5.1.29)) 
are identical to the action of the operators S* on the states |s,ms) (see 
(2.8.43)—(2.8.45), (2.8.22)), we may conclude directly from (2.8.51), that the 


i 
explicit expressions for the matrix elements of the operator exp (Zn J ), 


with respect to the states |j,m), is given by 
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exp (Zn . 5) ism) 


@ 5. Se VG NG = IG mG =m)! 
7 LG fm — q)\(m! — m+ g)!q\(j — m’ — q)! 


(im 


x (ayes ([aj24y"™ —™*4 ([A}*2)? ([A}22)°-" 9 
(5.2.1) 


where the sum is over all non-negative integers q such that the arguments of 
the factorials are non-negative integers. Here 


cos . + in® sin . i(n! — in?) sin 


A= (5.2.2) 


i(n? + in?) sin P cos © — in? sin 
2 2 
and n = (n!,n?,n°). 
We rewrite the matrix elements of finite rotations in (5.2.1) in the equiv- 
alent description in terms of the so-called Euler angles. 
To the above end, we define consecutive rotations specified by the Euler 


angles a, 3, y in the following manner: 


e A counter-clockwise rotation of the coordinate system xyz about the 
z-axis by an angle a, defining the coordinate system 2’ y/ 2’, 

e the above is followed by a counter-clockwise rotation of the coordinate 
system 2’ y’ z’ about the y’-axis by an angle (, defining the coordinate 
system a” y" 2”, 

e finally, this is followed by a counter-clockwise rotation of the coordinate 
system wy” z” about the z-axis by an angle y, defining the final coor- 
dinate system & YZ. 

The above consecutive rotations are illustrated in Figure 5.1 below. 
A vector x with components (x,y,z) (= (x!,2?,2%)) in the coordinate 
system x, y, Z, is described to have the components @, y, Z in the final coor- 


dinate system %YZ, as follows: 


x Hy 
7} =R(a,8,9) | y (5.2.3) 
z z 
and 
cosy siny 0 cos3 0 —sin@ cosa sina 0 
R(a,8,y) = | —siny cosy 0 0 ol 0 —sina cosa 0 


0 0 1 sinG O cos@ 0 0 1 
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Fig. 5.1. Definition of the Euler angles (a, (,7). The coordinate systems 


resulting from the rotations are as follows: ryz — a'y'2z’ > a" yz" = BYzZ, 
a B y 


with a rotation by an angle a about the z-axis, followed by a rotation by an 
angle @ about the y’-axis, followed by a rotation by an angle y about the 


z"'-axis, respectively. 


cos 3 cos a@cosy — sina siny cos Zsinacosy+cosasiny —sin{cosy 
—cos(cosasiny—sinacosy —cos@sinasiny+cosacosy  sin(@siny 
sin GB cos@ sin GB sina cos 3 
(5.2.4) 


Upon the comparison of the matrix elements of R (a, 3, y), given in (5.2.4), 
with those given in (2.1.4), corresponding to a counter-clockwise rotation by 
an angle y about a unit vector 


n = (cos ¢ sin 6, sin ¢ sin @, cos 9) , (5.2.5) 
one obtains the equalities 


sin 5 = sin 6 sin . (5.2.6) 


ielo—7)/2 — Qi? (5.2.7) 
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cos B ellat/2 = (cos . +icos@ sin *) ; (5.2.8) 


Accordingly, the matrix A in (5.2.2) may be rewritten in terms of the 
Euler angles in the form 
cos ° ella+y)/2 sin ° een yie 
A= (5.2.9) 
B i(a—7)/2 Fenian 


—sin=e 
2 


from which (5.2.1) is spelled out to be 


(ium exp (Zn. 5) isn!) 


VG +M)IG — mG + mG —m’)! 
= =F (G+m-—q)!(m’ —m+q)!q!(¥ —m’—@)! 


; 8 2j+m—m'—2q 6 m’—m+2q eae 
x (-1)™7™4 (cos 5) (si 5) em” Sem) (5.2.10) 


as expressed completely in terms of the Euler angles a, 3, y, where the sum 
is over all non-negative integers q such that the arguments of the factorials 
are non-negative integers. 

Upon the comparison of (5.2.10) with (2.8.52) and using, in the process, 
(5.1.20), (5.2.7), we also have the identity 


(ium exp (Pn-3) Jim’) = (i,m (exp 7°) (cw 7") 
ia 
x (exp 27°) 


expressing the relationships between the two equivalent schemes, where all 
the rotations specified on the right-hand side of (5.2.11) are about the original 
axes 0z, Oy, 0z, respectively. 

We use the notation 


. iy 33 18 79 ia 73 
(ium| (exp 23°) (exp 27?) (ex 2°) 


€ m| (exp 7) i. mt) = DY) , (0, 8,0) = d” (8). (5.2.13) 


j, m') (5.2.11) 


a) = DG (a, 8,7) 
(5.2.12) 


and 


258 5 Angular Momentum Gymnastics 


The pD® , (a, 8,7) have the following property upon complex conjuga- 
tion, 
(D8 a Ba) =(-D™-™D®, _ (08,7) (5.2.14) 


as follows from (5.2.10). 

Useful orthogonality relations of the D® , (a, 3,-) will be derived in §5.5 
when combining two commuting angular momenta. 

The following relations for the matrix elements d? (8) in (5.2.13) easily 
follow from the explicit expression given in (5.2.10) 


dn — B) = (-1)t™'a® (8) (5.2.15) 
de) at) = (Gi) omen (5.2.16) 
do) (=) = (-1)4 Sin, —m! (5.2.17) 
de), (2m) = (-1)*bmm’. (5.2.18) 


For j as obtained from the addition of an orbital angular momentum and 
spin, with quantum numbers ¢ and s, as studied in §5.5, 7 = |@— s|,|@—s|+ 
1,...,€+.s. Then for €>s,j7 =€—s+k, k =0,1,...,2s and (-1)77 = 
(—1)—?8 = (-1)**. For 2< s,j=s—l+k,k=0,1,...,20,(—1)%7 = (-1)?8. 
That is 

d)) (2m) = (—1)?*Smm! (5.2.19) 


in conformity with (5.20). 


5.3 Orbital Angular Momentum 


5.3.1 Transformation Theory 


From §5.2, the orbital angular momentum L (see (2.7.6), (2.7.11)) is de- 
fined by 
L=XxP. (5.3.1) 


Conversely, with (2.7.43) providing the definition of the transformation of 
a state, in the coordinate description, under a coordinate rotation, we have 


W(x!) = (exp (G0 s) v) (x) (5.3.2) 
with x’ = Rx, w = yn (§2.1), or 


p(x) = (ex (0 d s) v) (RPT e)2 (5.3.3) 
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For example, for infinitesimal dy 
Ro'x=x+5w xx 


(see (2.1.13) with 5w — —5w), and 


w(x + dw xX x) = (x) + tw - Ly(x) (5.3.4) 


by carrying out a Taylor expansion, where 
L =x x (-ifiV) (5.3.5) 
provides the coordinate description for L. From (5.3.3), (5.3.4) give for infin- 
itesimal dy 
w(x) = ((1 EY _ bw (S+ 1)| v) (x) (5.3.6) 


which upon comparison with the transformation law of a state in (5.1), 
(2.3.43), for infinitesimal dy, provides the definition of J in terms of L and 


S in (2.7.6). 
In spherical coordinates (r,0,¢), we have the following representations 
a) 
L? = -ih—— 3 
ihs 3 (5.3.7) 
L? = —# 2 fee ae (5.3.8) 
= sin 00-00 * sin? 6 0¢? - 
and for the ladder operators (5.8), 
i ) 3) 
Ls =heti? f cot O55 + wl ; (5.3.9) 


5.3.2 Half-Odd Integral Values? 


We first formally show that according to the eigenvalue scheme for an- 
gular momentum developed in 85.1, the orbital angular momentum does not 
admit half-odd integral 7 values.' There is a long history associated with this 
problem beginning with some work by Pauli which, however, we will not go 
into it here. 

With m = j, k =1, (5.1.27) gives from (5.3.9), 


; 0 0 
0= Ls Olja) =he fica +5] @oliad (6.8.10) 


' We will eventually use the standard notation @ for the quantum number asso- 
ciated with the orbital angular momentum once it is seen below that half-odd 
integer values are not acceptable for the latter. 
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or 


fe) 0 
cot a5 + = (0,¢|9,3) =0 


whose solution is in 
(9, 6|5,9) = Aje’?? (sin 8)’ 


where 


qe fh TOTS) 
JV Qn DG + 1)P(1/2) 


and we have used the integral 


Tg +1) 


: sin 9)794+1 = \/qp = 
[a9 omar = ve ay 


for normalizing (0, ¢|j, 7). 
From (5.3.9), (5.3.12), we explicitly have 


L_ (0,0 ,) = hAje"I-29(sin 8)!-"(—2),j[e0s 6] 
(L_)? (8, 6] 5,5) = Wh Aje-* (sin 8)? (—2)? 
ava 2 As osc 
x j |(g — 1) cos* 8 - 5 sin 6). 
In particular, for 7 = 1/2, (5.3.16) gives 


(LJ? (6,6|1/2,1/2) =e /%(sing)-¥? 


(5.3.11) 


(5.3.12) 


(5.3.13) 


(5.3.14) 


(5.3.15) 


(5.3.16) 


(5.3.17) 


while (5.1.26), with 7 = m = 1/2, k = 2, states that (5.3.17) should be zero 
and hence, in particular, D_ (6,¢|j,7) should be in the domain of definition 
of L_. We will see later that such a contradiction does not arise for integral 
values of j for the orbital angular momentum. [The expression in (5.3.17) 
is not only different from zero but is also not even square-integrable over 0, 
ie., L_ (0,¢|1/2,1/2) is taken out of the space of square-integrable functions 


over 0, ¢ by the action of L_.| 


To proceed with other half-odd integral j values we consider the following. 
By repeated applications of L_ to (0,¢|j,7), as in (5.3.15), (5.3.16), we have 


the following elementary limits 


L_ (8,015.5) a> RAs" V8 (—2) 507" 


(L_)” (0, 63,3) yaa RAY OS Vij G=pe 
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(L_)” (9, 15,5) sa (YY Age? (-2)"7 FG - I) 5+) (AO. 


That is, explicitly, 
(RY) A, eH? (—2)7 (—AV292(7 +4 1a; 7 = 1/2,3/2,.-. 
0, j=0,1,... 
(5.3.18) 


where we have used the fact that —j(—j+1)...(j —1)j necessarily includes 
0 as a factor for integral values for j. 

More generally, as an induction hypothesis, suppose, by invoking (5.3.15), 
(5.3.16), that for some k > 1 


(L_)* (8,615, 5) = (h)* Aye" (sin 6)?" Py (cos , j) (5.3.19) 
where Px(cos 6,7) is some polynomial of degree k in cos @ whose coefficients 


depend on j. 
By using the elementary property 


0 
P;,(cos 0,9 5.3.20 
sag Pecos 4.3) (5.3.20) 
we obtain directly by the application of L_ to (5.3.19), that 
(L_)**! (6,6|3,7) has the same structure as in (5.3.19) with k replaced by 
(k + 1) on its right-hand side. 
Equation (5.3.18) gives for @ — 0: 


 Py(cos 6,7) = —sind 


Po j(cos@ = 1,7) = (—2)74(-1)9 79245 +. 1)! ; jf = 1/2,3/2,.... 
(5.3.21) 

That is, for such 7 values, cos @ = 1 is not a root of P2;(cos 6,7). Now since 
P2;(cos 6,7) is a polynomial in cos 6, and hence is continuous, the polynomial 
cannot abruptly vanish as we move away from cos@ = 1 to cos@ < 1 from 
the non-vanishing value in (5.3.21) for cos@ = 1. Hence there exists a non- 
empty interval 0 < @ < a;, where a; depends on j and is strictly positive, 
such that P:;(cos 6,7) € 0 for all 6 in this interval. The latter in turn means 
that there exists a strictly positive constant C’; which depends on j such that 
| P2; (cos 6, 7)| S C; > 0, for j = 1/2,3/2, sey for 0 < 0< aj. 

Finally, from (5.1.26), (5.3.19), with k = 27, m = j, we may formally 
write 


(9,6|9,-5) = eae (sin 0) ~J Px; (cos 6, j) (5.3.22) 


and for half-odd integral 7, one has the following lower bound 
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20 wT aj 
[a6 [a9 sin6)(0,613, a)? > 2m [a0 sino (6,413.1)? 
0 0 0 


T Ail Ce 40 (sin )!-2! 5.3.23 
((29)!)" aff ene 


where a; is strictly positive, and the integral on the extreme right-hand side 
of (5.3.23), clearly, does not exist (due to the divergence at the origin) for 
j = 3/2,5/2,... which is in contradiction with the normalizability of the 
eigenstates in the eigenvalue problem. 

For the non-admittance of 7 = 1/2 for orbital angular momentum, see 
(5.3.17), (5.3.42) and the discussions immediately below these equations. 

With half-odd integral values of j not admitted for the orbital angular 
momentum, we will denote the latter parameter by @, where now 


£=0,1,2,... (5.3.24) 
m= Sb Hb ets 0, lee bt (5.3.25) 

We also use the notation 
(0,¢|£,m) = Yem(4, ¢) (5.3.26) 


defining the so-called spherical harmonics treated next. 


5.3.3 The Spherical Harmonics 


With an adopted sign convention, (5.3.12) gives the properly normalized 
expressions 


Yee(9,¢) = (-1)° 


(20+ 1) ieg Sa 


4n(20)!~ — | 2201 (sin a (5.3.27) 


where we have written the normalization constant A; in (5.3.13) as 


T (+ 3/2) (26 +1) (20)! 
(aa (€+1)T (0/2) © |? (20)! 2201” (5.3.28) 


Equation (5.1.26) leads to the following recursive relation 


1 1 
— i GE CLES) 


Yer—1 (9, 0) Yer- (5.3.29) 


As an induction hypothesis, suppose that for all m = @,@—1,...,k, for 
some k, 
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Yo (8,0) = (—1) orth? ae gas as vl) P!™! (cos@) (5.3.30) 


4n =(€4+|ml) 
where Cane : 
m £+|m| 
Mian We) d 2 ae 
PI”! (2) = ~ (a? —1)'. (5.3.31) 


For m = &, we check that Yp¢(0,¢) in (5.3.27) satisfies (5.3.30) by noting 
that 


ea (a? —1)° = (20). (5.3.32) 


We will show that (5.3.30) also holds true with k replaced by k — 1 as well: 


1. Suppose k > 0. Then (5.3.29) leads to 


Po 9041) (€—k+1)! (sing)* 4 
Yo r—1 (8,6) = (—1)" ttn ra ae Ty (cos 0) 


(5.3.33) 


where 


1 
(+k) (@—k+1) 


Th (x) = 


d l+k 
2Qka (=) _ (1 _ az”) 


x (t) "| Gai, (5.3.34) 


To simplify the expression for T; (x), we use the following identities: 


CB) )=-(@)" seer n(@y one 


from which 


(A) (A) pk -erea]. a0 


os [ oy 7” =—(¢+k-1)(€+k-2) io 


—2(€+k-l1)a (2) + (1-27) (ay (5.3.37) 


which in conjunction with the identity (5.3.36) gives 
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d ) é+k+1 


0-2) (5 


= (A) t 2) 2+ kad CHR (e+h | 
(5.3.38) 


The two identities (5.3.36), (5.3.38) immediately lead for T), (x), defined 
in (5.3.34), the simple expression 


l+k—-1 
Tp (x) = (=) (2? —1)' (5.3.39) 


which from (5.3.33) establishes the correctness of (5.3.30) for m = k—1 as 
well. Here note that (—1)*~' = (Gaye anes 


2. On the other hand suppose k < 0. Then (5.3.29) immediately gives from 
(5.3.30) 


You-1 (0,0) = gine [Ot 1) (€+k-1)! 


dn (€—k+1)! 


2) (1-k)/2 


1—# d l—-k+1 
ee va (=) (22-1) (5.3.40) 


which again verifies the expression (5.3.30) for m = k — 1 as well. Here we 
note that (ase =. 

This completes the derivation of the expression for the spherical harmonics 
Yom (0,0) as given in (5.3.30). The polynomials pin (x) are the so-called 
associated Legendre polynomials, and for m = 0, P? (x) = Pe (2) are referred 


to as the Legendre polynomials. 
For m = —£, we have from (5.3.30) 


o-ito, | 24+ 1)! (sin 9)! 


Ye-e (0, ?) A dn 2£0I 


(5.3.41) 


and is easily checked that unlike for the half-odd integral values (see (5.3.17)) 
L_Y¢_«(0,¢) =0 (5.3.42) 


as it should be. That is, the algebraic relations (5.1.26), (5.1.27), resulting 
from the eigenvalue problem for angular momentum (§5.1), are internally 
consistent for orbital angular momentum only if £ has integral values given 
in (5.3.24). [In the earlier literature, single valuedness, for example, was, a 
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priori, often imposed under a rotation over ¢ by 27 to dismiss with half-odd 

integral values! The literature on this subject is too vast and many different 

approaches have been taken on this problem to dwell upon them here.] 
Under space reflection (see Figure 2.4), 6 zt —O0,¢ o+ 7, hence 


PI! (cos) > (—1)!*"™! Pl”! (cos @) in (5.3.31), (—1)*'™"! = 1, and 


Ye m (9,9) > (-1)' Ye m (0,4). (5.3.43) 
Complex conjugation, on the other hand, gives 
Yen (8,6) = (—1)" Yo,-m (0,4) - (5.3.44) 


The spherical harmonics for some special values of @ and m are recorded 
here for convenience: 


Yoo (0, ¢) = 1/W4a (5.3.45) 
ts 3 +id .: 3 
Y1,41 (6,6) = F4/ —e*"® sin 9, Yi,0 (0, 6) = 4/ — cos0 (5.3.46) 
81 ‘ An 
and 
19: Jebig i 3 
Yo,42 (9,6) = er“? sin” 6 (5.3.47) 
: 327 
sf De eae § 
Yo,41 (8,9) = e+? sin 20 (5.3.48) 
: 327 
Yo.0 (0, ¢) = a (3cos”@—1). (5.3.49) 
Bron, 167 
For 0 =0,¢ =0, 
2 1 
An 


The orthonormality condition (5.1.21) reads 
Qn Tw 
i Of deed Ve Oey Oyo is. (5.3.51) 
0 0 


The Cartesian components (x!,x?, a?) (= (#,y,z)) may be written in 
terms of spherical harmonics as 


ei => (= Tr [Yi-1 (6, ¢) = Yi (6, )| (5.3.52) 
Page” iV Oat OO (5.3.53) 


3 
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cS ie rYi0 (9, ¢) (5.3.54) 


with all corresponding to = 1, emphasizing the vector character of x. 
Selection rules for the matrix elements (¢’,m’|a"|£,m) are readily ob- 
tained from the following considerations. 
The elementary integral 


Pa elm") 6 = ond, ant (5.3.55) 

shows that 
(@,m'|a*|ém)=0 for m ¢Am+1 (5.3.56) 
(f,m'|x?|@,m)=0 for m #¢m+1 (5.3.57) 
{¢,m!|e%|tm)=0 for mi xm. (5.3.58) 


On the other hand, the addition rule of angular momenta studied in §5.5 (see 
also (5.8.35), (5.8.36)) with x2’ corresponding to ¢ = 1 (see (5.3.52)—(5.3.54) 
shows that 

(,m' |z'|£,m) =0 (5.3.59) 


if the value of ¢’ is not in the set {]—1],...,€+ 1}. 
The spherical harmonics in (5.3.26) may be readily related to the matrix 
) 


m’ 


elements of the D® matrix, with matrix elements pe 
follows. 
To the above end, let 


given in (5.2.12), as 


n = (sin d, — cos ¢, 0) (5.3.60) 
No = (0,0,1). (5.3.61) 


Then according to (2.3.5) 


; t 
(x [bs (0m 5) | tm) = (N|é,m) (5.3.62) 
where (§2.1) in an obvious notation 
N = R(0,n) No 
= (sin 0 cos ¢, sin @ sin ¢, cos 6) , (5.3.63) 
and (5.3.62) may be rewritten as 
(9 = 0,¢ =0lexp (— jon 5) tm) = ¥; m (0,0). (5.3.64) 
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The left-hand side of (5.3.64) may be expanded as follows 


exp (-7 én- 5) lm) 
p =. én- J 
h , 


where we have used (5.3.50), to give 


i 
“ee exp (-;,0n-3) tm) 
— f2l+1 Deg 58 To 23 ao 
= ae (40 exp ( 5P! ) exp (504 ) exp (;7 ) L m) 
2f+1 20+1 
Tn 2+ Di? (6,4,0) = qq Pom (9; 9,7) (5.3.66) 


where in writing the second equality we have used the identity (5.2.11), and 
in the last equality we have used the definition (5.2.12). We note that for 
m = 0 in (¢,0|, the angle y does not contribute in (5.3.66), i.e., the angle 7 
may be taken to have any value. 


e 
S> (6 =0,¢6=0|£,m') (en 


m/=—£ 


2+ 1 
= («0 ex 


ig m) (5.3.65) 


Vee (40 


5.3.4 Addition Theorem of Spherical Harmonics 


Finally, we establish the following theorem for the addition of the product 
of two spherical harmonics: 


An 


ESI ba Yim (91; $1) Ye m (82,62) = Pe (cos @) (5.3.67) 


where (see Figure 5.2) 


cos 6 = N,-Na, (5.3.68) 
N; = (sin 0; cos ¢;, sin 6; sin gj,cos6;), i= 1,2 (5.3.69) 

and in detail 
cos 6 = cos 6; cos 62 + sin 6; sin 62 cos (¢1 — 2). (5.3.70) 


We note that due to the reality of P,(cos@), the complex conjugation sign * 
may be put on either Yen (01,61) or Yom (02, 62) in (5.3.67). 
To establish (5.3.67), we note that according to (2.3.5) 
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Fig. 5.2. Figure showing the angle 0 as the angle between the two unit vectors 


Ni, No and hence Ni - N2 = cos. 


(x, lexp (77) exp (fos) r0) =(N 


where (§2.1) -_ 
N = R(61,%7) R(¢1,X°) No (5.3.72) 


£,0) (5.3.71) 


2 


where X2, x3 are unit vectors along x” and x? axes, respectively. In particular, 


WN’ works out to be equal to cos 6. The corresponding ¢-angle does not con- 
tribute to (N|£,0) since m = 0 in |¢,0) (see (5.3.30)). Accordingly, (5.3.71) 


may be rewritten as 


; : } 
(1 2 lexp ( 7°) exp (; 7°) 


10) = Y¢ 0 (8,0) 
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2l+1 
=1/ = P¢ (cos 6). 


(5.3.73) 


The extreme left-hand side of this equation may be expanded in the form 


£ . 2 + 
S- (02, b2|£,m) (1m lex» (47) exp (;0.7°)| 60) 


m=—£ 
* 


£ 
= > Yim (82,42) [DE (61, 61,0)] (5.3.74) 


m=—e 


which from (5.3.66), (5.3.74) gives the identity in (5.3.67). 
Additional properties of the spherical harmonics are given in 85.8. 

5.4 Spin 

5.4.1 General Structure 


Under a coordinate rotation, spin is described by the transformation prop- 
erty (5.3.2) of a wavefunction 


w') = (exp (Fon-8) &) Oo) (5.4.1) 


where x’ = Rx. A wavefunction w (x) may be written as a (2s+1) component 
object (see (2.8.23), (2.8.25)) 


W (x, 8) 
w (x,s—1) s 
w(x) = = So X*(m) v(x, m) (5.4.2) 
w(x, -s+1) ess 
yp x, —S) 
where 
1 0 0 
0 1 0 
X*(s)= | 9] ,x*(s—1) =] 9],...,X°(-s) =]: (5.4.3) 
: 0 
0 0 i 


with the normalization condition (2.8.25) for the wave functions ~ (x,m) 
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> [ex lw (x, m)|? =1 (5.4.4) 


m=—s 


satisfied. 
In detail, (5.4.1) may be rewritten in terms of components as 


exp & : s) 


1 
exp (Gon . s) s m) are given in (2.8.51). 


We may use (5.2.10), (5.2.11), (2.8.41) to find the eigenstates of the pro- 
jection of the spin N -S along an arbitrary unit vector 


Ss 


Gi SH 


m=—s 


8, m) w (x, m) (5.4.5) 


where the matrix elements (s m 


N = (sin G cosa,sin @ sina, cos 3) (5.4.6) 
making, in particular an angle @ with the z-axis. We consider specific ele- 
mentary cases first which are easily and directly worked out before dealing 
with arbitrary spins. 

5.4.2 Spin 1/2 
For spin 1/2, S = ho/2, 


cos? singe 


N-S=5 ane ite: ) (5.4.7) 


and the eigenstates |A, N) of N -S satisfying 


N-S|\,N) =AA|A,N), A=41/2 (5.4.8) 


are easily worked out to be 


cos B ger 
\1/2,N) = (5.4.9) 
sin 3 eia/2 
—sin B e ia/2 
|-1/2,N) = (5.4.10) 
cos 5 eia/2 


For example, the amplitude for a spin measurement along a Z-axis, making 
an angle @ with the z-axis with a = 0, to be along the +7 direction if it is 
initially prepared to be in state |—1/2, z) is given from (5.4.9), (5.4.10) to be 


(+1/2,2|—1/2, z) = (cos § sin 5) (7) = sin - (5.4.11) 
In reference to Figure 1.5, we also have 
(41/2, 2|+1/2,2) (41/2,2|-1/2, 2) <sin 8 i as (5.4.12) 


We recall (§2.7) that spin S and the momentum operator P commute, 
and if p denotes the momentum of a particle, then the spin S along p/p is 
called the helicity of the particle with eigenvalues iA, \ = £1/2 in (5.4.8). 

We note the following elementary properties: 


h2 


‘Te S=0,. “Wr S's") = re (5.4.13) 
and 
h2 
(N-S)? = pl (5.4.14) 


The spin density operator p associated with a mixture (§1.5) of spin com- 
ponents in reference to the vector N may be written as 


p= S > w(d))A,N)(,N| (5.4.15) 
A=+1/2 
with 
Ss" w()=1 (5.4.16) 
A=+1/2 


and is easily work out from (5.4.9), (5.4.10) to be given by 
1 
p= 5 b+ (w (+1/2) — w(—1/2))N- o] (5.4.17) 
which has the general expected structure 
1 
pP=5 [l1+b-o] (5.4.18) 
satisfying the normalization condition 


Tr [p] =1. (5.4.19) 


The expectation value of the spin S (§1.5) is then given by 


Tr [pS] = & (w(41/2) w(—1/2))N (5.4.20) 


where we have used (5.4.13). 
The vector b 
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b = (w(+1/2) — w(—1/2))N (5.4.21) 


is often referred to as the spin-polarization vector and satisfies 
0< |b] <1. (5.4.22) 


For example, for w(+1/2) = 0, w(—1/2) = 1, that is, b = —N, one is 
dealing with a beam with spin polarized completely in the —N direction. For 
w (+1/2) = w (—1/2) = 1/2, that is, b = 0, one has a completely unpolarized 
beam. 

Detailed treatments, with various applications, of density operators for 
spin 1/2 will be given in 88.6. 


5.4.3 Spin 1 


For spin 1, we have from (2.8.62)—(2.8.64), 


cos 3 sin Bia 0 
sinf . snp _. 
N-S=h e'™ 0 e 1° 5.4.23 
9p 3 ( ) 
0 ae B oa —cos 3 
V2 
and the eigenstates |A, N) are readily worked out to be 
1+ cos B ia 
2 
sin 3 
+1,N) = 5.4.24 
| ) DB ( ) 
1—cos@ ,, 
5 @ 
sin? _,, 
= e 
V2 
|0,N) = cos 3 (5.4.25) 
sinf ., 
—~e 


v2 
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1 — cos B ia 
2 


sin 3 


-1,N)= 5.4.26 
| ) 7B ( ) 
1l+cos@ ,, 
———e 
2 
We note the following relations 
1+cos?@ sin cos sta sin? B 21a 
2 J2 2 
sinfcosZ . . sinfcosB _. 
N-S)* =f? el sin? —— Fo eio | (5.4.27 
(N-S) fa p 5 ( ) 
sin? B pia sin 3 cos B ic 1+ cos? 8 
2 J2 2 
N-S\* N-S 
= 42 
( : ) : (5.4.28) 
Srcr 
Teles: T| a =20" (5.4.29) 
SPsts™ . kom 
Tr | = ick? J (5.4.30) 


The spin density operator associated with a mixture of spin components 
in reference to the vector N may be written as 


p= > w(A)|A,N)(A,N| (5.4.31) 
A=0,+1 
with 
w(A)=1 (5.4.32) 
A=0,+1 


and is readily worked out from (5.4.24)—(5.4.27), (5.4.23) to be given by 


(w(+1)-—w(-1))N-S 
2 A 


(w (+1) + w (—1) — 2w (0)) (%; ah 


p=w(0)1+ 


fp (5.4.33) 


2 
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verifying, in particular, the normalization condition 
Tr [p] = 3w (0) + (w (+1) + w (-1) — 2w (0)) = 1 (5.4.34) 


where we have used (5.4.29), (5.4.32). 
Upon using (5.4.29), (5.4.30), we have for the expectation value of S 


Tr [pS] = A(w (41) —w(-1))N (5.4.35) 
(See also Problem 5.7). 


5.4.4 Arbitrary Spins 


From (2.8.42), (5.4.6), the matrix elements of the (25+ 1) x (28+ 1) ma- 
trix N-S are given by 


h : 
(s,m|N - S|s,m’) = 5 sin Bel /(s — m) (s + m+ 1)bmnm41 


+ * sin gen Us +m) (8s —m+1)dmm—1 


+ hcos 8 Mom m (5.4.36) 


and from (2.8.41), (2.8.51), (5.2.11), (5.2.12), the eigenstates |s; A, N) satis- 
fying 
N-S|s;A,N) = fA|s;A,N), (5.4.37) 


A=-s,-s4+1,...,5—-1,5 (5.4.38) 
are explicitly given by 
DXY (a, 8,0) 


Des (a, B, 0) 
|s;A,N) = (5.4.39) 


D&” (a, 8,0) 


where pe (a, 8,0), in (5.2.12), is given by the expression on the right-hand 
side of (5.2.10) with 7’ = j =s,m=A,m’=N,7=0. 

The spin density operator p associated with a mixture of spin components 
in reference to the vector N may be then written as 


p= oD w (A) |s;A, N) (5;A, N]- (5.4.40) 
A=-—s 


For N = p/p, where p is the momentum of a particle, N -S is referred to 


as the helicity, having eigenvalues hA, AX = —s,...,8. 
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5.5 Addition of Angular Momenta 


In this section we deal with the angular momentum eigenvalue problem 
resulting from the addition of two commuting angular momenta. That is, we 
consider the angular momentum operator 


where J1,J2 are assumed to commute. A typical example, is the addition of 
the orbital angular momentum to the spin of the electron in the hydrogen 
atom discussed later in §7.4, §7.5. 

For the (J;,J2) system, J7, J}, J3, J3 system are commuting operators, 
and for a given pair (j1,j2), we may generate a (27; + 1) (27; +1) dimen- 
sional vector space V(ji,j2) spanned by the orthonormal set consisting of 
the vectors 


131,171) |j2, M2) = |91,1M13 Jo, M2) = |mM1, M2) (5.5.2) 


where my = Sis hi + 1, che wil _ 1,913 m2 = —Jje, —Jja + 1, fess »J2 = 1, jo. 
For the combined system, J”, J°, J - J ; commute and we denote their 
simultaneous eigenstates by 


19,75 J1,J2) = |j,m) - (5.5:3) 


For a given fixed pair (j1,j2) we may expand the eigenstates |j,m) in 
(5.5.3) in the terms of the basis {|m,mz2)} in (5.5.2): 


i,m) = S> |mi,me) (mi,ma|j,m). (5.5.4) 


m1,m2 


The expansion coefficients (m,m2|j,m) are referred to as Clebsch-Gordan 
coefficients. Unfortunately, there are many different notations used for these 
coefficients in the literature. 

The application of J? = J? + J§ to (5.5.4) yields 


hm |j,™m) _ y, h|my + mz) |m1, m2) (m1, mz | Jj,m) 


m1,m2 
implying that the sum in (5.5.4) over mj, mg is restricted to 


Given a fixed pair (j1,j2), one may readily find the allowed j values 
corresponding to J? for the combined angular momentum. 

To the above end, we note that m takes its maximum (minimum) value 
j (—j) when m1, m2 take on their maximum (minimum) values j1, jo 
(—j1, —Jo). For m = +(j1 + je) one necessarily has 7 = 7; + jo and only 
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one vector |+71,+j2) contributes to the sum in (5.5.4) for each sign. Accord- 
ingly, we may identify 


[91 + Jo, + (91 + Jo)) = |, £32) (5.5.6) 
and set 
(491, $92] 91 + Jo, + (01 + jo)) = 1 (5.5.7) 
for the corresponding Clebsch-Gordan coefficients. 
Since my = —J1,---,J13 M2 = —Jje,---,J2, the next possible value to 
m = ji + je is 
m= jit jo—1. (5.5.8) 


In this case we have 
M2=j2, Mm=ji—l 


or 
m2 = Ja er 1, My, = ji. (5.5.9) 


We then have exactly two vectors, |j1 — 1,2), |j1, j2 — 1) contributing to the 
sum in (5.5.4) for the given m value in (5.5.8). 
More generally, we may write for the possible values of m 


m=ji+tjo—-k=m+m (5.5.10) 


where 


k =0,1,2,...,2(j1 + je) (5.5.11) 


and for k = 2 (1 + je), m= — (1 + je). 

For the subsequent analysis, suppose, without loss of any generality, that 
ji > jg. The reversed situation is similarly handled. 

To satisfy (5.5.10), we may find the smallest possible non-negative integers 
a and b such that 


M2g=—jota, mM, =2jo+j,-k-a 


or 
(5.5.12) 


or 
m2 = j2—b, m=jri—k+b 
Since, in particular, —j; < my, < je, these possible solutions imply that 
fi < QMBethp—-k-acn (5.5.13) 
fre fie kes ji (5.5.14) 


or that 
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2j2-—k fa <2(ji+je)—k (5.5.15) 
k-2j, <b0<k. (5.5.16) 


Clearly the number N (k) of |mi,mz2) states, contributing to the sum in 
(5.5.4), giving rise to the same m value given in (5.5.10) is, using in the 
process (5.5.12), given by 


N (k) = m (max) — m,(min) + 1 
= m2(max) — m2(min) + 1. (5.5.17) 


Depending on the value of k, i.e., whether 0 < k < 272 or 272 <k < 291 or 
231 <k, the values of a, b and N(k) are worked out in Table 5.1 for a given 
k, all corresponding to the same value m = 3; + jo — k. 


Table 5.1. For m = mi + m2 —k, j1 > jo, depending on the value of 
k = 0,1,...,2(j1 + jz), the possible values of the pair (mi, mz) are given 
in (5.5.12) with a and b as given in this table. N(k) denotes number of 
states |m1,mz2) that contribute to the sum in (5.5.4) leading all to the 
same m = j1 + jo —k value for a given k. 


a b N(k) 
O<k < 2%Hjo jo —k 0" k+1 
Qjo ck < hr 0 0 2jot+1 
2j1 <k < (jr + je) 0 k-2j, | (ji t+je)—k+1 


* This result follows because we have taken 71 > jo. 


Finally, we refer to Table 5.2, to infer, by using in the process m = 
—j,-j +1,...,9 — 1,9, that for fixed 71 > ja, the possible values of 7 are 
J=hi—-—Ja,di—Jo4+1,..-,91 + Jo. For 71 < Jo we may simply interchange 
the indices 1, 2. That is, for a fixed pair of values (j1, j2), the possible values 
of 7 are 

g= jr Fal 591 dal eget a (5.5.18) 


The above constraint |j1 — jo| < J < ji +Je is referred to as the triangular 
condition. 
For a given pair (j1, j2) the number of independent vectors |7,m) in (5.5.3) 
is 
(j1+J2) 
S> (29 +1) = (21 +1) (232 + 1) (5.5.19) 


j=|g1—J2| 


5 Angular Momentum Gymnastics 


278 


I Zz ee elz, T+ 2g || T+ 22 lz, ie Zz T (4). N 
el (26 — T)]-- | (@e6 — W)— 
T+ 2-1 T+ (#6 — Te) ] (eo — TY) | (el — te)—|t — (ef — Fe) 
T—-+0 T— C64 TCE + (26 — TL) (26 — 1) (ee — te)—|t — (26 — Te)—] | + (26 + FC) 
CO + TE lel + WT — el + Tel | + (ee — 10) (20 — Te) |e | (ee — Tey— |p — (26 — We) |p + (26 + Te) | (20 + 1e)— 
| | | | | 
C 
’ 0 | T | (#2) | J a Wz (T+ 16z) [oc pr— (E+ We] (46+ W)z] oy 


‘sdoys Josoqyul Aq ws aspai2ut am sp yNo poinSy Ayiseo ore pure @l < TW sired 
refnoryred LOF ayNqi1yWOS YOU ATSNOTAQO [[LM SMOI OY} PU SUTINIOD oY} JO aUIOS ‘A[BUIPIOOW “JYSII 04 Yo] WIOIZ OF aM Se oSeoIOUI SoNyeA 
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yey} uoTyRor oy} Suisn Aq ‘el < TW ared poxy ev uaats ‘C Jo sonjea afqissod oy} JoyuI ATIpeor ABUL OUO ‘aTGey oY} UI AT[eJUOZTIOY ssor19e 
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since m takes (27 + 1) values. 
We may invert the expansion in (5.5.4), and expand |m,, mg) in (5.5.2) 
in terms of the kets |j,m), 


jitje 
|m1, m2) = Se J, m) (j,m|m1, m2) (5.5.20) 
J=ld1 Jal 


where m= m1 + ™o2. 
From the orthonormality relations 


(i,m! lJ, m) rs 6575Om! mm (5.5.21) 
(m{,m| m1, M2) = Omit mi Smime (5.5.22) 


for a given pair (j1,j2) the following completeness relations follow from 
(5.5.20) and (5.5.4), 


JitJa 
S- (m4, m|7,m) (j,m|mM1,mM2) = Sm! m,5msmz (5.5.23) 
J=l1—Jal 


with m= m,+ m2 =m), +m, 


S> (i,m!| mi, m2) (m1, ma j,m) = 55-75 m'm (5.5.24) 
m1,™m2 
where m = my + m2 =m, +m), m= —ji,—ji t+1,.--591 — 1,91, m2 = 


—jo,—ja+1,.--,J2 —1, 72. Equation (5.5.24) is valid for 7 taking any of the 
values in (5.5.18). 
Upon applying 


Jg = Sin + Jo (5.5.25) 
to (5.5.4) and using (5.1.25), we obtain 


hy (j $m) (j+m-+1)|j,m +1) 


= SO h(i Fm) Gr £m $1) [rma + 1, ma) (m1, M2|j,m) 


m1,m2 


+ So h/Ge = ma) (jo m2 + 1) |m1, m2 +1) (m1, m2|5,m) 
my1,m2 


(5.5.26) 


which upon multiplying by (m‘4,m5| and using (5.5.22) gives the following 
useful relationship relating Clebsch-Gordan coefficients: 


JG +m) Gtm+t+1) (m1,™M2|j,m + 1) 
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= V(j1 m1) (1 F m1 +: 1) (m1 F 1, m2 j,m) 


+ V/(j2 = m2) (jo F m2 +1) (mi, m2 F 1] j,m). (5.5.27) 


The following relation for the Clebsch-Gordan coefficients as following 
from (5.5.4) or (5.5.20) should be also noted 


(m4, ms lj, m’) = Om! ym1-+ms (mi, m2 BE my ale mz) : (5.5.28) 


Of particular interest is the addition of an arbitrary non-zero angular 
momentum j; > 1/2 to a spin jg = 1/2 which is easily handled. In this case 
For 7 = ji + 1/2, jo = 1/2, mp = +1/2, and taking the lower sign in 
(5.5.27) gives a recurrence relation leading to the following chain of equalities: 


(mm —1/2,1/2| jm) =f (mm + 1/2,1/215,m-+1) 
] 1 
(m+1/2,1/2|j,m+1) = ee (m +3/2,1/2|j,m +2) 


- a a - 
(j — 3/2,1/2|9, —1) = a (7 —1/2,1/2|9,9) 
or 


1tm+1/2 


(m —1/2,1/2|41 +1/2,m) = ,/22! 1/2 je Udy sy 41/2) 
27, +1 


(5.5.29) 
and from (5.5.7), with the + sign, to 


— fjrtm+1/2 
(mm — 1/2,1/2 fn +1/2,m) = fT (5.5.30) 


Similarly for 7 = 7; + 1/2, mz = —1/2, and now taking the upper sign in 
(5.5.27) leads to 


(m+ 1/2, -1/2|9, + 1/2,m) 


= (ee fi, —1/2| fr + 1/2,— (91 +1/2)) (5.5.31) 
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and from (5.5.7), with the — sign, leads to 


; ji-—m+1/2 
1/2,—1/2 1/2 = 4/——_—___.. 5.32 
(m+ 1/2,-1/2|fa + 1/2,m) = fe (5.5.82) 


Upon using (5.5.23) with mi, = m1, ms = m2 = —1/2, we have from 
(5.5.32) 
h — 1/2 
Ja M412 tn 4.1/2, -1/2|j1 —1/2,m)[? <1 (5.5.33) 
27, +1 


and with a definite choice of phase to 


; jr tm+1/2 
1/2,-1/2|9, —1/2 = ,/ ———___.. 0.34 
(m+ 1/2,-1/2 | ~ 1/2,m) = fo (5.5.34) 


With this choice of phase, the phase of (m—1/2,1/2|j, —1/2,m), is 
uniquely determined from (5.5.24) and one obtains 


i —m+1/2 
n= doe Sy) Oe (5.5.35) 
271 +1 


These Clebsch-Gordan coefficients are tabulated in Table 5.3. 


Table 5.3. The expression for the Clebsch-Gordan coefficients 
(j1,m13;1/2,m2|j,m) for the addition of an arbitrary 7; > 1/2 angular 
momentum to a j2 = 1/2 one. 


ji+1/2 noe 


1/2 (22 jeaeale 


2h +1 2141 
1/2 jane 7 peasare 
21 +1 2141 


For the addition of two spin 1/2’s, the above Table gives 


[1, +1) = |1/2,1/2) (5.5.36) 
1 
|1,0) = op (l1/2,-1/2) + L=1/2199\) (5.5.37) 


t= (1199) (5.5.38) 
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a triplet associated with spin 1, and a singlet 


1 
V2 


associated with spin 0. The triplet should be compared with the correspond- 
ing expressions in (2.8.59)—(2.8.61). 

As another illustration, we consider the problem of constructing simul- 
taneous eigenstates of the commuting operators (§2.3, §2.7) J, J°, L?, S? 
which we denote by |7, m, ¢, s). In particular, for s = 1/2 we may use Table 5.3 
directly to construct such eigenstates. These will be important in our study 
of the hydrogen atom when we include spin. To this end, using the notation 
\j,m, ,1/2) = |j,m, @), and setting 7; = £, mz = m’ in the Table, we have 
from (for £ 4 0) 


|0,0) = (\1/2, -1/2) — |-1/2,1/2)) (5.5.39) 


|j,m, &) = > [€, me; 1/2,m’) (€,me;1/2,m'|j,m) (5.5.40) 


metm’ =m 


that 


- 1/2 
smn, f) = — [em —1/251/2,1/2) yf EMP 9 V0, 4.1/251/2, -1/2) 
/£+m-+1/2 
x > at (5.5.41) 


for 7 = €—1/2, and 


bj, £) = [fm — 1/251/2,1/2) f PAM? 4 V6 mn 4 1/251/2,-1/2) 
2+ 1 
£-m+1/2 
—___—. 5.42 
“VO 2041 (O42) 


for 7 = 0+1/2. 

Finally, we derive a useful orthogonality relation of the pe, (a, 3, ) 
functions given in (5.2.12) by combining two independent, i.e., commuting, 
angular momenta Ji, J2 as defined in (5.5.1). 

For the above purpose, we note that 


: . i . ; 
(jasimjava exp (jen-3) fas javm) 


5 i * na ‘ i ‘ 7 
= ( ji,m1 |exp zen di ji, My, ) ( J2,M2 exp Zen: Ie j2,M 


= pi) (a, 8,7) pF) (a, 3,7). (5.5.43) 


—~ mim} m2ms 
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Upon using the expansion (5.5.20), the above may be rewritten as 


S* (juss ja,me| jm) DY, (a, 8,7) (j,m’ | Jr, m3 j2,m) 


j,;m,m/ 


SDE ha De Cae 4). (5.5.44) 


We will consider those cases in which 41, jg are either both integers or are 
both half-odd integers. Then 7 necessarily takes on integer values. [For other 
cases see Problem 5.12]. 

Upon multiplying (5.5.44) by sin @ and integrating over 0 < a < 27,0 < 
B<1,0 << 27, we obtain 


[re a [as sing [ dy D2) 7 (8,4) DS* 5 (G89) 


=n? (jnstas jay ma|i.0) f dé sin 8 d') (8) (j,0|d1, m3 jo, m5) 
; 0 
J 

(5.5.45) 


where we have used (5.2.10) and (5.2.13). For j = 0, (5.2.10) gives d\!) (8) = 
1. For 7 £0, (5.2.10) leads explicitly to 


{Ors | : dé sin 3 d¥) (8) 
0 


J 3 
=e ae da (sina)? (cosn)t1-74 (5.5.46) 
q=0 (G _ q)! 


which upon using the integral 


aie 2a—1 yp \2b-1 1T (a) 0 (0) 
| dz (sin x) (cos x) = Sao (5.5.47) 


for non-negative integers a, b, gives 


j j) 2 
ID= 1-1)’ =0. 5.5.48 
cep OM Gag game a 
That is, 
7 dB sin BdY) (8) = 26,0. (5.5.49) 
0 


The reader will recognize (5.5.49) as a special case of the orthogonality con- 
dition of the Legendre polynomials P; (cos @) = piml (cos @) for m = 0 (see 
(5.3.30), (5.3.65), (5.3.51), (5.2.13)), with dY? (8) = P; (cos 8). 
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Hence the right-hand side of (5.5.45) becomes equal to 
81? (J1,™13 Jj2,ma |0, 0) (0, O| j1, 4; j2, mg) 


= BIO tgs O ma Ome hae (251 an iia eae a | 


= 8155, 5,51m1,—maSmi,-mi, (251 + 1)" (—1)™272 (-1)?G2+™) (55.50) 


as follows from the general expression of the Clebsch-Gordan coefficients that 
will be obtained later in (5.6.14). Finally, we use the complex conjugate prop- 


erty of the Dp , (a, 3,7) function in (5.2.14) to obtain from (5.5.45) 


[ee fe vf dg sin 8 ( (oe) 3 (2,8,7)) Dina ey) 


= BI Ose O aime aut (5.5.51) 
271 4+1 7 


since (Si) 7eer) = 1, where jj, j2 are either both (non-negative) integers 
or are both half-odd integers (see also Problem 5.12). 

Clearly, the same procedure as above may be used to obtain the integral 
of the product of more than two Dery functions. 
Another useful orthogonality relation is the following one 


27 T 
Fa Gi) p's) _ 40051 529m’! 
ao | dsp sin 3 ( (v% (a, B,— a)) D in (a, 8, —a) (Qi, +1) 
(5.5.52) 
where, again, 71, jg are both either integers or are both half-odd integers. 
The proof of (5.5.52) follows by noting that the integration over a, on 
its left-hand side, imposes the restriction that m’ = m”. Accordingly from 
(5.2.10), (5.2.12) we may rewrite the left-hand side of (5.5.52) as 


Homme fa y fa a [as sing (DY), (a,8.7))" DY, (a,8.9) 


(5.5.53) 
which from (5.5.51) leads to (5.5.52). 


5.6 Explicit Expression for the Clebsch-Gordan 
Coefficients 


To obtain the explicit expression for the Clebsch-Gordan coefficients for 
the addition of any two independent angular momenta we may proceed as 
follows. Taking the upper sign in the recurrence relation (5.5.27) with m = j, 
mg = j+1-—-— mz, its left-hand side is then zero and leads to 
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V (jr + :m1) Gr — m1 + 1) (m1 — 1, m2 5,9) 


= —/(jo + m2) (2 — Ma £1) (m1, m2 — 13, j). (5.6.1) 


By successive replacements, m, — my +1— m,+2,...,J1 in this equation, 
we get the following chain of equalities 


(mm, — my | jpg) = —f BAP de I) 
; (j1 +m + 1) (j1 — m1) 


x igh) tig 9) 


; af ekg mi = 1) Ga pe 2) 
m +1, —m =A ’ = 5 5 
(m1 + 1,7 — m1 — 113,39) i Gey 


x (my + 2,7 —m — 2| 9,9) 


(5.6.2) 
iatj-hn+)(e2-j+3 
1, j— At Ugg) = yf BAIRD a9 
21 
SG a a7) 
Upon taking the product of these equalities we obtain 
(mi, Jj —m1|J,9) 
a Ga 5+ il] Gib Ge Bi = Ta) pe i 
=! ui eee ay ue we ve Hil 53) 
Vi)! Y G2t+5- a)! Gi - mi)! G2 - § + m1)! 
(5.6.3) 


[It is easily checked that the arguments of all the factorials are non-negative 
integers.] To obtain the expression for (j1,7 — ji|j,7), we use the unitarity 
condition 
So Mma, §-— mi 5,9)? =1 (5.6.4) 
mitme=j 


as obtained from (5.5.24), and the sum 


(j1 +71)! (Jo + ma)! 
(1 — m1)! (je — ma)! 


my +m2=J 


_G+nt fat Me =A t Mh = fat 3)! 
: (249 +1! (1 +j2—39)! (5.6.5) 
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with —7; < m, < ji, —Jo < Me < jo, established below. With the phase 
convention, generally referred to as the Condon and Shortley convention, 
defined by 

(159 — F199) = |G1.5 - A193) (5.6.6) 
equations (5.6.3)—(5.6.6) immediately lead to 


(ma, j — ma| J, J) 


=m | (247+)! + j2— 9)! 
Ge tyst Oli got Geary)! 


: atm et (5.6.7) 


(j1 — m1)! (J2 — J +m)! 


To obtain the expression for the general coefficient (m1, m2|j,m), we note 
from (5.1.26) that 


(j+m)! 1 feat 
(23)! G — m)! (Ry™ (JF yd) (5.6.8) 


i,m) = 


where 
J1 
.9)= YS Img — ma) (rm, 5 — ma | 7,9) (5.6.9) 


mi=—j1 


and hence amounts to evaluating 


(g+m)! 1 
(2)!G—m)! (iy—™ 


(amy, g — m1 |(J-)" 5,5) = (ma, — ma | j,m). 


(5.6.10) 
The evaluation of the expression on the left-hand side of the above is 
straightforward. To this end we use the binomial expansion 


Ce aS C (Ca a ON aa (5.6.11) 


where 


}—m (j — m)! 
4 k ) = ers (5.6.12) 


since Jj_ and Jz_ commute, to obtain from (5.1.26), (5.6.9), (5.6.10) 


Goer 


(nn 3,3) 
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+m) +m 
= s (Ja , ' (Ja yolk \|m+m2—j +k) 


= —m 
wines (it ! (Je 2) 


x (m1, m2|J, J) 


(5.6.13) 


(j —m)! ie my +k)! (je —ma+j—m-—k)! 
kl(j —-m-—k)! (ji tm k)! (ja + mg j+m+k)! 


with —j1 < m, < ji, —Jo < M2 < Jo, and the sum over & is restricted 
to all k non-negative integers such that the arguments of the factorials are 
non-negative. 

From the expression for (m,,m2|j,j) = (m1,j7 —m1|Jj,7) in (5.6.3) to- 
gether with (5.6.10) and (5.1.21), we finally obtain, the so-called Racah ex- 
pression of the Clebsch-Gordan coefficients: 


(m1, m2|J,m) = (j1,™1; Jo, Ma Jj,m) 

= 341 

< (in + do ~ JEG — m)L(F +m)! (Ha — m1)! (a — m9)! 
G+ i 


jab I) (jr — ga + 2) jo = fit 2)! Gi +m)! Ga + mz)! 
127 —* Gy tm +B)! Ga + — m1 — b)! 


x » ki (j —m—k)! (91 — m1 — k)! (Jo — jG +m +k)! (5.6.14) 


where now m; + m2 = m, and k is over all non-negative integers such that 
the arguments of the factorials are non-negative. The reality condition of the 
Clebsch-Gordan coefficients as given in (5.6.14) is to be noted. 

It remains to establish the expression for the sum as given in (5.6.5). To 
this end, for any two strictly positive integers a and b, the formal expansion 


= Kir g—a—ki (q 1-1)! 
= DE i (y) mi ( acne (5.6.15) 


gives immediately, upon the comparison of the product of the expansions of 
—b (a+b) 


(x+y) “+(a+y) ° with the expansion of (x + y)~ ‘ 
(atk, —1)! (b+ k2—-1)! = (a+b+k-—1)! (5.6.16) 
arses (a—1)! (b—1)! ky! ke! (a+b—1)! k! 
with k,,k2 non-negative integers. Upon setting 
ky = ji —m1, ko = jo — Me (5.6.17) 


@-l=je-fitj, b-l=fi-jet+j (5.6.18) 
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and hence with m; + m2 = J, 
k=hk+hoe=jitje-j (5.6.19) 


(5.6.16) leads to (5.6.5). 

In §5.5, we have added a spin 1/2 to an arbitrary non-zero spin with the 
Clebsch-Gordan coefficients given in Table 5.3. As an application of the gen- 
eral formula (5.6.14), one may consider the addition of a spin 1 to an arbitrary 
spin > 1. The results for the Clebsch-Gordan coefficients are summarized in 
Table 5.4. 


Table 5.4. The expression for the Clebsch-Gordan coefficients (j1,™m1; 1, m2|Jj,m) 
for the addition of an arbitrary 7; > 1 angular momentum to a jo = 1 one. 
oo jin jn jt 
1 (j1 — mM) — m+ 1) (j1 + m)(G — m+ 1) (j1 +m) +m + 1) 
251 (251 + 1) 2j1 (1 + 1) (2j1 + 1)(2j1 + 2) 
; Sargeras m (= 1G +m +1) 
ji(2j1 + 1) Vai +1) (291 + 1)(j1 + 1) 
i (ji +m+ 1) +m) (j1 — m)(j1 +m + 1) (j1 — mM) — m+ 1) 
291 (251 + 1) 2j1(g1 + 1) (2j1 + 1)(2j1 + 2) 


For future reference, we record the following particular Clebsch-Gordan 
coefficient in a unified manner (mj +m2=m) for j = ji: 


(j, m1; 1, m2| 7, m) 


1 1 
= SS [M14 2,0 F 


4 (9 +1) V2 


VG Fm1) GF £m1 + 1S, 41 


= (j,m|j,mi;1,mz2) (5.6.20) 


as follows from the above Table. 
Some symmetry properties of the Clebsch-Gordan coefficients are 
et | ( 


(j1,™1; J2,m2|Jj,m) = (-1)7"" j2,™23 j1,™1|j,m) (5.6.21) 


(j1, m1; ja, ma|j,m) = (—1)2 4-9 (j1, —misjo,-—me|j,—m) (5.6.22) 


: : jot 2j7+1 ,. 3 ; 
(Jay mas Jayma|.j.m) = (1) [5 Ga, mas jym| jams) 


(5.6.23) 
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To display such symmetry properties, it is more convenient to introduce 
the so-called 3-7 symbols, due to Wigner, defined in terms of the Clebsch- 
Gordan coefficients as follows: 


fj J Jji—je—m 1 : ; ; 
ee oreal - 5.6.24 
& mg :) ( ) 27+ 1 (J1, M13 Ja, M2 J, m) ( ) 


where we note that m1+m2+m = 0, and the 3-7 symbols are zero otherwise. 
Some symmetry properties of the latter symbols are 


GU DB GaN A a, Ot, IR Vent oe 2 Gi (5.6.25) 
my ™m2 M3 m3 M1 M2 m2 ™3 My fi 
Fe F293 Ve (— 1) tietia j2 Ki Js (5.6.26) 
my Mg M3 m2 M1 M3 a 


@ je 2) = (14 ( ji oh ds , (5.6.27) 


Mm, M2 ™3 —M 1 —M2 —™Ms3 


The following particular values of the 3-7 symbols are to be noted 


G fA -) =); if J= ji t+jo+ jis odd (5.6.28) 


as follows directly from (5.6.27), and 


i j2 9 2 [(J— 21)! (J — 2j2)! (J — 233)! 
€ 0 _) =a / (J +1)! 


(5/7)! 
* Gia l =p r= se) (5.6.29) 


if J=jitjot+ jz is even. 
From (5.6.24), (5.5.23), we also have the orthogonality property 


S523 +1) te J2 :) @ ja 2) =O pili Onis (5.6.30) 


: my mam) \mi mm 
j 


where m = —m, — m2 = —m‘{, — m4, and (5.5.24) leads to 
ge Pee ot ged ves, = . 
2 @ mg ) & mg wv) ~ (25+ 1° sae ae 


and the triangular condition on j, j1, jg is understood. 
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5.7 Vector Operators 


The operators X,P,N,L,S and the angular momentum operator J itself 
all satisfy the same commutation relations with J (see §2.3): 


[Vi, J] =ife¥*V* (5.7.1) 


where V denotes any one of the operators mentioned above, reflecting the 
vector character of these operators. Any such operator satisfying the commu- 
tation relations (5.7.1) is referred to as a vector operator. The commutation 
relations [V’,V2], however, may be different for different vector operators 
V. For example, the different components of P commute while the different 
components of S do not. 

We are interested in evaluating matrix elements of the form (j,m’|V|j,m). 
To this end, it is more convenient to define the spherical vector components: 


1 1 
V(m) = bm,oV? — yim + Fagin (5.7.2) 


m = —1,0,+1, where 


VeVi Si" (5.7.3) 


as opposed to the Cartesian components V!,V?,V°. 
It is straightforward to show from (5.7.1) that 


VCRs Sh meee os ave 2m ame Dias 
x Vimy, + mg). (5.7.4) 


In particular, for m; = ma, [J(m1) , V(m)] = 0, and there are no ambiguities 
associated with the notations V(+2), arising on the right-hand side of (5.7.4), 
since their coefficients are always identically equal to zero. 

We also have 


J(M) |j,m) =h fndas0 = VGFmGEm+ bas m+ M) 


= |j,m+ M) (j,m+ M|J(M)|j,m) (5.7.5) 
and 
(j,m'| J(M) =h n'da10 + VG +m!) (9 Fm! + 18a (j,m! — M| 
= (j,m'|J(M)|j,m’ — M) (j,m’ — M| (5.7.6) 


where M = —1,0,+1. 
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One may rewrite the commutations relations in (5.7.4) as 
[J (m1) 5 V(mga)| = (1, my, + M2 |J(mz1)|1, mg) Vim, + m3) . (5.7.7) 


By taking the matrix elements of (5.7.7) between (j,m’| and |j,m), and 
using (5.7.5), (5.7.6) we obtain 


(j,m’|J (m1)|J,m" — mz) (j,m" — m1 |V(me2)|9,m) 
— (j,m'|V(m2)|j,m + m1) (j,m + m4 |J(mi)|j,m) 
= (j,m'|V(m1 + me2)|j,m) (1,m1 + m2|J(m1)|1, m2) « (5.7.8) 
In particular we note that 
(j,m’ | J(0)|9,m) = hm by m- (5.7.9) 
For m, = 0,m2 — M, (5.7.8) then gives 
h(m! —m— M) (j,m'|V(M)|j, m) = 0 (5.7.10) 


for the spherical vector components of a vector operator. That. is, 
(j,m’ |V(M)|j,m) is necessarily zero unless 


m’ =m+M. (5.7.11) 
From (5.7.5), we note that 
(1,2M|J(M)|1,M) =0 (5.7.12) 
accordingly, for m1 = mz = M, (5.7.8) gives, upon using (5.7.11), that 
(i,m+M|V(M)\j,m) _ (j,m+2M|V(M)|j,m + M) 


=> : , (5.7.13) 
(j,m+M|J(M)\j,m) — (j,m+2M|J(M)|j,m + M) 
That is, in particular, 
(j,m41|J(£1)|j,m) (7, m+ 2|J(£1)|j,m + 1) 
Similarly, for m, = F1, mg = £1, m’ = m, (5.7.8) leads to 


(j,m|J(F1)|9,m £1) G,m £ LV (419, m) 


— (j,m|V(41)|9,m + 1) G,m F 1 I(F1)|9,m) 


= (j,m|V (0) |j, m) (1,0 J(#1) ve ) 


= +h(j,m|V(0)|j,m) (5.7.15) 
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where we have used (5.7.2) for J(+1) in writing the last equality. Upon using 
(5.7.14), we may rewrite (5.7.15) as 


(j,m £1|V(+1)|3,m) 
(j,m + 1|J(41)|3,m) 


= +h(j,m|V(0)|j,m) . (5.7.16) 


{i,m FFD i,m £1)? = [mF LTDA mpl? b 


For V(tl) = J(+#1), the expression in the curly brackets is 
+h (7, m|J(0) |j,m). Accordingly, (5.7.16) simplifies to 


(j,m + 1|V(+1)|9,m) 
(j,m 4 1|J(41)|j,m)” 


(j,m|V(0)|9,m) = (j,m|F(0)|9, m) (5.7.17) 


From equations (5.7.14), (5.7.17), we then conclude that for M = —1,0,1, 


1 (j,m!|J(M)|j,m) 


Gm! IV(M)|i.m) = 


C(V,3) (5.7.18) 


where C(V,j), a proportionality factor, characteristic of the vector operator 
V, which may depend on 7, but is necessarily independent of m, m’, M. It 
should be noted that (5.7.18) is valid for all m,m’ = —j,-j +1,...,7. The 
1/./7 (g + 1) factor is inserted for convenience. Equations (5.7.5) and (5.6.20) 


allow one to rewrite 
(j,m’ |J(M)|j,m) 
h/7G +1) 


= (j,m’|j,m;1,M) (5.7.19) 


appearing in (5.7.18), in terms of a Clebsch-Gordan coefficient as given in 
(5.6.20). 
It has become customary to denote the proportionality factor C(V, 7) in 
(5.7.18) as 
C(V, 4) = (IIVIld). (5.7.20) 


Don’t let the notation scare you. Due to its independence of m, m’, it may 
be formally evaluated and defined by 


(j,m'|V(m)|3,™) 
(j,m’|j,m;1,M) 


GIIVI|9) = (5.7.21) 


for any allowed and conveniently chosen values for m’, m, for which the 
evaluation of the expression on the right-hand side of (5.7.21) is unambiguous. 
This will be done explicitly later for various cases. 

From the definition of V(M) in (5.7.2), (5.7.3), we may rewrite (5.7.18) 
as 
(j,m'|J|j,m) 


nJiG +1) (FIV II) - (5.7.22) 


(j,m'|V|j,m) = 
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This is a particular case of the so-called Wigner-Eckart theorem that will 
be established quite generally in the next section. 
For V = J, (5.7.22) gives 


(i|IS|I9) =hVIG + 4). (5.7.23) 


The factor (j||V||7) is referred to as a reduced matrix element of V. [Some 
authors provide a different definition of a reduced matrix element by mul- 
tiplying the latter by a given function of 7. In any case such an additional 
normalization factor may be absorbed in the definition of a reduced matrix 
element.| 

As shown below, the reduced matrix element (j||V||j) element may be 
explicitly written as 


(j,m|V + J|j,™) 


cea (5.7.24) 


GIIVILs) = 


(with 7 4 0) and due to its independence of m, it may be evaluated for any 


of its values. 
To show (5.7.24), note that 


V-J= (-1)" V(-N) J(N) (5.7.25) 


and hence 


(j,m|V + J|j,m) 


= SF CD" Gm|V(-N)|d,m + N) Gm + N|J(N)|5,m) 
N=0,+1 


= (—1)™ Gj,m|J(—N)|j,m + N) (j,m+ N|I(N)|j,m) 


WAINALE) 
Ry/9 (9 +1) 


= (-1)™ Gj,m|J(—N) JON) |3,m) GIIVIL9) 
N=0,+1 


_ (ipm|S? i,m) GIIVINd) (6196) 
hy/j (G +1) ~ 


which upon using the fact that J?|j,m) = h?j(j+1)|j,m) establishes 
(5.7.24). In writing the second equality in (5.7.26) use has been made of 
(5.7.22). 
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We may then rewrite (5.7.22), in the convenient form 


(j,m|V + J]j,™) 


PAGED (5.7.27) 


(j,m'|V|j,m) = (j,m"|J|j,m) 
for the matrix elements of a vector operator V we were seeking. 
As an application of (5.7.27), we evaluate the matrix element 
(j,m! |(J3+3S°)|j,m). This latter will occur in our study of the Zeeman 
effect in §7.9. To this end, (5.7.27) leads to 


; 2 , 
- rl ( 73 4 @3)| 5 _— ls ml | Bla (j,m|(S° +8 -I)|9,m) 
(j,m’ |(J° + 8°)|j,m) = Gm! |J°|j,m) fej (j+)) 
sess | 
=AMbmm 1-4 — 4 5.7.28 
” { hj (9 +1) 
Upon writing 
1 
S-J=5(J°-L’+8°) (5.7.29) 
and carrying out the expansion 
lim) = SY > |b me; 8, ms) (me; 8, ms | j,m) (5.7.30) 
metms=m 
where j = |€—s|,...,@+., we obtain for the expression in the early brackets 
in (5.7.28) 
(jg +1) -—e(€4+1)4 + 1 
i eae ae eae ea a (5.7.31) 
25 (j +1) 


The states in (5.7.30) are simultaneous eigenstates of the commuting opera- 
tors J*, S*, L?, J°. [Here we note that [J?, 5] 4 0, [J?, L?] 401] 

In particular for s = 1/2, € = 1,2,..., (5.7.31) gives the famous Landé 
g-factor 


j+1/2 
= 7.32 
aan 8 1/2 (5.7.32) 
where j = £+ 1/2. For €=0, s = 1/2, (5.7.32) gives g = 2. 
Another expression for (5.7.27) may be also provided by noting that 


[V-J,J] =0 (5.7.33) 
and hence with V replaced by V- JJ in (5.7.27), we obtain 
(jm! |V»SI]j,m) = j,m!|Jlj,m) (j,m!|V-Tljem). (6.7.34) 
This then allows us to rewrite (5.7.27) simply as 


(j,m'|V + J J|j,m) 
hj (7 +1) 


(j,m'|V|j,m) = (5.7.35) 
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. 


= (im|¥ IJ 


J 


a 


(5.7.36) 


(for 7 # 0). This formally shows that only the component of V along the 
angular momentum may contribute to the matrix element (j,m’|V|j,m). 

The transformation of a vector operator V under arbitrary finite rotations 
may be worked out from (5.7.1). To this end let (see (2.3.18), (2.3.43)) 


exp (= en) V' exp ( en-3) = F*(y) 
for a given fixed unit vector n. Then (5.7.1) gives 


ZF) =-n x P(e). 


On the other hand, let (§2.1) 
G' (~) = VER" (¢,n) 


then F 
qo oY) = x Gl¢). 
yp 


With the boundary conditions: 


G (0) = V = F(0) 


we may infer that 


exp (= pn) V' exp (; en-3) = V* R* (y,n). 


h 
As an application of (5.7.42), consider 
eg? /h 72 ib J? /h — Ji R25 ($,x°) 
where x° is a unit vector along the x? axis, and (see (2.1.4)) 
R?) ($, &°) = 57 cos — 54 sing 


thus giving 
e ib J?/h 2 ib J? /h ee | 
with n now given by 
n = (sind, —cos¢,0). 
In particular (5.7.45) implies that 


ei 8 /RGi8I? /Agid I? /h _ exp F 0 Ce ae) 


(5.7.37) 


(5.7.38) 


(5.7.39) 


(5.7.40) 


(5.7.41) 


(5.7.42) 


(5.7.43) 


(5.7.44) 


(5.7.45) 


(5.7.46) 
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= exp E On: 3| (5.7.47) 


We will make use of this identity in 85.10. 
Finally we note that for two vector operators V1, V2, the product V,-V2 
commutes with J: 


[J,Vi- V2] =0 (5.7.48) 
as expected and is readily checked from (5.7.1). 


5.8 Tensor Operators 


From the previous section, we recall that a vector operator is one such 
that its spherical components satisfy the commutation relations (see (5.7.4)) 
(m = —1,0,1; M = -1,0,1): 


[J(m) ,V(M)| =h Momo + ary Gi =M)(J4M+ 1)dm,41 


x V(m+M) (5.8.1) 


with J = 1, reflecting the vector character of the operator in question. 

Equation (5.8.1) naturally leads to define a tensor operator T# of rank 
K with components specified by M = —K,—K +1,...,K, as one satisfying 
the commutation solutions 


[J(m) ,Tiz] = h | Mono F eV (K FM) (KEM 4+ 1)bm41] Ti 


(5.8.2) 
by simply changing the J = 1 value to general K = 1/2,1,... for the possible 
values of an angular momentum. The spherical components J(0), J(£1) of 
the angular momentum operator J are defined in (5.7.2). 

As a generalization of the relation in (5.7.18) (see also (5.7.19),(5.7.20)), 
we establish the following one 


(i',m' |Txr|5,m) = (7, m"|5,m; K, M) (3 ||T*||3) (5.8.3) 


where ( 4" | ee || 3) is a reduced matrix element, which may depend on j’, 7, 
K, but is independent of m’, m, M. Here 


m' =M+m (5.8.4) 
j=(9-K|,|f Kl] +1,...,9+K. (5.8.5) 


Different normalization are used by different authors to define a reduced 
matrix element (j’ [7 ||7). This is done by multiplying (5.8.3) by a given 
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function of 7. We have chosen such a factor to be one to make direct com- 
parison with the vector operator case defined earlier (see (5.7.18), (5.7.19)). 
In any case, as we have done here, such an additional normalization factor 
may be absorbed in the definition of a reduced matrix element. 

The result embodied in (5.8.3) is referred to as the Wigner-Eckart The- 
orem. As a Clebsch-Gordan coefficient is a geometrical factor, the physical 
properties of T/* enter only in the reduced matrix element. In (5.8.3), we 
have suppressed other quantum numbers which may be characteristics of the 
physical system at hand. 

To establish (5.8.3), we define the ket vector 


le ims 5,K)) = SY) TH |d,m) (i,m; K, M|j/,m') (5.8.6) 
m+M=m’ 
where |j — K| < 7’ <j +K in integer steps. 
We multiply (5.8.6), by (7’,m’|7,m”; K,M”), sum over j’, and use the 
completeness relation (5.5.23), to obtain 
j+K 
Tis lim) = > [bj m's 9, K)) (i'm! |5,m; KM). (5.8.7) 
j'=l9-K| 


Upon applying the operator J(M"), to (5.8.6) where M’ = 0,+1, writing 
J(M') Tx = [J(M’) TH] + TH I(M’) (5.8.8) 


using (5.7.5) for J(M’) |j,m), and the explicit commutation relations (5.8.2), 
we obtain by making a change of the summation variables m, M in (5.8.6), 
and finally using (5.5.27) that: 


JM) |b (9, ms 5, K)) = h {mbar o FV (i Fm) (jem! + Téa} 


V2 
x |p (9’,m' + M';j,K)). (5.8.9) 

Equation (5.8.9), in particular, implies from (5.7.2) that 
J |b (9',m'; j, K)) = fim! |b (7, m's 5, K)) (5.8.10) 


Ja |b (j',m'sj, K)) = h(i Fm’) (Em! +1) |W Gm! + 15, K)) 


and from (5.11) or (5.12) 
J’ wim’ 9, K)) = PF (9 + VY) Ib ms 5, K)). (5.8.12) 


With the underlying assumption that we have at hand, together with 
J°, J° a complete set of commuting operators, with the |j’,m/’), as before,” 


? For simplicity of the notation, we suppress additional quantum numbers that 
these states may depend on. 
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denoting their simultaneous eigenstates, we conclude that |2) (j’,m’; 7, K)) is 
proportional to |j’,m’). That is 


Ip (9',m's 5, K)) = |9’,m’) a’, m'; j, K). (5.8.13) 


It is easy to show, however, that a (j’,m’; 7, K) is independent of m’. To this 
end (5.1.23) reads 


Ja. |j',m!) = Cs (j',m') |j!,m! £1) (5.8.14) 


where Cx (j’,m’) are the coefficients defined in (5.8.11). On the other hand 
(5.8.11), (5.8.13) together (5.8.14) then give 


a 


+ | (9',m'; 7, K)) = |7',m! £1) Cz 9m’) a (7, mj, K) 


— 7’, m' ae 1) Cy (j',m’) a (7's m' + 139, Kk) (5.8.15) 


implying the independence of the factor a (j’,m’; 7, K) of m’. Hence, we may 
rewrite (5.8.13) as 


lv Gms 5,6) = 9m’) aGs5, K). (5.8.16) 
Multiplying (5.8.7) by (j7”,m”|, using (5.8.16), immediately leads to 


(jm! |Txy 


j,m) = (j',m'|j,m; K, M) a(9'; 5, K) (5.8.17) 


which is the desired result quoted in (5.8.3) upon appropriately identifying 
a reduced matrix element, characteristic of the operator Te. and as shown 
above it is necessarily independent of m, m’, M, where m+ M =m’. 

In particular, for K = 1, we have Ti, = V(m), and for j’ = j, (5.8.3), 
reduces to the expression for the vector operator obtained earlier in (5.7.18)— 
(5.7.20). For 7’ not necessarily equal to j, (5.8.3) generalizes (5.7.18) to 


(7',m'|V (M)|j,m) = Gj',m"|j,m;1,M) (7"IV ||) (5.8.18) 


where m’ = m+ M,j’=|j—-1],...,9 +1. 
From (5.8.16), (5.8.7) we also have the following expression for the action 
of the operator T/#* on the angular momentum states |j,m): 


j+k 


Tit jm) = So [i'sm’) GIDE ||9) Gm’ |3,m; K,M) (5.8.19) 
j'=|i-K| 


where m’ =m+M. 

Out of two rank K,, K2 tensor operators vie ‘ie one may construct a kK 
rank tensor operator Tj, where M = M,+ Mo, K =|Ki — Ko|,..., Ki+Ko 
as follows 
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F(K,Ki,Ko)TH = So TAP THE (Ki, Mi; Ko, M2|K,M) (5.8.20) 
M,+M2=M 


as is readily checked, where F (K, K,, K2) is, in general, a function of K, K,, 
Ky and is necessarily independent of M. 

We may also invert (5.8.20) and use the completeness relation (5.5.23) to 
obtain 


ki +Ke 
Ty Tip = >. THF (K, Ky, Ko) (K,M|Ki, Mi; Ko, Mp). (5.8.21) 
K=|K,—Ko| 


An important application of tensor operators is to the spherical harmonics 
studied in 85.3. 

To the above end, for any non-negative integer L, we define a tensor opera- 
tor ¥z of rank L, with components specified by M = —L,—L+1,...,0—1,L, 
as follows. The spherical harmonics Yz 1 (Q) (§5.3), Q = (6,¢), are defined 
by 


(Q]¥i7|2') = 6 (Q — O') Yr (Q) (5.8.22) 
where 
52-2) =* oe 56-9!) (5.8.23) 


with Y,4 satisfying the usual commutation relations in (5.8.2), with the or- 
bital angular momentum states, 


1 
[F(m) Ye] =h [Mono ae pV UE + M)(L£M+1)bm41] Vitas 
(5.8.24) 
and the orbital angular momentum states given by 
|L, M) = [oo Yi4 |0’) (5.8.25) 
where 
dQ = sin 6 dé d¢. 
From (5.8.25), we then have 
(Q|£,M) = Yiu (Q) (5.8.26) 


as expected. 
Also 


J(m) IE, M) = far [J(m) , Yi] ja’) + faa Yi I(m)|Q’). (5.8.27) 


The second term on the right-hand side of this equation may be rewritten as 
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So fase Vs TE) [mV YE ye IO) 
L',m! 


= V4n Yi J(m) |0,0) = 0 (5.8.28) 


using the orthonormality of the spherical harmonics, with Yoo (Q) = 1/47, 
and then using (5.7.5). 
From (5.8.24), (5.8.25), (5.8.27) we then obtain 


1 
JI(m)|L,M) =h | Mémo + av + M)(LiM+ 16m |L,M +m) 
(5.8.29) 
yielding in particular to 
J°|L,M) =hM|L, M) (5.8.30) 
J? |L,M) =f? L(L+1)|L,M) (5.8.31) 


as expected. 

Now we use the general expansion in (5.8. a for the tensor operator Y,‘. 
Upon taking the matrix element (Q| - |’), of (5.8.21), integrating over 11’, 
and using the property 


mi” m2 


Joo keh ee calle ee Joo dan” (Q/¥7 |") (Q” |¥;72 |0’) 


we obtain 


F (0,1, C2) Yem (Q) — Se Yeymy (Q) Yeome (Q) (€1,7m1; 2, m2 | &,m) : 
mi+tme2=m 
(5.8.33) 
To evaluate the factor F' (£, ¢,, £2) explicitly we set 9 = 0, ¢ = 0, use the 
relation (5.3.50) to obtain from (5.8.33) 


20, +1) (20 +1 
Pittvte)= yf para ) 101,05 l0,0)£,0). (5.8.34) 


From the inverse relation (5.8.21), we have from (5.8.33), (5.8.34), 


£1 +2 

20; +1) (2041 

Yoym, (Q) Yegm, (Q)= > Ye mi wae 
l=|l1—£3| 


x (£1, 0; £0, 0| 2,0) (€,m| £1, m1; £2, M2) , (5.8.35) 
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Upon multiplying this equation by Ypsm, (Q), integrating over 9, and 


using the orthonormality of the spherical harmonics, we obtain the following 
useful integral involving three spherical harmonics 


[2 Yezm, (2) Yiams (®) Yon, (®) 


dy OLED OR 4A 
= (-1) y aes, ) (€,,.0; £2, 0|€3,0) (eg, ms |x, 713 £2, ma) 


4n (203 +1) 


= ce (25 + 1) (203 +1) a bo i ie lo bs 


An 0 0 0 my, ™m2 —™Ms3 


) (5.8.36) 


where m3 = mi+ma, 3 = |€, — £0], |/€1. — €a|+1,..., 1 +22, and otherwise the 
integral (5.8.36) is zero. In using the orthogonality relation between Yom (Q) 
and Yim, (Q), we have used the fact that (see (5.3.44)) 


Yo, ms (®) = (-1)™ Yebmng (Q) (5.8.37) 


In writing the last equality in (5.8.36), we have used the definition of the 3-j 
symbols in (5.6.24) and the reality of the Clebsch-Gordan coefficients. 

As a concrete non-trivial application of the integral (5.8.36), consider the 
case (9 = 2, my = 1, 


1 . 
Yo1 (Q) = —4/ - sin 0 cos 6e'® (5.8.38) 


then for 0; = 3 = 0, m3 = m’, m, = m, (5.8.36) leads to the following 
matrix element 


(£,m' |sin 6 cos be'*|¢,m) = —\/(€— m) (€+m-+1) 


(2m + 1) 
poetry imme — (5839) 


Equation (5.8.35) also leads to useful recurrence relations for the spherical 
harmonics. For example, for 2; = 1, m, = 0, 


Yi 0 (Q) = 4/ = cos 0 (5.8.40) 


mz =m, and (5.8.35) leads to 


cos 9 Ye m (Q) = i" 2 7 ; a *) Ye+1 m (Q) 
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+ 4/ Yom (Q). (5.8.41) 


For = 0, the second term should be set equal to zero, and m = 0. 
Equation (5.8.41) gives directly the matrix element 


(€,m’|cos @|€,m) =0 for all m’,m (5.8.42) 


and all @. 

On the other hand, upon multiplying (5.8.41) by cos@, and using the 
recurrence relation (5.8.41) one more time leads to the following matrix ele- 
ments 


[20? + 2-1 —2m?] F 
(fat QEgay. Oe 


(é,m’ |cos” 


tym) = (5.8.43) 


The expression in (5.8.43) may be generalized as follows. 


Let n = (cos ¢sin 6, sin dsin 0, cos@), then quite generally, by using the 
symmetry of the product n’n/, we may write 


(L'Li + LIL’) 
h2 


CG m |n'n? 


Ay 649 + Be 


é,m) = (: m' 


£, m) (5.8.44) 
where Ap, By are to be determined. Since n‘n*t = 1, L'L' |é,m) = L? |€,m) = 
h?0(€+1)|é,m), we obtain 

3A = [1 —2By £(€+ 1). (5.8.45) 


On the other hand for i = j = 3, we have from (5.8.44), the fact that 
(L*)? \é,m) = hm? |é,m), and (5.8.45) that 


(20? + 2€—1) 1 


ne; Qf) 063) OFS 3 


(5.8.46) 


That is, 


1 
(20— 1) (2043) 


iG m! |n'n? 


é,m) = (£,m’| (20 + 2-1) 5” 


(L'L) + LIL") 
h2 


\é,m) . (5.8.47) 


This equation will be useful in our treatment of the hyperfine structure 
of the hydrogen atom for any @ in 87.6. 

Finally we use the integral (5.8.36) to evaluate the reduced matrix element 
(é'||Yx||@) for the spherical-harmonic-tensor operator, and also provide an 
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important application of the integral in (5.8.36) involving three spherical 
harmonics. 

To the above end, the integral on the left-hand side of (5.8.36) may be 
rewritten as 


(—1)"" (€,, m1 |Y,72, |€3, ms) (5.8.48) 


which for m; = 0, mz = 0, m3 = 0 leads to 


t3,0) oo i +1) ee 1) (203 + 1) ({ c a (5.8.49) 


(1,0 hg 


which upon comparison with (5.8.3) we obtain 


(6, |B I[f5) = (yee (CBF DCE +Y (hee) 850) 


where we have used (5.6.24), and the right-hand side of (5.8.50) is zero if 
l, + £2 + £3 is odd (see (5.6.28), (5.6.29)). 

We close this section by giving an important application of (5.8.36), in- 
volving the integral of the product of three spherical harmonics, in evaluating 
the matrix element (¢,,m1|V (|r1 — re|)|€2, m2) of a potential depending on 
the distance 

1/2 
|r) — re| = (r? — 2rir2cos6 + r3) (5.8.51) 


between two particles. To do this, one may expand 


V(\ri — ral) = S_ Ve (ri, 72) Pe (cos 8) (5.8.52) 
l=0 


in terms of Legendre polynomials, where? 
2@+1) [* 
Ve (11,72) = oe | cos 6 d@ V(|r1 — r2|) Pe (cos 8) . (5.8.53) 
0 
Hence upon using the identity (see (5.3.30)) 


At 


Py (cos 6) = Qe+1) 


Ye 0 (9, 4) (5.8.54) 
the above matrix element (¢1,™m1|V (|r1 — r2|)|€2, m2) becomes 


Svil (ri,r2 Vara (01, m1 |Y¥¢|é2,m2) . (5.8.55) 


3 For example for the Coulomb potential A/|r1 — re), Ve (r1,r2) = A(r</rs)! /rs, 
re =min(ri,r2),r> = max(ri,1r2). 
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5.9 Combining Several Angular Momenta: 6-7 and 9-7 
Symbols 


In this section, we consider the problem of combining several independent 
angular momenta. More specifically, the combination of only three or four 
angular momenta are considered. 

One can combine three angular momenta with quantum numbers 71, j2, j3 
in more than one way. We may, for example, combine j1, j2 to form a quantum 
number jig and then combine the latter with j3 to form a final quantum 
number 7. On the other hand, we may, for example, combine j2, 73 to form 
jo3 and then combine the latter with j; to form a final j value. 

Accordingly, one may define (see §5.5, (5.6.24)), the following states: 


7,5 jira GaJ2) + Js) 
= f2jtly%jetl So (1s ecmtecamma 


M1 ,M2 773,12 


. @ jz Jie ve ja J ) birvm) lj2,™2) |J3,™m3) (5.9.1) 


my Mm: —™M12 my2 m3 —™M 


with obvious constraints on the summation variables m1, m2, m3, M12 un- 
derstood, and 


13,5 J15 J23 (J2J3)) 
S/H tial, - 1 y eee sidan 


™1 4,722,773 ,M23 


. @ joa he js jes ) basa) bis) lj3,m3). (5.9.2) 


my m3 —™M m2 M3 —M23 


The two states on the left-hand sides of (5.9.1), (5.9.2) are related by 


Li, ms 1, J23 (jada) = / Bios FI (-1) #4949 > Big F 1 


ji2 


* limi jna hie) he) {B92 7S (6.98) 
J1 J23 J2 
where the orthogonality relations (5.6.30) give 
ji ja J3 = . 4) Udit Dilit dine jl j2 J3 
faBapreety Eo ee 


MiNi 


% é fo. te ) ( f1 je 3) & ly ae (5.9.4) 
my, Ng —N3 ny M2 N3 ny —ng2 ™M3 
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The objects ;. : , are referred to as 6-7 symbols. Some properties of 
1 £2 £3 


the latter are the following. They are left invariant by any permutations of 


the columns, e.g., 
Ji gede | = \iade WNW. jag gs 
i" to : z ly a = y ty Rt sp2D) 
A 6-3 symbol remains also invariant under the interchange of upper and lower 
arguments in each of any two columns, e.g., 


Ji J2 Js £1 jo ls fy fo js 

=e fe ok eee ‘ 5.9.6 

iu ly A . ly | es j2 3 ( ) 
In particular, they satisfy the orthogonality relation 
NRIVJHAI27F| _ -1 


) 


and the following sum rule 


G++! ry 925 Side? = Sak 
a 1) @i+n{h t, 4 ee ‘s a = ie t, Ae (5.9.8) 
j 
As an example, consider a particle composite of two particles of spins S,, 
S2 and relative angular momentum (see §2.7, (2.7.40)) L,. residing in their 
center of mass. Then the total internal angular momentum of the composite 
particle is given by (see (2.7.39), (2.7.40)) 


S=S8,+S8e4+L, =J,4+J2+J3. (5.9.9) 


For definiteness, suppose S$; denotes the spin of a proton: s; = 1/2, and Sg 
denotes the spin of a deuteron taken to be sg = 1. Then j12 = 1/2 or 3/2. 
Consider ¢ = 1, and that the total spin of the composite particle corresponds 
tos= 3/2 and J23 =2; 

According to (5.9.3), 


|s = 3/2,m; 5, = 1/2, 523 =2) = V5 SY) |s =3/2,m; jr (8182), 43 = 1) 
ji2=1/2,3/2 


x Get D{ 7 si ip (5.9.10) 


where we have used (5.9.5), (5.9.6), and the numerical values of the 6-7 sym- 
bols may be evaluated from (5.9.4) to be 


Be. a! £4 1 211 1 
tp 3/2 iyo} = aR re 3/2 oh = 5 (5.9.11) 
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That is, 


. 5) . . 
|s = 3/2,m; 81 = 1/2, jo3 = 2) = fs = 3/2, m:; jie ($182) = 1/2, 73 = 1) 


1 : ; 
at fils = 3/2, mM; ji2 (8182) me 3/2, 93 — 1) 
(5.9.12) 


and the probabilities that in the state |s = 3/2, m; 5, = 1/2, j23 = 2), the pro- 
ton spin and the deuteron spin combine to give the values jj2 = 1/2,3/2, 
respectively, are 5/6 and 1/6. These also check out the correctness of the 
normalization condition. 

For completeness, we note that the 6-7 symbols may be also rewritten 
from (5.9.3) in terms of amplitudes as 


(9, jr2 (J1J2) 33 |9,M5 Jr, J23 (J293)) 


= (1 JO DG tD eI) (69.13) 
J23 J1 J2 
where for the numerical example above, the corresponding amplitudes are 
equal to \/5/6 or \/1/6 for j12 = 1/2 or j12 = 3/2, respectively. 

Finally, we consider combining four angular momenta with quantum num- 
bers 71, j2, J3, ja. As before, there are many ways of combining these angular 
momenta first in pairs. These lead to the definition of so-called 9-7 symbols, 
generalizing further the expression (5.9.13) for 6-7 symbols, to four angular 
momenta. Specifically, 


(9,™; jr2 (JiJ2) , 334 (isda) 19,5 J13 (193) 5324 GaJa)) 
ji jo re 
= V(2j12 +1) (2534 + 1) (213 + 1) (2j2a +1) ¢ 53 Ja Jaa P (5.9.14) 
J13 Joa J 


where we note that for any row (abc) or column (abc)? of the 9-7 symbols, 
the addition rule of angular momenta |a — b| < c < (a + D) is satisfied. 
The 9-7 symbols may be evaluated in terms of 6-7 symbols as follows: 


Jr Jr2 jis Ag 
Jar J22 J23 p= S> (oh! Gr) 
J31 J32 J33 j 


. ve J21 ee J22 e} ‘ie J23 a (5.9.15) 

J31 J33 J J21 J J23 J Jil ji2 
The 9-7 symbols satisfy particularly the following symmetry relations. A 
9-7 symbol remains invariant under each of the following transformations: 
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cyclic permutations of its columns, cyclic permutations of its rows, a trans- 
position as for a 3 x 3 matrix. 
Orthonormality of states leads to the rule 


Ju fiz Fis) [du Jr2 dis 
S- (2313 + 1) (2j23 +1) 4 Jar Ja2 Jos > 4 J21 J22 J23 
jis.Jes Js1 J32 J33) (331 332 J3s 


6 (j31; 931) 6 (J32; 932) 
=U 5.9.16 
Cutis) ee 


Ultimately, the 6-7 and 9-7 symbols are evaluated in terms of Clebsch- 
Gordan coefficients or equivalently in terms of 3-7 symbols as defined in 
(5.6.24). The explicit expressions are given, respectively, in (5.9.4) and 
(5.9.15). 


5.10 Particle States and Angular Momentum; Helicity 
States 


In this section, we use some of the details worked out on angular mo- 
mentum to define and construct one- and two-particle states with or without 
spin. These particles may be composite of several particles as discussed in 
§2.7. For example, a given particle may be the deuteron of spin 1, composite 
of a proton and a neutron. For greater generality, we also consider so-called 
helicity states (see also §5.4) for which the projection of the spin of a parti- 
cle is taken along the direction of its momentum instead of the traditionally 
taken z-axis. These latter states are also important for a relativistic treat- 
ment and are essential for describing zero-mass particles. As we have already 
developed the whole machinery to construct helicity states as well with no 
difficulty, they are worked out here for completeness. The reader may wish 
to skip over the construction of helicity states at a first reading. 


5.10.1 Single Particle States 
Spin 0 
Consider a particle with momentum along the z-axis: 
Po = p (0,0, 1) (5.10.1) 
and a corresponding state (po| satisfying 
(Po| P = Po (Po| (5.10.2) 


where P is the momentum operator. 
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To define a state (p| of arbitrary momentum 
p = p(sin @ cos ¢, sin @ sin ¢, cos 8) (5.10.3) 


as obtained from the state (po|, we use the definition (2.3.5) /(2.3.43) which 
amounts to rotating the coordinate system appropriately instead. To this 
end, we consider the bra 


(Po| lexw (0 . )] (5.10.4) 


where 


n = (sin d, — cos ¢, 0). (5.10.5) 


Since we are considering spin 0, J = L. The state in (5.10.4) is actually the 
state (p| of arbitrary p as given in (5.10.3). To see this, apply the momentum 
operator P to it: 


(pol exp (- 0 5) P' = (po| P“ exp (- jn 3) (5.10.6) 
where 
P” = exp (jon . 5) P’ exp (0m . 5) 
= P* R** exp(0,n) (5.10.7) 


and in writing the equality, we have used the vector property of P as given 
in (5.7.42). Hence 


(polesp (—jm-3) P= 26R* (On) (Polexp(—Fn-F) (6.108) 
and from (2.1.4) we explicitly have 
R*(6,n) = 6° cos 6 — e)?nJ sind (5.10.9) 
pp R* (0,n) = p' (5.10.10) 
with p’ as given in (5.10.3). That is, 
(Po| exp (-j0n-3) P = p (po| exp (—j0n-3) (5.10.11) 


and 


(P| = (Pol lexp (imma) (5.10.12) 
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We note that 
(Po| L° = (po| (XP? — X?P*) =0 (5.10.13) 
since [X1, P?] =0, [X?, P!] =0, and py has zero components along the 2? 
and x! axes. That is, (po| is an eigenstate of L? with m = 0. 


We introduce simultaneous eigenstates of the commuting operators 
P?, L?, L® denoted by (p, é,m|: 


(p, €,m| P* = p? (p, £,m| (5.10.14) 
(p, £,m| L? = h7e(€ + 1) (p, £,m| (5.10.15) 
(p, ,m| L? = hm (p, é,m| (5.10.16) 
normalized as follows: 
(p,£,m|p',,m’) = (ony PP Serban (5.10.17) 


The following properties of the amplitude (pg |p’, £,m) are obvious 
0 = (po|L* |p’, £,m) = hm (po |p", £,m) (5.10.18) 
0 = (Po| (P? — vp”) |p’, €,m) = (vp? — p”) (po |v’, £m) - (5.10.19) 
Accordingly, we may write quite generally 
(Do|p's tm) = Anny “PIPI5,0cy(p) (6:10.20 
where C2(p) is to be determined. 


We expand the state (p| in terms of the states (p, 2, m| and hence introduce 
in the process the identity operator 


i= Dey a Ip’, £,m) (p',£,m| . (5.10.21) 
= 0 (27h) ? >) ’ Et 


From (5.10.12), (5.10.20), (5.10.21) we then obtain 


(p| = Ze (10 lexp (0 3)] 


where we have used the normalization 


tm) (p, £,m| (5.10.22) 


‘ t 
(p',,0| lexp (0-3) | Ip, &,m) 


310 5 Angular Momentum Gymnastics 


_ / 
= (2nh)° Pin, (: 0 


tm) (5.10.23) 


i T 
lexp (0 : 5)| 
since J commutes with both L? and P?. 
From the identity (5.7.47) and (5.3.65), (5.3.66), we have 


i T 
£,0 lex (0-3) | l,m 
a («0 exp (- 704°) exp (;04") exp (; 6°) lm) 


= Di (¢,6,0) 


At 
=] warm (0, ¢). (5.10.24) 


Equation (5.10.22) then becomes 


An 
(l= So Cel) gpg Fem 16) sto (5.10.25) 


and 


(p|p’, £m) = (2h)* Ce(p) 4/ va ; oP Py,, (0,0). (5.10.26) 


Pp 
Finally, from the equality 


5 (p" — p') d°p 
rh 3 5 * Oia ee = ie 1! ml! i Lom! : 
( )° dere p? (p |p) (27h)® (p|p ) 
(5.10.27) 
it is readily verified from (5.10.25), that we may set 
2l+1 
= => wl 2 
Ci(p) = Ce= (5.10.28) 
thus obtaining 
lee) 54 
(pl =S> SZ Yim B) (p,é, ml (5.10.29) 


£=0 m=—£ 


where p is the unit vector (sin @ cos ¢, sin @ sin ¢, cos #) along p. 
From (5.10.29), we also obtain 
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(p,8,m| = Jeo Yim (6,0) (| (5.10.30) 


and note from (5.10.17), the correct normalization in (2.4.15) adopted for the 
states (p|. 

Before treating arbitrary spins we also consider a coordinate description 
of the above. To this end, as in (5.10.4) 


: t 
(x| = (z0| lex» (010 . 5)| , |xl=lzel=r, x=rx (5.10.31) 


n’ = (sin@g’,—cos ¢’,0), and from (5.10.29), (5.10.21), (5.10.24) we readily 
obtain 


(x|p) = as %) Yorn (B) (zo |p, £0). (5.10.32) 


The addition theorem of spherical harmonics in (5.3.67) and the expres- 
sion for the transformation function (x|p) in (2.4.8), allow us to write 


55 (i= p) = 3> PY py &-) (col, 4,0). (5.10.33) 


At 
£=0 


On the other hand, the orthogonality relation (5.3.51), and the definition 
(5.3.30): Yeo (0,0) = ./(20 + 1) /4a Py (cos @) give 
1 . 
(zo|p, £,0) = an f d (cos @) Pe (cos 8) e'”? 08 9/P (5.10.34) 
-1 
Thus we are led to introduce a function of rp/h which defines the amplitude 
(zo |p, €,0). This function is referred to as a spherical Bessel function of order 
é and may be defined as an integral over a Legendre polynomial: 


: pr = 20 . irpcos 0/h 
i ( ~) == fa (cos ) Pr (cos 6) (5.10.35) 


From (5.10.34) and (5.10.35), (5.10.33) becomes 


exp (5x . p) = geet 1) if jc (pr/h) Po (K+ p). (5.10.36) 
The identity 
ol = foi) Po (5.10.37) 
x; = x 3 . 
DY Oni 
allows one to write 
_ w) d?p . se da 
Gl = So 20+ 1) [BS i (ort) PRB) (PI (5.10.38) 
ri (27h) 
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and from (5.10.30), (5.3.66) 
(x|p,€,m) = 4mi® je (pr/R) Yo m (®). (5.10.39) 


As an equivalent expression to (5.10.38), (5.10.39) gives 


Co Co 2 
p dp ae oe 
(x| = | —— Ani® jo (pr/h) Ye m (X) (p, £,m|. (5.10.40) 
s 0 (2nh)° 
The normalization conditions (x’|x) = 6°(x—x’) (see (2.4.14)) and 


(5.10.17), lead from (5.10.40) to infer the closure relation 
op . . tTO(r—1" 
[icons jet) = BS. (5.10.41) 
0 r 
Arbitrary Spins 


Consider a particle of spin s, with projection of spin along the z-axis given 
by fio, and momentum pg (see (5.10.1) along the z-axis. We denote such a 
state by (po, |. That is, 


(Po, 7] P = Po (Po; 2| (5.10.42) 
(Po, 0| S° = ho (po, 01 - (5.10.43) 
As in (5.10.4), the state 
i T 
(Po, 4] lexp (2m: L)| (5.10.44) 


where L is the orbital angular momentum, describes a state of momentum 
p, as given in (5.10.3), and projection of spin along the z-axis equal to ho. 
To see this, note that S* commutes with L: 


: T : T 
(Po; 7| lexp (Fon: L) 5 (p56) 5" lexp (Fon: L)| 


= ho (po, o| lexp (00 . L)) . (5.10.45) 


Also (5.10.6) holds with J replaced by L since P and S commute. Hence we 
conclude that 


(Po, 9| lex (an: )} = (p,o|. (5.10.46) 


As in (5.10.13) 
(pg,0|L? =0, (5.10.47) 
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We introduce simultaneous eigenstates of the commuting operators P’, 
L?, L, S*, $° which we denote by (p, £,m, s,o|, with the identity defined by 


d 
= = an F|p.£,m, 8,0) (p, £m, 8,0]. (5.10.48) 


From (5.10.46)—(5.10.48), we readily obtain in a similar way as in 
(5.10.25), (5.10.29), (5.10.30), 


20+1 
mel= Soy = Df, (6.9, -4) (p.&m,ol 


= 7 ¥ m(0,4) (p,m, 91 (5.10.49) 


and 
(r,l,m,o|= f a9 ¥z,, (8,8) (pal: (5.10.50) 


One may also combine the orbital angular momentum and spin, using 
Clebsch-Gordan coefficients, to rewrite (5.10.49) as 


(p,o|= > Yom (0,6) (0m; s,0|J,M =m+o) (p,J,M,é,8| (5.10.51) 
Lym,J 


where due to the commutativity of L?, S?, P? with J’, J°, the states 
(p, J, M, £, s| are labelled by p, @, and s as well. 


Arbitrary Spins — Helicity States 


The projection of the spin along the momentum direction of a particle is 
referred to as the helicity. Since 


J-P=S-P (5.10.52) 


as follows from (2.7.6), helicity may be equivalently defined as the projection 
of the angular momentum along the direction of momentum. 
From (5.7.48), we also note the commutativity properties: 


[J,J-P] =0 (5.10.53) 
[J,P7] =0 (5.10.54) 
and 


[J-P,P*] =0 (5.10.55) 
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[P,J-P]=0. (5.10.56) 


According to (5.10.55), we may define a state (pp, A| labelled by py and 
the helicity such that 


(Po; A|P = Po (Po; A (5.10.57) 
(see (5.10.1), p= [pl 


J-P 
(Po, | pee RA (Po, Al - (5.10.58) 


On the other hand, the left-hand side of (5.10.58) is equal to (pg, A| J3. That 
is 


(pp, Al J? = Apa Als (5.10.59) 
As in (5.10.4), (5.10.5), we consider the state 


t 
(po. AI lexp (0 3)| (5.10.60) 

for which (see (5.10.8)), (5.10.3)) 
(Po; A| exp (— jon . 5) P = p (po, Al exp (— jn . 5) , (5.10.61) 


Also from (5.10.53), (5.10.58) 


finds 
(po, Alexp (-;,0n-3) “"* = Ad (po, Alexp (- 0 ; 5) . (5.10.62) 
Pp 


That is, 
. T 
(po. AI lexp (,n-3) | = (p,rl. (5.10.63) 


We introduce simultaneous eigenstates of the commuting operators P?, 
J’, B,I-P 


(p, J, M, | P? = p? (p, J, M, d| (5.10.64) 
(p, J, M, A| 5? = I(T +1) (p, J, M, Al (5.10.65) 
(p, J, M, A| J? = RM (p, J, M, AI (5.10.66) 
(p, J, M, A| a = hd (p, J, M, d| (5.10.67) 


and 
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_pedp” 
1= Sf Fae dM) Mal 

J,M,x 
(5.10.68) 


(p, J, M,A|p', J’, M’, 2’) = (20h 


PP 5s niuar dav. (5.10.69) 


Following a procedure as the one leading to (5.10.49), we obtain 


2I+1, 
(A= >) a 1(0;9, —4) (p, J, M, | (5.10.70) 
J,M 


and upon multiplying this equation by (DUP (4, 0, —¢))", integrating over 0 
and ¢, and using (5.5.52), it is easily worked out that 


MA = = fan (DEi(6.8,-0) (pal. (6.10.71) 


Helicity Versus Standard Spin States 


Consider the spin state (pg, a| in (5.10.42), (5.10.43) and the helicity state 
(Po, A| in (5.10.57), (5.10.58). Clearly, from the just mentioned equations, 


(Po.|] _, = (Po. Al (5.10.72) 


relating the spin (on the left) and helicity states for a particle with momentum 
Po along the z-axis in (5.10.1). 

We apply the operator exp(—idn - J/f) in (5.10.4) /(5.10.5) to both sides 
of (5.10.72), and use the commutativity of L and S to obtain from (5.10.46), 
(5.10.63) 


(p, | ep (— 50: s) = (p,\| (5.10.73) 


for an arbitrary momentum p as defined in (5.10.3). 
From (5.10.49), the left-hand side of (5.10.73) is equal to 


2l+1,_5 i 
20+1 s 

2 > fetta (4,8, -¢)D©(¢, 8, -d) (p, £m, o| 

_ /20+1 7 (s) = 

Ss Db (6,8, o)Dy, (0,9, ¢) 


Lym,o,J 
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x (€,m;s,0|J,M =m-+o) (p,J,M, 2,5 (5.10.74) 


where (€,m;s8,0| J, M =m-+0) is a Clebsch-Gordan coefficient. Due to the 
commutativity of L’, S*, P? with J”, J? the states (p, J, M, , s| are labelled 
by p, € and s as well. 

From (5.10.73), (5.10.70), (5.10.74) 


2.6.0, -4) DOL, 6,-4) 


Lm,o,J 


x (€,m;8,0|J,M =m-+o) (p, J, M, £, s| 


= = a * D($,6,—4) (p, J, M, Al. (5.10.75) 


We finally make use of the identity in (5.5.44) as specialized to the present 
case to read 


Din(, 9,9) DX (6,8, —9) 
= => 0,0; 8, A| J, 4) DY. (6,0, —@) (J, M|£,m;s8,0). (5.10.76) 
Upon substituting (5.10.76) in the left-hand expression in (5.10.75), and 
using the completeness relation 


S7 (U', M'|£,m; 8,0) (@,m;s,o|J,M =m+o)= 67 30M'M 


m,o 


we obtain from (5.10.75) 


+1. 
= DUG, 8, 9) (6,058, 1.2) (9, TM, 6 s| 
JIM, e mn 
2741 
= i Dr (6,0, -4) (p, J,.M, | = (p, Al. (5.10.77) 
J,M 


The latter leads to 


2é+1 
2F+1 


(J,M, | J,M,£, 8) = (£,0;8,\|J,d) (5.10.78) 


where (€,0; 8, A| J, A) is just a Clebsch-Gordan coefficient with M = X. 
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5.10.2 Two Particle States 


Consider two particles ?,, 2. Each of the particles may in turn be a cluster 
of particles (§2.3, §2.7), consisting, say, of n, and nz particles, respectively. 
The total angular momentum J of ®;, ®: may be written as (see (2.7.29)) 


J=XxPt+Jay +I) (5.10.79) 


where 
Ma 
Jia) = (Xq — X) x (P. = “P) +8), @=1,2 (5.10.80) 


and 81), Sig) denote the spins of ?,, . That is, J(q) is the total angular 
momentum of the a*” cluster in the center of mass system of nq the particles. 
Here P;, P2 are the linear momentum operators associated with the particles 
P,, Po. 

The momentum operator 


P.= M2P, _ M,P2 
eS M ’ 


M=M,+ M2 (5.10.81) 


is the relative momentum of P,, 2, as appearing in (2.7.40), where M,, Mz 
respectively, denote the sum of the masses of the n1, nz particles, and may 
be also rewritten as 


P,.= Mp @ wee! P) at (P. aP). (5.10.82) 


M M M M 
where P, — (M,/M)P are the momenta relative to the center of mass, with 
P=P,+Pz». (5.10.83) 


In a coordinate description, the total angular momentum J of particles 
may be rewritten as (§2.3, §2.7) 


J=Xx P+n x (-ifV,) + Sa) + Sa) (5.10.84) 
where MX MX 
x= ae (5.10.85) 


with X,, X_ denoting the center of mass positions of P, and P, 
n= X) — Xp. (5.10.86) 


We consider the problem of defining states for two free particles P,, Po. 
The n; particles making up ?; may, however, be interacting with arbitrary 
interactions. Similarly, the ng particles, making up ®; may have arbitrary 
interactions. This is typical in a scattering process (see Figure 5.3), where 
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So mip = Mi S/ moe = Mo 
B B 


Fig. 5.3. Two clusters of particles are initially widely separated and non- 
interacting before they merge together in a scattering process leading finally 
to a variety of possible outcomes. The intra-clusteral interactions (§2.5) are 
not necessarily vanishing. 


two clusters of particles are, for example, initially widely separated and non- 
interacting before they merge together, interact and produce finally a variety 
of possible outcomes. 

Let (p,,01), (P2,02) denote the momenta and spin projections (in units 
of fi) along the z-axis of the particles ®,, 22. We use the normalization 


(P1015 P2,03|P1,01; Pe, 72) = (27h)°d? (py — py) 0° (Po — P2) 5o401 5 e402 
(5.10.87) 
for the two particle states (p,,01; Po, 72|. 
From the formal definition of the Dirac deltas, it is easily verified, by a 
change of variables, that 


6°(pi, — p1)d° (ps — po) = 5° (py — pr)d*(p’ — p) (5.10.88) 


where 
Pr=Pi +P» (5.10.89) 


is the total momentum of the two particles (i.e., the center of mass momen- 
tum), and 


Mop, — Mp2 
M 


M2 M, My Me 
== (v; Pr) Ti (v. urPr) (5.10.90) 
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From (5.10.87), (5.10.88), we may re-parameterize the two particle states 
as (Pp, P, 01, 2| with the normalization 


(Pr, P’, o1; o IPr, p71, 72) = (27h)° 5 (py ~~ Pr) 5° (p’ = Pp) Detox Oohon- 
(5.10.91) 
Finally, we may introduce the states in the center of mass of the two 
particles P,, Po 


(P, 01, 72| = (Pr, P, 71,92) (5.10.92) 
Pr=0 
Two Spin 0 Particles 
In a center of mass frame, pp = 0, py = —p, and 
P=P, (5.10.93) 


for the relative momentum. The two-particle state (p| in the center of mass 
frame is obtained from a state (pp|, with relative momentum along the z-axis, 
by 


i 
(P| = (Pol lexp (- jn . L,) (5.10.94) 
where n is defined in (5.10.5), leading as in (5.10.29), (5.10.30) to 
(p] = 5° Yem (B) (p, £m (5.10.95) 
Lym 
(r,t,m| = fae ¥;, (8) (Pl (5.10.96) 


The normalization conditions are as before as given in (2.4.15), (5.10.17). 


Two Particles of Arbitrary Spins 


Consider two particles of spins s1, 52 and momenta p,, p, respectively. In 
the center of mass frame (5.10.93), (5.10.94), and due to the commutativity 
of S,, Sg with L,,we have 


: T 
1 
(p, 71, 02| (s8, + Sey) = (Po, 01, 72| (s8,) + Sty) lexp (Fan: L, 


. tT 
i 
= h(o1 + 02) (Po, 71, 02| lex» (;,0 . L,) 


= h(o1 + 92) (Pp, 01, 09]. (5.10.97) 
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Also 
(Po; 01, 72| L? = 0. (5.10.98) 


Equations (5.10.97), (5.10.98) then lead, as for (5.10.49), (5.10.50), to 


20+1 
(Pr orsoal = Dy Ge Pon ($,0, —¢) (p, £m, o1, o2| 


= =D) (p, l,m, 01, 02| (5.10.99) 
and 
(p,,m,01,02] = [a0 ¥ii(B) (B.01,021- (5.10.100) 


Here (p, €,m, 01, 72| are simultaneous eigenstates of the commuting operators 
P*.L7, i , Shy , Si (2)° The normalization conditions are 


- 04,05|P, 01, 02) = (27h)°5° (p’ — p) bo40,5o402  (5-10.101) 


6(p' — p) 


(p',f,m',01,05|p,0,m, 01,02) = 2 6000m'm5a! 15a: (5.10.102) 


For example, for the proton: ®, pion: 7 system, we may write for (5.10.99), 


(Pp, 2) +7 p)| = = tes) (p, &, m ,0| 


L,m,J,M 


where M =m-+o and 


1 3 
=-,7,... 10.104 
J 579 (5.10.104) 


@=Ji-. (5.10.105) 


For identical particles, we have to consider in our description the inter- 
change of particles ®,, 22 where now, in particular, s; = so = s. 

To the above end, in the center of mass frame, if (p,o1| corresponds to 
one particle, then (—p, a2| corresponds to the other particle. 

From (5.10.49), (5.3.43) 


(-p, 0] = S>(-1)'Vem (8, 4) (p, £m, o| (5.10.106) 


Lym 


and, in particular, from (5.3.50) 
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(-po, 0] = So(-1)'1/ at (p, £,0,0|. (5.10.107) 


£ 


On the other hand, we also explicitly have (see (5.2.13), (5.2.17)) 


/20+1 
(Po, 7| exp (-;x12) = y = dl (- ) (p, £,m, o| 
2 i 
= AN ae a £ (p, £,0,0| . 


That is, 
(—Po, 0] = (Po, 7] exp (-;722) (5.10.108) 
Similarly, (5.10.107), leads to 
(Po, 7| = (—Po, 7] exp (-;722) (5.10.109) 


Hence for the interchange of the two particles, we have 


(Po, 01 (—Po, 72| —= (—Pp, 72| (Po, O71 | 


= (pp, 02| (—Po, o1| exp (-j722) (5.10.110) 
where now L? denotes the total orbital angular momentum, in the 2?- 
direction, of the two particles in the center of mass system. 

According to the spin and statistics connection to be studied later in §16.9, 
we have to symmetrize for integral spins: s = integers, (bosons) and, anti- 
symmetrize for half-odd integer spins (fermions): s = (2k+1)/2,k =0,1,.... 

We are thus led to the states 


[(Po, 01, 72| + (—1)?* (—Po, 72, 711] 


iv 
(Pp; 1; F2| + (—1)?* (Pog, 02, o1| exp (-F22)| . (5.10.111) 


To obtain motion in an arbitrary direction p, we apply the operator in 
(5.10.4) to (5.10.111) leading in a straightforward manner from (5.10.99), 
(5.3.43) to 


[(P, 01, 09| + (-1)*8 (=p, 02, 04|] 


= S- Yem (p) to l,m,o1, 02| a (—1)?*t? (p, £,m, 02,01| : (5.10.112) 


Lym 
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Two Particles of Arbitrary Spins — Helicity States 


The two particle helicity states are obtained as for the single particle 
states in (5.10.70) by noting that 


(Po; 1, d2| yp = AQ = A2) (Po; a1, A2| (5.10.113) 


where A, is the spin projection of the first particle along the +z direction, 
corresponding to the state (pp, A1|, and Az is the spin projection of the other 
particle along the —z direction, corresponding to the state (—pp, A2|. 

From (5.10.113), (5.10.70), by using in the process (see (5.10.63)), 


(p, \1, A2| = (Po, Ar, Aa| exp (— jon ; 5) (5.10.114) 


we then obtain 


2 1 
(Aral = Dy ee D”(¢,6,—) (p, J,M,\1,A2| (5.10.15) 


where 


ee ee (5.10.116) 
We also note from (5.10.7), (2.1.4), 


exp (- jn 5) J’ exp (0 3) = J*R* (6,n) (5.10.117) 
and consequently ; 
J*R* (6,n)p' = 3° (5.10.118) 
giving from (5.10.114) 
(p, A1, A2| J» p = A(Ay — Az) (p, A1, A2| - (5.10.119) 


To treat identical particles, we first apply the operator exp (—inS? /h) 
o (5.10.108) and use (5.10.107). The expression on the right-hand side of 
(5.10.108) then becomes 


i = 1 
(Po, 7| exp (-¥-") = » = dye ( T) un TT) (p, £,m,o'| 


Lym,o’ 


ee +1 
(—1) a So(-1f Ar (Pp, é, 0, —o| 
L 


= (-1)**? (-pp, -¢| 
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iv 
= (—py,a|exp (-s") (5.10.120) 


leading to the equality 


in 
(-1)*t? (po, —o| = (pp, o| exp (-|-"). (5.10.121) 


Upon applying the operator exp (—inJ?/h) one more time to (5.10.121) and 
using (5.20) leads to 


(po, o| = (-1)7**? (—po, —o| exp (-Z-") . (5.10.122) 
By definition of helicity states, 
(Pp; Al J? = AA (po, AI (5.10.123) 
(=Po, Al (—J*) = hA(—po, Al (5.10.124) 
For a vector operator V, it is also easily checked from (5.7.42) that 
exp (-j2") V¥ exp (527) = —VF + 26h y?, (5.10.125) 


Hence from (5.10.121), (5.10.123)-(5.10.125), we have 


2 in 
(—1)** (—po, Al = (Po; Al exp (-2) (5.10.126) 
for the corresponding helicity states. 


Similarly, from (5.10.122)—(5.10.125), for the corresponding helicity states 
in (5.10.122) we have 


iv 
(Bo, A] = (17 (po, Alex (FE?) (5.10.127) 


Accordingly, for the interchange of the two identical particles with 51, s2 = 
(Po, A1,A2| > (—Po, A2; A1| 
= (—1)- 7882 (pp, Ag, Xa| exp (-2) (5.10.128) 


where now J? is the total angular momentum of the two particles in the 
center of mass system. Hence 


[(Po: Ai, Ag| a (—1)** (—Po, A2, Ail] 
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it 
= | (Po, 1, A2| + (1) (Po; A2, A1| exp (-7-")| (5.10.129) 


NS Ai — Ag. Upon using, the expression a’, 2 u(—™) = (-1)7~*6uz,a as given 
n (5.2.17), we obtain from (5.10.129), (5.10.15) 


hs Ar, A2| + (=1)?8 (=p, 2, Ar] 


27 +1 
= DY ($,9,-4) [(p, J, M, x, Ao] + (-1)7 (p, J, M, do, Aa] 


(5.10.130) 


where we have used the fact that \ = A,—2 is always an integer for both inte- 
ger as well as for half-odd integer spins s (= s1 = s2), and hence (—1)~?4 = 1. 


Helicity Versus Standard Spin States 


We follow the procedure developed for single particle states. The corre- 
sponding equation to (5.10.72) 


(Po, 71, F2| => (Po, A1, A2| (5.10.131) 


o1=A1 
og=—A2Q 


relates spin (on the left-hand side) to helicity states. Upon applying the 
ae exp(—idn-J/f) in (5.10.4) to (5.10.131), the corresponding equation 
to (5.10.77) becomes 


p2é+1 1 
oS + DY? (4,8, —$) (81, A1; $2, —A2|8, A) (6,0; 8 ,A| J, A) (p, J, M, £, | 


J,M,e 


2 ly 
J,M 


from which we finally obtain 


/2€+1 
(J, M, Ax, A2| J, M, 2, s) — sio4 (81, A13 $2, —A2|8, ») (€,0; s, A| J, A) 
(5.10.133) 


where (81,1; 82, —A2|s,A), (€,0;s,A|J,A) are just Clebsch-Gordan coeffi- 


cients. 


Problems 


5.1. Derive the relations in (5.1.26), (5.1.27) by the repeated applications 
of the operators J_, J; as well as the relations in (5.1.28), (5.1.29). 


5.2. 


5.3. 


5.4. 


5.5. 


5.6. 


5.7. 


5.8. 


5.9. 
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(i) Obtain the expressions of L?, L’, L+ in spherical coordinates as 
given in (5.3.7)—(5.3.9). 
(ii) Evaluate the commutators: [L? sx, [L’ pl, [L? , f([x|)] where 
f (|x|) is a function of |x|. Also show that x-L=0,L-x=0. 
(iii) Obtain a lower bound to the product 
£, m) 


(é,m| (1) |e, m) (¢ m | (i); 

and interpret this result. 
Upon the comparison of the matrix elements of rotation by the Euler 
angles in (5.2.4) with that in (2.1.4) obtain the relationships spelled 
out in (5.2.6)—(5.2.8). 
Use the definition of dY , (8) in (5.2.13)/(5.2.10), for the matrix el- 
ements of finite rotation by the angles 3 about the x? axis, to prove 
(5.2.15)—-(5.2.18). 
Verify explicitly the transformation rules of the spherical harmonics 
in (5.3.43), (5.3.44) under space reflection and complex conjugation. 
Use the transformation law of a vector x = (x!,2?,a?), (cl =2,2? = 
y,0 = oe in (5.2.3) under a rotation of a coordinate system by the 
Euler angles, to find the corresponding transformation law for the 
transformation of the spherical harmonics Yj, Yi,0, Yi,-1. [Hint: Use 
the identities in (5.3.52)—(5.3.54).| 
For the density operator p in (5.4.33) for spin 1, derive (5.4.35), and 
also obtain the expression for Tr [pS"S/]. 
Derive the equalities (5.5.26), (5.5.27) relating the Clebsch-Gordan 
coefficients. 
Consider two particles of spins S(1), $2) of equal masses. 
(i) Show that the orbital angular momenta of the two particles rela- 

tive to their center of mass are equal, i.e., 


pecan (8) (eB) aCe 8 


by referring to (2.7.39), (2.7.40), where 


X=(X,+X)/2, P=Pi+P, 


and hence each particle carries angular momentum L,./2 in the no- 
tation of (2.7.40), where L, is the total orbital angular momentum 
in the center of mass. 

Consider the deuteron (D) as a bound-state of a proton (p) and a 
neutron (n) both of spin 1/2, with the masses taken approximately 
equal. Upon defining the respective magnetic moment vector op- 
erators 


(ii 


Sy 


IphNn JnLN JDEN 
Mp = =. Sp, Hn = = Sn, tp = ——J 
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5.10. 


5.12. 


5.13. 


5.14. 


5.15. 


5.16. 


5 Angular Momentum Gymnastics 


where J is the internal angular momentum of the deuteron (see 
(2.7.39), (2.7.40)) 
J=L,+8,+S, 


show that " 
Lb r 
Lp >= a (5 + GpSp + nSn . 


Here xy = |e|h/2mpc is the nuclear magneton, and gy = 5.58, gn = 
—3.83 are the g-factors of the proton and neutron, respectively. 
Consider the deuteron in the state 7 = 1 and (S, + §,,) with cor- 
responding s value equal to 1. Use the Wigner-Eckart Theorem in 
the form given in (5.7.35) to evaluate approximately the magnetic 
moment Up = gpltn of the deuteron corresponding to ¢ = 0 and 
é = 2. [Experimentally, gp ~ 1.72 and the deuteron is predomi- 
nantly, about 98%, in the @ = 0 state and about 2% in the @ = 2 
state.] 

The Hamiltonian of the dipole-dipole interaction of two spin 1/2 par- 
ticles each of magnetic moment js separated by a vector a is 


wa 


(iii 


O1°aodg-a 

ee ee 

a a 

Suppose that a = a(0,0,1). Show that the singlet and triplet states in 

spin space are eigenvectors of H and find their corresponding eigen- 
values. 


. Suppose that the initial spins of the particles in Problem 5.10, at time 


t = 0, are along the x-axis, i.e., they are in the state (1 Ie (1 1), /2 

(see (5.4.9)). 

(i) Find the state of the system at any time t > 0. Consider a to be 
fixed. 

(ii) What is the probability that the system is found in its initial state 
at time t? 

Find an orthogonality relation for the D-functions in (5.5.51) for the 

cases when j is a half-odd integer. 

Establish the symmetry relations in (5.6.25)—(5.6.27), and the equali- 

ties in (5.6.28), (5.6.29) of the 3-7 symbols. 

Introduce the spherical vector components V(m) of the gradient V, 

and evaluate explicitly the commutator [L(m1), V(mz2)],where L(m) 

are the spherical vector components of the orbital angular momentum. 

Evaluate the reduced matrix element (¢; ||r||€2) of the unit vector Ff, 

for the position vector x = rr. [Hint: Use (5.3.52)—(5.3.54), together 

with (5.8.50) and (5.6.28), (5.6.29).| 

Deviation from spherical symmetry of a charge distribution, such as 

of the nucleus, is determined from its quadrupole moment. Quantum 

mechanically, the corresponding operator may be defined by Q?2 = 

327 — r?, 


5.17. 


5.18. 
5.19. 


5.20. 


(i) 


(ii) 
(iii) 
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Show that Q3 = ./167/5r? Yoo (0,¢). Thus, up to the factor 
\/167/5r?, you recognize this operator as the zero*” component 
of a second rank tensor and hence the notation Q3. 

Write the matrix element ( j,m |3| d; m) in terms of the reduced 
matrix element (j||Q?]||j). 

Show that for 7 > 1 


m? OPD go rg 
im) = (957 ay — 4) eapniy Cale 
introducing (27 + 1) quadrupole moments associated with the 
(27 + 1) possible orientations of the internal angular momentum 
J of the body written in terms of the one oriented along the 
z-axis. The angles 6,, of the orientations are given by cosOm = 
m//j(j +1). For 7 large, one has almost a continuous distribu- 
tion of orientations. From the result in (iii) you may then infer 
that for 7 large, one obtains its classical counterpart 


(j,m|Qo a) 


Q (cos ¥.) = ; (3cos? J. — 1) Q(1) 


where ¥, defines the angle between the symmetry axis of the body 
and the z-axis. 


Evaluate the matrix element on the left-hand side of (5.8.39) directly 
from the general formula (5.8.47). 

Verify the symmetry relations of the 9-7 symbols stated below (5.9.15). 
Use (5.10.30), (5.3.66) to establish the equality in (5.10.39) for 
(x|p,£,m). 

Follow a procedure similar to the one giving (5.10.49) to establish the 
equality in (5.10.70) for (p, A| expressed in terms of the helicity. 


6 


Intricacies of Harmonic Oscillators 


This chapter deals with several problems and intricate details associated 
with the harmonic oscillator potential. After solving the eigenvalue problem 
in §6.1, we study transitions that may be caused between its energy levels in 
the presence of a time-dependent external, i.e., classical, force coupled linearly 
to the position observable at zero and finite temperatures in §6.2 and §6.3, 
respectively. §6.4 deals with the construction of the so-called Fermi oscillator 
in analogy to the Bose one in 86.2. In 86.5, we combine the Bose oscillator to 
the Fermi one to construct supersymmetric theories and consider underlying 
supersymmetric transformations in the light of the analysis carried out in 
§2.9. In the final section (§6.6), the coherent state of the harmonic oscillator 
is constructed and relevant details are developed which allow one to compare 
the quantum mechanical problem with the corresponding classical one in the 
most natural way. 


6.1 The Harmonic Oscillator 


The Hamiltonian of the v-dimensional harmonic oscillator is defined by 
2 
Hee 4s ae: (6.1.1) 
2m 2 

Such a quadratic interaction may be considered as a perturbation, about an 
equilibrium point x = 0 (VV (x) lo — 0) of a spherically symmetric potential 
V(x) in R’, up to an additive constant. The harmonic oscillator Hamiltonian 
is, however, in its own right, a very useful one for various investigations and is 
the prototype of quantum field theories involving an arbitrary large numbers 
of so-called degrees of freedom. 

A lower bound to the spectrum of H in (6.1.1) is readily found by choosing 
g(x) = wh, with w > m, in (3.1.8) giving after re-arrangement of terms, 

vhw ( h? 
< 
2 


1 
YT 5m a (6.1.2) 
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Actually, the lower bound viw/2 coincides with the exact ground-state 
energy of H. To see this, introduce the trial normalized wave function 


®(x) = er exp (-S) (6.1.3) 


and note, by the very definition of the ground-state energy Eo, that the 
expectation value (®|H|®) cannot be less than Eo. That is, from (6.1.2) 


vw 
—< 


5X Eo < (®|A|®). (6.1.4) 
We explicitly have 
h2 2 h2 
(H®)(x) = (-—o2x? + x? + = va | &(x) (6.1.5) 
2m 2 2m 


and upon choosing a = mw/h, we obtain 


(H®)(x) = Ue (x) (6.1.6) 


and from (6.1.4) that 
hw 
(oe a (6.1.7) 


From Theorem 4.1.1, we may also infer that H has only a discrete spec- 
trum. 

To study the eigenvalue problem of the Hamiltonian (6.1.1), it is conve- 
nient to introduce the operators 


1 
xe,/x PS (-ihV) (6.1.8) 
h muh 
and rewrite H as k 

) 
H = —(P?+X?’). 6.1.9 
(p+ x?) (6.19) 

We also define the operator 

1 

a= —(X+iP). (6.1.10) 


V2 


The following are easily established (see Problem 6.1): 
[x Pt) 6! (6.1.11) 


H = hw (al-a+<) (6.1.12) 


la’) = 69, [a’ a’ ] =0 (6.1.13) 
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[H,a’] = —hwa' (6.1.14) 
[Ha] = +hwa'", (6.1.15) 


Since for a normalized state |w), (| H|w) = hw |lad||? + hw /2 > hwv/2, 
(6.1.12) alone shows that H is bounded from below by fwv/2 as expected. 

To solve the eigenvalue problem, we consider first the one-dimensional 
case vy = 1. The several dimensional case may be directly inferred from the 
one-dimensional one. 

To the above end, the eigenvalue problem reads 


H \a) =ala) (6.1.16) 
and from property (6.1.14), 
H (a)¥ la) = (a — fwk)(a)* la) (6.1.17) 


for k = 0, 1, 2,.... 
The boundedness of H from below implies that for a given a, there must 
exist a non-negative integer k and a Qmin such that 


Qmin = a — hwk (6.1.18) 
and 
a|Qmin) = 0. (6.1.19) 
The latter equation implies that 
ala|amin) = 0 (6.1.20) 
and hence hus 
HT |Qinin) = ~~ |Qmin) (6.1.21) 


where we have used (6.1.12) with v = 1. That is, 


h 
ala > (6.1.22) 


and the eigenvalues a in (6.1.16) are given from (6.1.18) to be 


1 
a=nw(n+5), n=0,1,... (6.1.23) 


fory=l1. 
For the subsequent analysis, we label eigenstates |a) by the non-negative 
integer n instead, i.e., we have 


H|n) = hw (n+ 5) In) . (6.1.24) 
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From the property (6.1.15), 
1 
H (al In)) = hw (n+ 1+ 5) (al |n)) (6.1.25) 


and infer from (6.1.24) that 
al |n) = en |n +1) (6.1.26) 


where the constant c,, depending on n, is to be determined. 
To the above end, we note from (6.1.13), (6.1.12), with v = 1, that 


HH 1 
Rei tpiess oe 6.1.27 
aa! =alat+ tS ( ) 


and hence upon taking the norm squared of (6.1.26), 


H 1 
len|? = (nl & + 5) |n) (6.1.28) 
giving 
Ch =Vn+1 (6.1.29) 


with a phase convention, where we have used (6.1.24) and the normalizability 
of the states |n). 
Equations (6.1.26), (6.1.29) give the recurrence relation 


t 
a 
providing the solution 

(at)" 


On the other hand (6.1.19), with (6.1.22), imply from a|0) = 0 that 


d mw 
where Wo(x) = (#|0), giving the (normalized) solution 
mw \1/4 mux? 
Wo(x) = (=) exp (- 5h ) (6.1.33) 


coinciding with the one in (6.1.3) for v = 1, with a = mw/h. 
For the first excited state ~(x) = (x|1), we obtain from (6.1.30), with 
n = 0, and (6.1.33) 


mw \1/4 iD mw 0 mux? 
vile):= (“) 2mw ( A 5.) exp (- 2h ) (6134) 
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or 
2 


mu \1/4 Ww Wr 
Vi(2) = (=) V2 — x exp (-“) : (6.1.35) 


We introduce the dimensionless variable (v = 1), using rather a conven- 


tional notation 
p= (a (6.1.36) 


to rewrite (6.1.33), (6.1.35) as 
(6.1.37) 


mw \1/4 p*\ (-1) afd e 
= (— ane lee 6.1.38 
( wh ) ae ( 2 ) J2 i dp ( ) 
To obtain the solution for all n, we proceed by induction. As an induction 
hypothesis, suppose that for some k, and n= 0,...,k, 


Vn(z) = (ey ew [" (Z) "| (6.1.39) 


which obviously agree for k = 1, ie., for n = 0, 1, with the expressions in 
(6.1.37), (6.1.38), respectively. 
Hence from (6.1.30), (6.1.10), 


ae (—1)* 1 


ve = (Ti V2kk! \/2(k + 1) 


erg 
. Qk+1(k +1)! 


eye oe 
e? FT e ? (6.1.40) 


thus establishing the validity of (6.1.39) for all n = 0, 1, 2,.... 
The expression within the square brackets in (6.1.39), multiplied by 
(—1)”, defines the so-called Hermite polynomials: 


H,,(p) = (-1)"e”" (=) e? (6.1.41) 


Ho(p)=1, Hi(p)=2p, He(p) =4p? —2, .... (6.1.42) 
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The orthonormality condition 
Onnt = (n|n') = fou Un (2) Wn (x) (6.1.43) 
as follows from the eigenvalue equation (6.1.16) and the normalizability of 


the states |r) in (6.1.30), (6.1.31), ¢mply the orthogonality property of the 
Hermite polynomials (6.1.41) from (6.1.39) and the definition (6.1.41): 


fe dp e-”” Hn(p) Hw (p) = San V2”! (6.1.44) 


as written in terms of the dimensionless variable p in (6.1.36). 
The following matrix elements should be noted 


(n! |a|n) = 4/ _ [vn + 15m) n+1 + Vi bn n-1| (6.1.45) 
(n|z?|n) = = (n+ 5) (6.1.46) 


h 
(n! [pln) = if S> [Vin FT bn net — VR Sn' nt] (6.1.47) 


1 
(n|p?|n) = hmw (n+ 5) : (6.1.48) 
These equations, in particular, imply that 
2 2 2 are cs 
o;(n) o,(n) = ¢ + 5) > Ae (6.1.49) 
where 5 
o2(n) = (nj (@ — (n|x\n) )° |n) = (n|2?|n) (6.1.50) 


and similarly defined for 07 (n). We note that the presence of the so-called zero 
point energy fiw/2 in (6.1.24), as reflected on the right-hand of the inequality 
(6.1.49) is consistent with the Heisenberg uncertainty principle. 

For the v dimensional case, with H defined in (6.1.1), we may immediately 


infer from the additive nature of V? = ye (0/0z' a x? = = (2*)” and from 
i=1 i=1 
(6.1.24), (6.1.39), (6.1.41), that 
H\ny,...,m) = fw (m1 +...+ny+ =) |1,.-+, My) (6.1.51) 


and with (X|Wn1,....nv) = Virgenes (x), 


(6.1.52) 


—p?/2 
I jem mat! 
where p = ((1,..-, Pv), P=1/mw/Ax. 
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6.2 Transition to and Between Excited States in the 
Presence of a Time-Dependent Disturbance 


We add a linear coupling of the position variable x to a time-dependent, 
c-function, F(t) in the harmonic oscillator problem in one dimension, and 
define the Hamiltonian 


2 1 2 
H(t) = 2 + <mw*a? — ,/ a F(t) (6.2.1) 
2m 2 A 


where the factor (2mw/h)/? has been introduced for convenience, and up 
to this proportionality factor, F(t) represents an external force which is also 
referred to as an external source. 

In terms of the annihilation and creation operators a, a’, (6.2.1) takes the 
form 


A(t) = hw («ia + 5) — F(t) (a+ a’) ; (6.2.2) 


The Hamiltonian in (6.2.2) is a prototype of field theories in “zero” dimen- 
sion of space, in the presence of an external source F(t), and where (a + a‘) 
is the “field” at time t = 0. 

In the present section, we investigate the problem of transitions from the 
ground-state to excited states and transitions between different states, in 
general, due to the disturbance provided by F(t), not barring, however the 
possibility that the system may stay in its initial state. 

We choose F(t) to vanish for t < T;, t > T2 for some T,,T2,T2 > T,.1 
Before F(t) is switched on, i.e., for t < T,, we consider the system to be in 
the ground-state, and choose for the initial state 


|(T1)) = |0) = |0_) (6.2.3) 


borrowing a notation |O_) often used in field theory. 

After F(t) is switched on, and then later on when it ceases to operate 
after the time 7, the system may, or may not, be found in some excited 
state. 

To investigate such possible transitions, we solve the Schrédinger equation 


nS (W(d) = HCE) WO) (6.2.4) 


for |¢)(T2)), with the initial condition in (6.2.3). 
Equation (6.2.4) is readily solved by making the ansatz 


W(t) = exp |-+ (a()a" + (4))] [0-) (6.2.5) 


' Actually, one may introduce such a function as F(t) which together with its 
derivative are continuous, and vanishes at some given points, see, for example, 
§8.7, (8.7.21), §12.6, (12.6.33). 
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where ((t), y(t) are c-functions satisfying 
BT) =0, (Ti) =0. (6.2.6) 


Upon substitution of (6.2.5) in (6.2.4), using the fact that a|0_) = a|0) = 
0 and the identity in Problem 6.6, we obtain 


[ate)at + 4(@)] |0-) 


= ce (5 a ate! F(t) ( + a(t) “')) |0_) (6.2.7) 


which with the initial conditions in (6.2.6) gives the solutions 


A(t) = -e"' / dt! ci" F(t’) (6.2.8) 
ni) = 2-7) 


a i dt! i dt” et") F(t”) @(t — t”) O(t” —t') F(t’) (6.2.9) 


where we have extended the integrations in (6.2.8), (6.2.9) beyond the range 
T, <t< T> since F(t) vanishes there, i.e., it ceases to operate. Note that the 
second term on the right-hand side of (6.2.9) vanishes for t = T, in addition 
to the first term, and recall also the property of the step function 


d 
—O(t—t”) = 6(t-t”). (6.2.10) 
dt 
Hence from (6.2.5)—(6.2.9), the ground-state persistence amplitude, nor- 
malized to one for F(t) = 0, is given by 


(0|%(72)) 


(410) p= 
"owt ],._ 


_ _(lexp [-in(72)/A][0) 
(Ol exp [-in(T3)/Al [0) | 


(6.2.11) 
F=0 


using the fact that (0| a! = 0, and we have 


1 cog ce : Ww , 
(04|0_) » = exp l-=f at" f dt’ e#—-*) F(t”) @(t" — t') F(t’) 


(6.2.12) 
where we have set O(T> — t’’) = 1, since F(t”) = 0, for t’” > Tp and have 
used the convenient notation (0, |0_),, for the amplitude in question. The 
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denominator in (6.2.11) simply cancels out the phase exp [—iw(T2 — T;)/2] 
due to the zero point energy (see (6.2.9)). 

Given that the system is initially in the ground-state prior to the switching 
on of F(t), (6.2.12) leads for the probability that the system will remain in 
the ground-state after F(t) is switched on and then after it ceases to operate, 
the expression 


2 2 
[04 0)e[? =e |— = fae" 


x fo F(t") F(t’) cosw(t” — t') O(t” — 2) ' 
a (6.2.13) 


The persistence probability in (6.2.13) may be rewritten in the more con- 


venient form 
2 1 ee —iwt 7 
|(0,|0_),-.| = exp —a/ fh dt e' F(t) (6.2.14) 


which upon introducing the Fourier transform 
Fw) = / dt e' F(t) (6.2.15) 


gives the simple expression 


|(04|0-) p|” = exp (4S) , (6.2.16) 


From (6.2.16) we also obtain for the probability of having any excitation 
from the ground-state due to the disturbance provided by the source F(t) to 


be given by 
c exp ( rer) : (6.2.17) 


To obtain the transition amplitude (n;|0_) from the ground-state to an 
excited state |n+) after a disturbing source is switched on and then off, we 
proceed as follows. 

We write a given source F(t) as a sum of two sources: 


2 


F(t) = F\(t) + F(t) (6.2.18) 


where the source F(t) is switched on after the source F)(t) is switched off. 
Hence directly from (6.2.12), we obtain 


? Such an approach is used in particle production by external sources, see, for 
example, Schwinger (1970). 
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(04+ |0_) p = (04 |0_) p, (0+ |0-) p, 


1 coed O9, . uw ys 
x exp l-zf at" | dve ve “One F(t)| (6.2.19) 


where we have used the fact that F(t”) = 0 when F(t”) 4 0. 
The amplitude in (6.2.19) may be rewritten in the more convenient form 
as 


(04. [0-) » = (040) p, (04 |0_) », exp aS a 


= 55 (04 [0-)p, EOE CME (040) - (6.2.20) 
n=0 : * 


We compare this with the expression obtained from a completeness rela- 
tion, referred to as a unitarity sum, i.e.,° 


(04|0-) > =) (04 |m_) p, (m|n)o (r4|0-) p, (6.2.21) 


n,m 


where after F is switched on, the system may (or may not) make a transition 
to a state |n,). After the source F)(t) is switched off the system will remain 
in the same state until F(t) is switched on. The amplitude (m|n)j arises in 
a force-free interval, and if t, is the time F is switched off and tz is the time 
F» is switched on, then (m| nyo» developing in time with a free Hamiltonian, 
is given by (m|n)q = exp [—iwn (te — t1)] Omn- 

From (6.2.20), (6.2.21), we may infer that for given external forces F(t), 
F'(t): 


i —iwT> w n 
helo a AN (04 |0-)p (6.2.22) 


. / n 
[ici Fw) A] 
Vn 
where TJ, Tj denote the times that sources F', F’ are switched off and on, 
respectively. 
That is, the probability of having a transition from the ground-state to an 


excited state |n;) after an intervening force is switched on then off is given 
by 


(04. |r) pr = (04. |0-) po 


(6.2.23) 


3 Note that the + signs attached to n+, m— in (n+|0-)p,, (04|m-—)p, refer to 
the stages after the source F(t) ceases to operate and before the source F(t) is 
switched on. These amplitudes a priori not known are obtained by comparing 
(6.2.21) with (6.2.20). 
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| (n4|0-) | = LPONIET exp (- Per) (6.2.24) 


where we have also used (6.2.16), and on the average the system, initially in 
the ground-state, will be found in the state (n) = 7 = |F(w)|?/h? after the 
intervening source is switched off. 

To obtain the amplitude for the transition from a state |n_) to a state 
|n’,.) after a given intervening source is switched on and then off, we write 


F(t) = Fi(t) + Fa(t) + Fa(t) (6.2.25) 


where the source F2(t) is switched on after the source F(t) is switched off, 
and F3(t) is switched on after the source F(t) is switched off. 
From (6.2.12), we have 


(04 |O_) = (Ore [Oye (04 |0_) p, (04|0_) p, exp eee! al 


es aS = ap Ee = 


= 55 (04|nt),, (ni |m_},, (r+ 10_) (6.2.26) 


n,n’ 


where in the last step, we have written (0;|0_),, in terms of a unitarity 
sum, as in (6.2.21). In (6.2.26), (n4|0_),, denotes the amplitude that we 
have a transition from the ground-state to a state |n+) after the source F\(t) 
is switched off. The latter state may then make a transition to a state |n’,.) 
after an intervening source F(t) is switched on and then off. We note that the 
signs in (n/, | n_) p, refer, respectively, to the stage before F (t) is switched 


on and after it is switched off. The amplitude (04 | n'_) Ps is similarly defined. 
The amplitude (n/, | a oe 
upon the comparison of the second equality with the first one in (6.2.26) and 
using, in the process, the expressions in (6.2.22), (6.2.23) for the correspond- 
ing sources. 
To the above end, we note the identity 


: pase A] = paw 20) = Es aa 


h A A h h 


a priori, not known will be then obtained 


=e AB) rae) ny rg) 0 F/T 


LT! M! N! 
L,M,N 


(6.2.27) 


Upon setting 
L+M=n, L+N=n (6.2.28) 
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we may rewrite the summand in (6.2.27) as* 
[FS (w)/A" a [iFow)/A)" "GES (@)/A"—" [iF (w)/2)” 
a7 he eae GE. fal 


which from (6.2.26) and the identifications in (6.2.22), (6.2.23) for the corre- 
sponding sources, gives for the amplitude (n‘, | n_) p for a given source F(t) 


(6.2.29) 


operating within an interval from T, to T»,° 


(n'|n—) » = (04 |0_)p Vint n! 


mo”) (LF (w) /A]” 2 e—iwn’ Te ein Ts (i F* (Wy) /A]"" 
(n’ — L)! L! (n— L)! 


(6.2.30) 


In particular the amplitude of making no transition in the presence of a 
disturbing source F' is 


(ns |n_)» = (04 |0_) pay Corr (6.2.31) 


where we have divided by exp [—iwn(T> —T 1] for proper normalization of 
(nz|n_) »  (n|n) = 1 for F = 0, and where (0+ |0_), is given in (6.2.12). 


6.3 The Harmonic Oscillator in the Presence of a 
Disturbance at Finite Temperature 


We consider the system described by the Hamiltonian given in (6.2.1) at 
non-zero temperature T 4 0, where T is not to be confused with the time 
limits Ti. 

Temperature dependence is introduced by averaging the expression for 
the persistence amplitudes (n+ |n_) for all n = 0,1,2,... with the familiar 
Boltzmann factor exp [—hw(n+1/2)/kT], where k is the Boltzmann constant. 
This defines the thermal average 


(G4/G_)2 = chew - (n+ 5) be nec (6.3.1) 


4 This treatment parallels similar, but more involved, methods used in field theory: 
Manoukian (1986b). 

5 Such processes are referred to as stimulated excitations where an initial state 
is already in some excited state, prior to the switching on of the intervening 
sources. 
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involving all possible excitations. Here C’ is a normalization constant deter- 
mined from the normalization condition 


= hw 1 
The latter gives 


hw 
C = 2sinh (sex): (6.3.3) 


The expression for (n+ |n_), in (6.3.1) was obtained in (6.2.31). 
Upon using the notation 


= (6.3.4) 


we may rewrite (6.3.1) as® 


Co n 


(G4|G_)p = (0410-)p [1 sical >) 


n=0 


n! e 7 hwn/kT 
FO (Aye: <(6:3:5) 


The double sum in (6.3.5) may be conveniently rewritten as 


3 enue & + 1] ae (6.3.6) 


n=0 


Upon using the integral 


a= fd “dese NP 


=f af eiv(2—A) (z)” (6.3.7) 


we obtain for the sum in (6.3.6) 


pee ay —_— dy eae 1)]" Pron: 
-[ af o Tony oP liy (2 - zea hw/kT _ r)] ep [zo-M/* 
= fu (2 E - acai _ ) exp [ze M/T 


° This treatment parallels similar, but more involved, methods used in field theory: 
Manoukian (1990). 
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1 x 
~ [Le eT] oeP ae (6.3.8) 


Hence for the thermal average in (6.3.5) we have the expression 


(G4|G_)p = (04|0_) p exp | — — = 4 ; (6.3.9) 


Here we recognize the Bose-Einstein (eho iee: a 1) factor occurring in the 
amplitude. 

From (6.3.9), (6.2.16), we obtain for the probability that the system stays 
in thermal equilibrium, the expression 


vr? |F(w)|? hw 
(G4 1G-)F| = exp |- = coth (=). (6.3.10) 
We note that 
hw 2kT 
coth (sex) —— ( es ) (6.3.11) 


Hence, in particular, for the persistence probability in (6.3.10) at high 
temperatures, one has the exponentially damping expression 


ex@z] SS niegs | (6.3.13) 


kT |F(w) |? 
T-0 Rw ; 


hw =i? 


The amplitude in (6.3.9) may be rewritten in a more convenient form by 
introducing, in the process, an integral representation for the step function 


: oo —iw't 
e(t) = — pws e eho (6.3.14) 


eee 7 : 
27 Jog wi +i€ 


encountered in the expression for (0, |0_),, in (6.2.12), to obtain 


(G1|G_), = exp lz / at’ [ dt” F(t") A,(t" —t;T) Fa)| (6.3.15) 


where 


A, (t;T) = / (6.3.16) 


du! aay (-1) _ 271 d(w" — w) 
oo 2T ° w!—(w—ie) © ele’ /kT _] 


and T denotes the temperature. 
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6.4 The Fermi Oscillator 


In analogy to the Bose oscillator in (6.1.12), for vy = 1, where [a, at] =, 
defining the Bose-Einstein statistics, we define a Fermi-oscillator with Hamil- 
tonian 


1 
H = hw G: ap — 5) (6.4.1) 
where the annihilation, creation operators ap, al, satisfy anti-commutation 
relations: 
{ap , ar} = 0, {aj,,ah} =0 (6.4.2) 
and 
{arab} =i, (6.4.3) 


We note that the zero point energy in (6.4.1) was chosen to be —hw/2 
rather than +fw/2 in contrast to bosons. We will see in the next section 
that when we invoke supersymmetry, alone, implying a symmetry of the 
Hamiltonian for the supersymmetric version of the Bose-oscillator Hamil- 
tonian hw (ala +1/ 2). under the boson-fermion exchange, consistency leads 
to a zero point energy of —hw/2 for the Fermi-oscillator. 

The ground-state |0) is defined by 


ar |0) =0 (6.4.4) 
and the single particle-state by 
|t) = ap |0), ap 1) = (0) (6.4.5) 
with 
H|\0) = -= |0) (6.4.6) 
Hy =+" ha. (6.4.7) 


We note that since (ai.)? = 0, there are no two-particle states. 
A representation of the annihilation and creation operators is given by 


01 — 10 00 
fh O1 +109 0 1 
a= G ae (6.4.9) 


where 01, 02 are Pauli matrices. The ground- and single-particle states may 
be then represented by 
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|0) = (7) [1) = (5) (6.4.10) 


Now we couple the Fermi-oscillator to external anti-commuting sources’ 


n*(t), n(t) as follows 


H(t) = fw («) ar 5) n*(t) ap — ah n(t) (6.4.11) 


where 7*(t), 7(t) anti-commute with ap, at, and 


{n(t) .n(t’)} = 0, {n(t) n° (t')} = 0. (6.4.12) 


To solve the dynamics involved with H(t) in (6.4.11), we follow the pro- 
cedure given in §6.2 for bosons, subject to the constraints (6.4.2), (6.4.3), 
(6.4.12) and the fact that », 7* anti-commute with both ap and al. As for 
the Bose-oscillator, we choose 7(t), 7*(t) to vanish for t < T,, t > T> for some 
Ti, To (To > T\). 

To the above end, the Schrédinger equation reads 


n= W(t) = HCH) [w(0)). (6.4.13) 


By setting |7)(t)) = |y(t)) e*/?, we obtain 
30 : 
ih-5; Ix(t)) = (wap ar —n* ap — apn) [x(t)) (6.4.14) 


By making the ansatz 


Itt) = 2 exp (—Eo(t)ak 0) (6.4.15) 


where ¢(t) is a c-function, p(t) is an anti-commuting c-function, substituting 
in (6.4.13) and using the constraints (6.4.2), (6.4.3), (6.4.12), we obtain as in 
(6.2.8), (6.2.9), 


t 
o(t) = | dt! eC) n(t’) (6.4.16) 


Ty 
p(t) = ae / dt!’ il dt’ n*(t’) Q(t — t”) O(t” — t') ec #O"—ENn(t’)_ (6.4.17) 
with the boundary conditions that ¢(T,) = 0, p(T1) = 0. 
The ground-state persistence amplitude is then given by (see (6.2.11)), 


” These objects are referred to as Grassmann variables as also noted in §2.9. Such 
variables will be studied in great detail in §10.6. 
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1 co 
(0-[0=),, a = exp l-=f dt” 


—Co 


x i aerial (eV OE" — F) Hf) (6.4.18) 


Co 


which is normalized to unity for 7 = 0, 7* = 0, with a persistence probability 


(04. 10-)yne] = exp re) 
nto) nw) 
=1-7*) (6.4.19) 


written in terms of the Fourier-transforms of the sources. In the last step in 
(6.4.19) we have used the anti-commutativity property of 7, 7 
By a similar procedure as in obtaining (6.2.22), (6.2.23), we have 


(n4.|0—) ne = [in(w) ei /A]” (040) pn (6.4.20) 
(04 [2 )ayrgts = (04 10) [inl (w) el /r] (6.4.21) 
where n = 0,1, and hence we have dismissed with the factorial factor (n!)~1/?. 


In analogy to (6.2.31), we also obtain 


(ns |2=)y ne = c a ees (OF Oe (6.4.22) 


for (n+ |n_),,,.,* properly normalized to one for n,7* — 0, where n = 0,1. 
Equations (6.4.18), (6.4.20)—(6.4.22) give all the relevant amplitudes.® 


8 For completeness we note that although the thermal average in the present case 
with only two states is quite formal, the corresponding amplitude is given by 


(G4|G ae = exp Fal cat fe dt” n*( Ase" - eT) n(0)] 
where 
A (4D) — dw! _ia't (-1) Qi d(w’ — w) 
Aer) = fo on e wes ehw!/kT 4 1 


in analogy to the Bose case in (6.3.15), (6.3.16), the derivation of which is 
left as an exercise to the reader. Here we recognize the Fermi-Dirac factor 
[exp(hw/kT) + 1] ~" in the amplitude. 
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6.5 Bose-Fermi Oscillators and Supersymmetric 
Bose-Fermi Transformations 


To construct a supersymmetric Bose-Fermi oscillator which combines the 
Bose-oscillator (§6.1) and the Fermi-oscillator (§6.4), we define the supersym- 
metric generators (§2.9, $4.7) 


Q = hy 2wal, ap (6.5.1) 
Qt = AV 2w ag al, (6.5.2) 


where, together with their adjoints, 
[azar] =0, [an af, =, (6.5.3) 


For convenience, in this section, the Bose operators have been denoted by 
QB, al. 
The generators Q, Q! lead for the Hamiltonian (§2.9) given by 


1 
H= aE {Q,Q"} (6.5.4) 
the expression 
AH = hw (a apt al, ar) (6.5.5) 
where we note, in particular, that 


QQt = 2h?w al, QB AF al, 
= 2h?w al, ag (1 _ al, ar) . (6.5.6) 


The Hamiltonian in (6.5.5) may be rewritten as 
i 1 i 1! 
H = fw | a, ap + 5 + hw | ap ap — . (6.5.7) 


We recognize the first term as the Hamiltonian of the Bose-oscillator and infer 
a zero point energy of —hw/2, rather than of +fw/2, for the Fermi-oscillator. 

The cancellation between the zero point energies of the Bose and Fermi 
oscillators is a very special and attractive feature of supersymmetry. This 
is quite significant when one is dealing with an infinite degrees of freedom 
(v — ov, see (6.1.12)) and similar cancellations occur in field theory which 
would otherwise lead to ambiguities. 

The ground-state of the Bose-Fermi oscillator |0,0) is defined by 


ap|0,0)=0, ap |0,0) =0 (6.5.8) 
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and the state with n boson excitations and m fermion ones, with m = 0,1, 
are given by (see also (6.1.31)) 


) (ai.)” |0,0) . (6.5.9) 


In,m) = 


By using the property 
ap(at)” = n(at,)"* + (al,)"ap (6.5.10) 


one obtains 
H |n,m) = fiw(n + m) |n,m) (6.5.11) 


as expected. 
The generators Q, Q carry out the Bose © Fermi transformations as 
follows. We explicitly obtain 


Qt |1,0) = Av2w ag af, al, |0, 0) (6.5.12) 
or 
Q* 1,0) = AV 2w 0,1). (6.5.13) 
Similarly we have 
Q|0,1) = hV2w 1,0). (6.5.14) 


More generally one has 
Qt |n, 0) = hV2w4/n |n — 1,1) (6.5.15) 
Q |n,1) = AV2wV/n + 1|n + 1,0) (6.5.16) 


and from (6.5.11), we note that the states |n — 1,1) and |n,0) are degenerate 
corresponding to the energy hwn. 

One may introduce interaction terms in the elementary Bose-Fermi oscil- 
lator Hamiltonian in (6.5.5) by adding to it, for example, a Yukawa term, de- 
scribing a direct interaction between bosons and fermions, and a cubic term, 
describing a direct self-coupling of the bosons. To this end, we introduce a 
Bose “field” at time t = 0 defined by 


¢=aptak (6.5.17) 


and consider the Hamiltonian H defined by 


Al 

re (a) ap +ah ar) + Xo ah, ap d+ Ad?. (6.5.18) 
In relativistic quantum field theory, as quantum mechanics with an infinite 

degrees of freedom, a generalization of (6.5.18) shows, in particular, that the 

anharmonic cubic term in the Bose field gives rise to a Hamiltonian which is 
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unbounded from below, that is, its spectrum goes down to —oo. Here suffice 
it to say, as will be shown below, the simple Hamiltonian in (6.5.18) with the 
Bose “field” of one degree of freedom, leads to a spectrum which has a strictly 
negative part for all A 4 0, and emphasize that an immediate supersymmetric 
generalization of (6.5.18) “shifts” the spectrum to a non-negative value. 

To the above end, we note the following expectation values, in the state 
|0,0), involving the anharmonic cubic term ¢°: 


(0, 0| ¢° |0,0) = 0 (6.5.19) 

(0,0| ¢° 6° |0,0) = 15 (6.5.20) 

(0, 0| 6° ¢° 6° |0,0) = 0 (6.5.21) 
(0, 0| 4 al, ap 6° |0,0) = 27 (6.5.22) 


as obtained from the commutation relations involving ap, al. 


As a trial state, we choose the normalized state 
mel see) |0, 0) (6.5.23) 


for A # 0. 
By definition of the lowest point E of the spectrum (see also Prob- 

lem 6.18), 
E< (wlHly). (6.5.24) 


To obtain the expression on the right-hand side of this inequality, we use 
the ones for the expectation values given in (6.5.19)—(6.5.22) and note that 
the terms in the Hamiltonian depending on the Fermi operators ar, ap, more 
specifically aap, does not contribute to it. This is because ap, al, commute 
with the Bose operators, and ar |0,0) = 0. 


Accordingly, 
E 1 2 
—< i 3 gi 3 ty 3 3 es 
Fa S TEP (0:01 6" ab ap 6° |0,0) — 25 (0,0] 9° 6" 10,0) (6.5.25) 
or : 
BS — hw (6.5.26) 
for all X 4 0. 


A supersymmetric version of the Hamiltonian in (6.5.18) which includes 
correct terms to cancel out, in particular, the negative contribution of the 
anharmonic term \¢? is immediate. To obtain precisely this cubic term in the 
supersymmetric Hamiltonian, we introduce the supersymmetry generators 


Q = hV2w (ah + 6?) ap (6.5.27) 
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Qt = hv2w (ap + 4”) al, (6.5.28) 


as a generalization of the non-interacting case in (6.5.1), (6.5.2). 
It is readily verified, that these generators give rise from 


i 
Hs = a {Q,Q'} (6.5.29) 
to the Hamiltonian Hg, 
H 1 
on = aap talap +4 (<f, ap — 5) b+ rd? + 74%. (6.5.30) 


The non-negativity of the spectrum of Hg follows, as before (see §4.7), by 
noting that for a state |x) 


(xlHslx) = 55 ((l@txl? + ll@ull?) > 0. (6.5.31) 


It is remarkable that a supersymmetric version of a Hamiltonian provides 
a Hamiltonian with a non-negative spectrum especially if the former would 
have an unbounded spectrum from below. It is also interesting to note that 
supersymmetry provides constraints on the couplings (such as Ag = 4X in 
(6.5.18) /(6.5.30)), thus reducing much of the arbitrariness in choosing inter- 
actions. Such facts have been quite useful in field theory. 
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A coherent state is a very special linear combination of the states |n), 
given in §6.1, which brings the quantum mechanical treatment into very close 
proximity with its classical counterpart description in the most natural way 
and a contact with classical notions is most suitable to the experimentalist. 
Such a state arises naturally in the following way. 

The matrix elements of the position and momentum operators x(t), p(t) 
at any time t, and the Hamiltonian H = p?/2m + mw?x?/2, with respect to 
the states |n) may be read from (6.1.45), (6.1.47), (6.1.24) to be given by 


1/2 
(n! |x(t)|n) = (s=3) [Vn 1oy nse + Vn bain ie) (6.6.1) 


2mw 


1/2 
(n'|p(é)|n) = i (=) [Vn + lon nti et — Vndnn-1e i") (6.6.2) 


(n! |H|n) = barn Fw (w en 5) (6.6.3) 
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The expressions in (6.6.1), (6.6.2) are not only non-diagonal but also any 
resemblance to the classical solutions 


Xe(t) = |A| cos(wt — 6) (6.6.4) 
p-(t) = —|A| mw sin(wt — 6) (6.6.5) 
Hos eee (6.6.6) 


is quite remote. 

For given w, m these classical expressions are parameterized by the ampli- 
tude |A| and the phase 6, or equivalently by the complex number A = |A]e’®. 

We will construct a state, referred to as a coherent state, as a linear 
combination of the states |n) and conveniently parametrized by the complex 
number A, denoted by |A), such that the expectation values of the operators 
x(t), p(t), with respect to |A) coincide with the classical counterparts (6.6.4), 
(6.6.5), and such that the expectation value of H in |A) comes as close as 
possible to the classical expression in (6.6.6). As we will see below, the reason 
why (A|H|A) cannot exactly coincide with H; in (6.6.6) is due to the presence 
of the zero point energy fw/2 in the spectrum of H in (6.1.24) and due to a 
positivity constraint. 

That is, we introduce a state 


= 5°C,(A) |n) (6.6.7) 


(A|A) = > |Cn(A (6.6.8) 

such that 
(A|a(t)|A) = |A| cos(wt — 6) = x(t) (6.6.9) 
(A|p(t)|A) = —|A| mw sin(wt — 5) = p-(t) (6.6.10) 


and, as we will see the closest we can come to H, for the expectation value 
of H, is given by 


2 Iw hw 
— b> SHet+ 5. (6.6.11) 


(A|H|A) = |A 
The coefficients squared |Cn(A)|? have the usual interpretation of the 
probabilities of the system, described by the state |A), to be found in the 
states |n). 
From (6.6.1), (6.6.7), we explicitly have 
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oo 1/2 
(Ala(t)|4) = > (+) (Cn41(A))* Cn(A)Vn + Le + cc. (6.6.12) 


2mw 
n=0 se 


which from (6.6.9) requires that 


* 


oo 1/2 
», (sas) (Cn41(A))” Cn(A)Vn +1 = aa (6.6.13) 
n=0 


where we recall that A = |A| exp id. 

Equation (6.6.10), for the expectation value of p(t) in the state |A) gives 
the same condition as in (6.6.13). 

On the other hand, from (6.6.3), we have 


(A|H|A) = hoon |C,(A P+ (6.6.14) 


and the non-negativity of ye n |Cn(A) |’ in this equation should be noted. 


n 
Upon defining the convenient complex number 


mw 1/2 
the conditions (6.6.8), (6.6.13), (6.6.11), (6.6.14) give 
bee oncen (6.6.16) 
CG 
yy HO, Vnt1=1 (6.6.17) 
n=0 
co C* 
yo Cn+1 (nt+1)=1 (6.6.18) 
z z 
n=0 


where we have assumed n Neale = 0, for n = 0 in writing the last sum, and 
suppressed the A (or z) dependence of the Cy. 

We note the summands in (6.6.16), (6.6.17) have the C,, factor in common, 
while (6.6.17), (6.6.18) have the C%,, factor in common. It is easy to see 
that the following recurrence relation obtained from the comparison of the 
summands in (6.6.17) and (6.6.18), 


Ch4i= Ch, =0,1,... 6.6.19 
+1 oF eng n ( ) 


satisfies all of the sums in (6.6.16)—(6.6.18) (see also Problem 6.20, (6.6.26)). 
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The solution of (6.6.19) is elementary and is given by 


c, = 2" terse (6.6.20) 
n! 
where we have finally used the normalization condition (6.6.16) again to solve 
for Co. 
That is, apart from the expected zero point energy hw/2 in (6.6.11), the 
state, now parametrized by the complex number z in (6.6.15), 


—|z2|?/2 = (2)? 
jz) =e "lt / Dea (6.6.21) 
(z|z) =1 (6.6.22) 


gives rise to expectation values for x(t), p(t), coinciding with the classical 
solutions, and for H coming close to its classical counterpart. 

Let us recapitulate the physical meanings of the complex number z, with 
which the coherent state in (6.6.21) is labelled. To this end, upon writing 


z= |z/eé (6.6.23) 


|z|, up to ascaling factor, denotes the amplitude of the corresponding classical 
solution (6.6.4), and 6 provides its phase, and the privileged state (6.6.21) 
brings the quantum oscillator in the light of classical notions well suited to 
the experimentalist. Many additional properties of the coherent state |z) will 
be obtained as we go along. From (6.6.18), (6.6.20), we also have 


oo co Qa\n 
Son |Cn|? = Son (l21") el? = |2)2 =m. (6.6.24) 


n! 
n=0 n=0 


That is, |z|? denotes the mean excitation quantum number n in the state |z). 
In this respect the probability of the system, described by the state |z), to 
be found in the state |n), is then given from (6.6.21), (6.6.23) to be 


l(n|z)|? = P.(n) = on n=0,1,... (6.6.25) 


which is the celebrated Poisson probability mass function.® 

A property of central importance of a coherent state |z) is that with 
respect to it, the annihilation operator a (86.1) becomes a multiplicative!® 
one, L.e., 


° For detailed properties of a very wide range of probability distributions, see: 
Manoukian (1986c). 

10 Many authors introduce coherent states by beginning from (6.6.26) as a defining 
equation for the state |z). 
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a|z) = z|z) (6.6.26) 

as is easily checked, and although a is not a self-adjoint operator, (6.6.26) 
signals its importance in making contact with classical solutions. 

The unitary operator exp(—itH/h) applied to |z) gives 

(e~#*z) am 


vil 


as follows from (6.6.21) and (6.1.24). That is, we may write 


Co 
e itH/h \z) = |z;t) = e7lzl?/2 e wt/2 ys 


n=0 


In) (6.6.27) 


|z;t) = |2(t)) ee"? (6.6.28) 


where 


z(t) = ze? = |z| ett), (6.6.29) 


The coherent states (6.6.21), for all z, satisfy at all times t, the very 
optimal minimum uncertainty principle criterion: 


2) (|p) — @)?|2) = (6.6.30) 


where (x(t)) = (z|x(t)|z). The equality in (6.6.30) follows by noting that 


(z\e(@\z) = 2” Jal? cos? (ut —o)+ — (6.6.31) 
z|p?(t)|z) = 2hmw|z|? sin? (wt — 6) + pat (6.6.32) 
2 


and using (6.6.9), (6.6.10), (6.6.15). 

It is expected that the coherent states |z), for all complex z, providing 
the totality of all possible amplitudes and phases, admit a (completeness) 
resolution of the identity. This is indeed the case and it reads: 


[= z) d(Im z) 


T 


|z)(z| =1. (6.6.33) 


This is easily established by working, for example in polar coordinates and 
setting z = re? in (6.6.21) and on the left-hand side of (6.6.33) (see Prob- 
lem 6.24) giving 


oo 20 oo 
-{ rar [ dé re? )(re!?| = x |n)(n| = 1. (6.6.34) 
0 0 n=0 


This resolution of the identity is not to be confused with the corresponding 
one for a self-adjoint operator (§1.8), as the z are not some eigenvalues of 
a self-adjoint operator and the states |z) do not satisfy any orthogonality 
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conditions. As a matter of fact one explicitly has, directly from (6.6.21), and 
(6.1.31), that 


(2'|z) = (Olexp (za) exp (za") |0) exp — (6.6.35) 

and since 
la, [a,a"]] =) (6.6.36) 
fa’, [a,a"]] =0 (6.6.37) 


an application of the Baker-Campbell-Hausdorff formula (see Appendix I) 
gives 


2 1/2 
(z'|z) = exp (<2 - a - er) : (6.6.38) 

In particular, one has 
|(2" |z)|? = exp (-|z’ — z|?). (6.6.39) 


The coherent states in (6.6.21) also satisfy the continuity’! condition: 


Il 12’) — lz) || 0 (6.6.40) 


Zz 


which is an important property in the mathematical definition of coherent 
states, and follows from (6.6.22) and (6.6.38). 

In the light of the minimum uncertainty principle in (6.6.30), it is worth 
investigating the time development of the x-space coherent wavepacket as 
well as of the p-description. We work out the details for the x-description 
only; the p-description is left as an exercise (see Problem 6.26). 

From (6.6.28), (6.6.21), (6.6.29) and (6.1.31), we may write 


2; t) = e7l21?/2 eivt/2 gx(t)a! gy (6.6.41) 
and from the definition (6.1.10) we have 


(x|z;t) =e7!71°/2 e-i#t/2 (| exp ax = iP) |0) . (6.6.42) 


v2 


Using the commutation relation [X ,P] = i in (6.1.11) for vy = 1 and the 
Baker-Campbell-Hausdorff formula, this gives 


11 More precisely this is referred to as strong continuity, while weak continuity is re- 
ferred to the weaker condition (| z’ — z) — 0, for normalizable states |), which 
follows from strong continuity by a direct application of the Cauchy-Schwarz in- 
equality. 
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; t 
(x | 2;t) = e7I°/? e~iwt/2 gx) X/V2 e—"()/4 (| exp iP |0). (6.6.43) 


Upon rewriting the ground-state wavefunction in (6.1.33) in terms of X: 


Yo(2) = Gay exp (-=) (6.6.44) 


we obtain 


(a|z;t) = (Bey eriwtr exp | id | 205 a) 


x exp -} (x - al (6.6.45) 


Finally using the expression for z(t) in (6.6.29), the definition (6.6.15), 
the fact that X = (mw/h)'/?x and (6.6.4), we get 


(x|2;t) = Ey ete |-S= (x — we(t)) "| exp|id(e,t)] (6.6.46) 


where exp (i¢(x, t)) is a phase factor with 


wt» lal SiN : 
(x,t) = Sp + > sin [2(wt — d)] — ee |z|a sin(wt — 6). (6.6.47) 
This gives a Gaussian probability density 
mw \1/2 mw 2 
a | 2; t)/? = (=) exp |-~ (x — x(t) (6.6.48) 


centered about the classical solution (6.6.4), and most importantly it is non- 
spreading in time as the variance o?(t) is time-independent given by 
h 
2 2 
oz(t) = 0,(0) = — 


x 


(6.6.49) 


A similar analysis may be carried out in the p-description. 
As a function of the dimensionless variable X in (6.1.8), the probability 
density corresponding to (6.6.48) may be rewritten as 


1 (X — V2Re[z(t)])” 
2 i? 


1 1 
Vim (1/3) 


where we have used (6.6.29), (6.6.15), (6.6.4), corresponding to a standard 
deviation equal to 1/\/2. In particular we note that for z(t) pure imaginary, 
the Gaussian distribution in (6.6.50) is centered at the origin. The density in 
(6.6.50) is normalized with respect to the measure dX. 


|(X |z;£)|? = exp (6.6.50) 
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Problems 

6.1. Derive the properties given through (6.1.11)—(6.1.15). 

6.2. By writing the Schrédinger equation in the momentum description and 
using the solutions given in (6.1.39), (6.1.41), in the position descrip- 
tion for vy = 1, obtain the eigenstates (p| 7) = v(p) in the momentum 
description satisfying the normalization condition 

dp 2 
| sm WOP=1 

6.3. Derive the expressions for the matrix elements given through (6.1.45) 
(6.1.48). 

6.4. A particle is moving in a one-dimensional harmonic oscillator potential 
and is in the state n = 1. 

(i) Derive an expression for the probability of finding the particle in 
the range 71 <a < 2. 

(ii) Derive an expression for the probability of finding the particle’s 
momentum in the range p; < p < po. 

6.5. A uniform electric field E is applied to a charged particle moving in 
a one-dimensional harmonic oscillator potential. Find the eigenvalues 
and the eigenvectors of the resulting Hamiltonian. 

6.6. For an arbitrary number (3, show that 

aes’ = 0%! (a + 8) 
where a, a! are the Bose annihilation and creation operators. 

6.7. Verily that G(t), y(t) in (6.2.8), (6.2.9), are, respectively, the solutions 
of (6.2.7) satisfying the initial conditions in (6.2.6). 

6.8. Solve the equation (6.2.4) for the Hamiltonian (6.2.2) by the method 
developed in Appendix to §2.5. 

6.9. Show that the expression in (6.2.14) is equivalent to the one in (6.2.13). 

6.10. Derive the expression for the normalization constant C in (6.3.3) for 
the Boltzmann factor. 
6.11. If the disturbing source F(t) in (6.2.1) is given by 
hw La i 
F(t) = cos (=) ; for -|t)< 
0, for |t| >T 
find the probability of having a transition from an initial state n = 2 
to a final state n = 3. 
6.12. Show that the double sum in (6.3.5) may be rewritten as the expression 
in (6.3.6). 
6.13. Obtain the solutions in (6.4.16), (6.4.17). 
6.14. Derive the equations (6.4.20)—(6.4.22) for the Fermi-oscillator. 


6.15. 


6.16. 
6.17. 


6.18. 


6.19. 


6.20. 


6.21. 


6.22. 


6.23. 


6.24. 


6.25. 


6.26. 


6.27. 


Problems 357 


Verify the Bose = Fermi transformation rules given in (6.5.15), 
(6.5.16). 

Obtain the values for the matrix elements in (6.5.19)—(6.5.22). 

Show that the supersymmetric generators in (6.5.27), (6.5.28) lead to 
the Hamiltonian Hg given in (6.5.30). 

Investigate further the nature of the lowest point F of the spectrum 
of the Hamiltonian in (6.5.18), i.e., whether F = —co or E > —co, by 
constructing states, in analogy to the one in (6.5.23), containing an 
arbitrary large number of boson excitations. 

Show by invoking, in the process, the normalization condition (6.6.16), 
that the recurrence relation in (6.6.19) leads to the solution given in 
(6.6.20). 

Can you find another solution for the coefficients C,,(A) in (6.6.7) 
consistent with the equations (6.6.16)—(6.6.18)? Comment on this. 
Show directly from the expression of the probability density 


(Cp a 


n! 


P.(n) = 


that 


Ss nP.(n) =a). 
n=0 


Derive explicitly the multiplicative nature of a as given in (6.6.26) for 
coherent states. 

Establish the equalities in (6.6.31), (6.6.32) for the expectation values 
of x(t), p?(t), in the coherent state |z). 

Derive the resolution of the identity resulting from the coherent states 
|z) as given in (6.6.34). 

Prove the non-orthogonality relation of the states |z) as given in 
(6.6.38), (6.6.39) and establish the continuity property spelled out in 
(6.6.40). 

In analogy to (6.6.46), (6.6.47) derive the expression of the time- 
dependent coherent states in the momentum description (p| z;t). 

Use the generator G = i€* Q — iQ? 85€, for infinitesimal transforma- 
tions introduced in §2.9, to derive the infinitesimal transformations 
dag, Sah, bar, Sah, corresponding to the supersymmetric generators 
Q, Qt in (6.5.27), (6.5.28). 
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Intricacies of the Hydrogen Atom 


The purpose of this chapter is to carry out a detailed study of the hydro- 
gen atom. The first section deals with its stability whose importance cannot 
be overemphasized and is based on the earlier analysis in §3.1 on uncertain- 
ties, localization and stability. In the process of this analysis, we will derive an 
upper bound for the average kinetic energy of the electron in the atom, and 
obtain a non-vanishing lower bound to its average radial extension as well 
as determine its exact ground-state energy without even considering first 
the eigenvalue problem. In particular, the boundedness of the spectrum of 
the corresponding Hamiltonian of the (bound) atom from below implies the 
vanishing of the probability of the “fall” of the electron on the proton. The 
eigenvalue problem is studied in §7.2, §7.3 and the inclusion of the spin of 
the electron and relativistic corrections is the subject of §7.4-8§7.6. In §7.4, we 
will see how a direct generalization of the Schrédinger equation may be car- 
ried out to include such effects! as the fine-structure and hyperfine-structure 
of the atom for all orbital angular momenta. The so-called non-relativistic 
Lamb shift is presented in detail in $7.7 where the electron, treated non- 
relativistically? in the atom, interacts with radiation. This provides a split- 
ting of some of the energy levels which would otherwise be degenerate by an 
analysis based on the Schrédinger equation, as well as on the Dirac equation, 
with a Coulomb interaction alone, if radiation is not included in the treat- 
ment. Here due to the ever presence of radiation accompanying the electron 
we will encounter the concept of mass renormalization and the analysis will 
provide a glimpse of the fascinating world of quantum filed theory, namely 
that of quantum electrodynamics. This section is followed by one dealing with 


1 The quantum description of relativistic particles will be studied in more detail 
in Chapter 16. 

? In the atom, the speed of the electron is roughly suppressed by a factor a relative 
to the speed of light, where a ~ 1/137 is the fine-structure constant (see (7.1.16), 
(7.4.34)). 
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the decay and determination of the mean lifetimes of excited states. Finally 
in 87.9, we consider the hydrogen atom in external electromagnetic fields. 


7.1 Stability of the Hydrogen Atom 


Due to the central importance of the subject matter of this section in 
quantum physics, we recapitulate some of the details given in §3.1 as special- 
ized to the hydrogen atom. 

First we derive a finite lower bound to the spectrum and prove that it ac- 
tually coincides with the exact ground-state energy of the atom. The bound- 
edness of the spectrum from below is important, otherwise the electron would 
fall into such an unbounded level releasing an unlimited amount of energy 
and is physically meaningless.® 

We then use the finite negative property of the lower bound of the spec- 
trum derived to investigate the localization of the electron in the atom and 
study further the stability of the atom. 

The Hamiltonian of the relative motion of the electron to the proton is 
given by (§2.5, (2.5.30)) 


Hat ==, (7.1.1) 


where T = p’/2u, p = —ihV, pu is the reduced mass of the electron-proton 
system. By choosing g(x) = —e?/|x| directly in (3.1.8) and using the fact 
that V - (x/|x|) = 2/|x|, we obtain from (3.1.8) 


4 2 
se ~ € 
<(T JL? 
ne < (PF) an 
providing a lower bound for the ground-state energy E of H in (7.1.1). Ac- 
tually, the bound —pe*/2h? provides the exact value for E. To see this, note 


that for any normalized trial function ®(x), we may write (see the introduc- 
tion to Chapter 4) 


4 2 
pe ~ € 
—-— 7, <E<(®|(T-— ]|®). 7.1.3 
ne <E< (8|(P- 5) ) oe 
In particular, for 
1 
2(x) = Te prlte-Pr xl =r, B>O (7.1.4) 


3 From the analysis given in §4.1 alone, we already know that the spectrum is 
bounded from below since the Coulomb potential —e?/|x| satisfies the sufficiency 
conditions (4.1.2), (4.1.3) for v = 3, that is, it is locally square-integrable and it 
vanishes for |x| — oo. 
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F 2 10,20 . 
with V° > 4,7" 5, we obtain 


(7- =) ®(x) = (2 3) - “| 200. (7.1.5) 


Hence by choosing @ = pe?/h?, the right-hand side of (7.1.5) becomes 
— (e*/2h?) ®(x), and for (7.1.3) we have 


4 4 
je je 
-—<E<-~,; 7.1.6 
2ne ~~ 2h ( ) 
thus establishing that the exact ground-state energy is given by 
4 
je 
B=-_—,. wl. 
p=- (7.1.7) 
Incidentally, 1/3 = h?/e? is the celebrated Bohr radius: 
h2 
ao = —5 (7.1.8) 
[se 


which gives an order of magnitude of the average of the radial extension of 
the atom in its ground-state, since 


(® |r|®) = 3a9/2 (7.1.9) 

where |®) is the ground-state as defined in (7.1.4) with 6 = 1/ao. 
Quite generally, we may derive a Heisenberg uncertainty-like inequality 
between the average radial extension of the atom and the average kinetic 


energy of the electron in the following manner. By choosing g(x) = 1/|x|,a = 
0,b = 0 in (3.1.3), we obtain for the electron in a normalized state |y) 


A h2 
(wlF Ie) > 3, (° 
Lb 
On the other hand, we may invoke the Cauchy-Schwarz inequality 


1= [a°x\vGo)? = far SPF \wG0) 


1 


r 


vy. (7.1.10) 


1 1/2 wf 
< (|= Ie) (wird) (7.1.11) 
giving 
(v E *) 2 — (7.1.12) 
r (b |r|) 


to infer from (7.1.10) that 


362 7 Intricacies of the Hydrogen Atom 


2 
(oF fy) Ge lrlay? > oe (7.1.13) 

An interpretation of this inequality is that the smaller the average ex- 
tension of the atom, the larger is the average kinetic energy of the electron. 
The complete balance between the increase of the average kinetic energy of 
the electron, as implied by localization, and any energy loss from the atom 
provides the final stable state of the atom. 

For the hydrogen atom, as a bound-state, the average kinetic energy can- 
not increase by an unlimited amount and a finite upper bound to it may be 
derived as shown below. In the same way, the average radial extension of the 
atom cannot become arbitrarily small and cannot vanish as is also shown 
below. To establish these properties, we note that for a bound-state 


(7-=) <0 (7.1.14) 


G)s G =) (7.1.15) 


where T'/2 — e?/|x| is the Hamiltonian of the hydrogen atom when pi is 
replaced by 2y (see also (3.1.18), (3.1.24)). Hence for a bound-state, we have 


from (7.1.15), (7.1.2), 
a 2pe* 
(f) < =. (7.1.16) 
This gives an upper bound for the average kinetic energy of the electron 
bound in the atom. 
The inequality (7.1.16) in turn, provides from (7.1.13) the non-vanishing 
lower bound 


or 


(r) > 5 (7.1.17) 


for the average radial extension of the atom, where we have used the definition 
of ao given in (7.1.8). 

It is important to realize how quantum physics has built-in inequalities 
as in (7.1.17). The boundedness of the spectrum from below by the finite 
negative number in (7.1.2), (7.1.7), leading, in particular to the bound in 
(7.1.16), provides also a rigorous bound on the probability distribution of the 
radial extension of the atom not only for its average value as is shown below. 

To the above end, (7.1.10) leads to the following chain of inequalities: 


2 (wv) > Jo ay > [es uae (7.1.18) 


Tolfr 


for any 0 < ro < ov, where we have used the positivity of the integrand 
\~o(x)|"/r. Since in the last integral in (7.1.18), we have the constraint 1/r > 
1/ro, we obtain 
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2 (u|fjvy >> fat woor (7.1.19) 


rolr 
or equivalently to 
Qn pa\ 1/2 
Prob [r < ro] < roy) 55 (f) (7.1.20) 
with |) normalized. For a bound-state, (7.1.16) then leads from (7.1.20) to 
Prob [r < ro] <2 (7.1.21) 
ao 


giving the consistent and satisfactory result of a vanishingly small probability 
for a vanishingly small ro and rigorously vanishing for ro — 0, i.e., for the 
“fall” of the electron on the proton. 
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From the analysis carried out in $4.1, one may readily infer the nature 
of the spectrum of the Hamiltonian with a Coulomb potential. The latter 
potential is locally square-integrable and vanishes at infinity, i.e., it satisfies 
conditions (4.1.2), (4.1.3). Also we have shown in §7.1, that the lower bound of 
the attractive Coulomb Hamiltonian is strictly negative (see (7.1.2), (7.1.7)). 
Hence from Theorem 4.1.1 alone, we may immediately conclude that this 
Hamiltonian has a discrete spectrum on the negative real axis of at most 
finite degeneracy, and a continuous spectrum on the positive real axis [0, 00). 
[For a repulsive Coulomb potential, the corresponding Hamiltonian is positive 
and hence has only a continuous spectrum on the positive real axis.| 

To study the spectrum of the hydrogen atom, we rewrite the Hamiltonian 
(7.1.1) in detail as 


2 
ye pe L e? 


= Qu Dr? ram ee |x| =?PTr (7.2.1) 
where . ' 
pr = —ih (+ + ~) (7.2.2) 


and L? is the square of the orbital angular momentum, which when written 
in spherical coordinates is given by 


ee eee ee ts ee 
L* = —h Ee 0 sin 6 06° sm26 OG? | (7.2.3) 


It is easily verified that H,L?, L, commute and hence we may find simul- 
taneous eigenstates for them which we conveniently denote by |, ¢,m) and 
set 
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H|B,£,m) = E|E,£,m) (7.2.4) 
L? |E, £,m) = h70(€+1)|E, ¢,m) (7.2.5) 
L,\|E,£,m) = hm |E, ,m) (7.2.6) 
where @ takes on non-negative integer values (§5.3) and m = —é,—€+ 


1,...,0,1,...,é By definition the states |E,2,m) are in the domain of defi- 
nition of the operators H,L?, L.. 
From (7.2.4), (7.2.5), we may also write 


H|E, l,m) = Hy |E, l,m) = E|E,é,m) (7.2.7) 
where 2 R2ee41) ; 
Dy +1 e€ 
Ay = | a2: 
. Qu Qur? r VG28) 
and clearly He, L?, L. commute. We define the operators 
ee Ih. pie” 
De = Pr : 2s 
as es ( r (+h wey 


To define the adjoint Di of De, we must impose appropriate boundary 
conditions, in the process, in defining the operator p,. With a measure of in- 
tegration d°x, we consider functions f(x), g(x) such that rf (x), rg(x) vanish 
not only for r — co but also for r — 01. The adjoint DI of Dz, satisfying 


(9|Delf) = (Dj a|f) (7.2.10) 
obtained by integrating by parts, is then given by 
((E+ Ah pie? 
Dt =p. +i(! 7.2.11 
eee ( r @+DR a) 


(see Problem 7.4). 
The following relations are easily established for = 0,1,... 


(i) Hey: Di} = D! Ae (7.2.12) 
(ii) He De = De Hes (7.2.13) 
and that 
4 
fi) De D! 295 (Fee 214 
ys ete u( ta oan) Gen4) 
4 
‘ ; _ pe 
(iv) D; De = 2p (Hess + aH} - 


(7.2.15) 
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Since pe*/2(¢+1)h? in (7.2.14) is decreasing in @, the positivity of Ded} 
implies from (7.2.14), (7.2.7) that for the ground-state energy E = —pe*/2h?, 
obtained in (7.1.7), @ = 0; for larger @ this positivity is violated. 

Quite generally, (7.2.12) and the right-hand side equality in (7.2.7) imply 
that 

Fria (Di |E, 0, m)) =E (Di |E, 0, m)) (7.2.16) 


with the same eigenvalue E. 
That is, D} |E,0,m) is an eigenstate of Hy,1, and we may write* 


Di |E,£,m) = cf |E,£+1,m). (7.2.17) 


From (7.2.14), one may infer that, 


pet 1/2 
eae: 2 (2 + eae) (7.2.18) 


with a chosen overall phase convention. 

Hence from the positivity of DD} the sequence D} |E,£,m), 
Di ers): |E,£,m),... must terminate for some € = max for an eigenvalue 
E,E<E <0. Asa matter of fact if D} |E’,£,m) does not vanish for such an 
é, then D},D} |E',£,m), if not the zero vector, would violate the positivity 
of Dey1Di,,, in (7.2.14) for 0 €+1, since E + pe*/2(€+ 1)?h? would be 
negative. That is, there exists an @ = Cmax, for an eigenvalue EF < 0 such 
that DI. |E', £max, m7) is the zero vector. As we have seen above fax = 0 
for the ground-state energy E. Since (max is a non-negative integer, we may 
set (max = n—1,n=1,2,..., with n = 1 corresponding to the ground-state 
energy L. 

The vanishing of Cp. = c{_,, then imply that the eigenvalues E may 
be parameterized by n, and, from the vanishing of c_, in (7.2.18)/(7.2.17), 
take the values: 


4 
jue _ 
Bn =~ spp N= 1,2). (7.2.19) 
with FE, = E as seen in §7.1. 

For the subsequent analysis, we use the labelling |n,@,m) for |E,2,m), 


and we recall the condition 
Di_,|n,n—1,m) =0. (7.2.20) 


Since for a given £, m may take on (2¢+1) values, the degree of degeneracy 
of an energy level E,, in (7.2.19), taking into account the spin of the electron, 
is given by 

n-1 
2S 5 (2041) = 2n?. (7.2.21) 
f=1 
* Here it is sufficient to assume that any operator which commutes with He11, L?, 
L, and which is not a function of the latter operators also commutes with DI. 
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7.3 The Eigenstates 
The x-description state wavefunctions may be represented as follows: 


(r, 0, 7) | n, £, m) = Rnie(r)Yem (9, ¢) = Vnem(T, 6, ¢) (7.3.1) 


where the Y~,,(0,@) are the spherical harmonics defined in (5.3.26), (5.3.30), 
(5.3.51), satisfying (5.1.19), (5.1.20): 


L?Yem (0,6) = h(E + 1) Vem (9, @) (7.3.2) 
LzYem(0,¢) = h MYim(0, 4) (7.3.3) 


and from (7.2.20), (7.2.11) and (7.2.2), ((r|n,) = Rne(r)): 


0 (n—-1). pe? _ 
(> - + nie Ran-i(r) =0 (7.3.4) 
whose normalized = 
i r? dr Rei’) =u (7.3.5) 
0 
solution is 
2 3/2 prt 
Ran—1(r) = ex 2 7.3.6 
mat) = (SE) Fe exnt-0)2) (7.3.6) 
where 5 
p= (=) r (7.3.7) 
nag 


and ao is the Bohr radius in (7.1.8). 
In particular, (7.3.6), (5.3.45) imply from (7.3.1) that 


1\3/2 
Rio(r) = (=) 2 e— 7/40 (7.3.8) 
and for the ground-state 
1 1 3/2 
wr100(r, 9, ¢) = Tk (—) ge TI (7.3.9) 


which coincides with the expression in (7.1.4), with 3 = pe?/h? = 1/ap. 

A similar analysis as carried out in (7.2.16)—(7.2.18), in reference to 
(7.2.12), as now applied to (7.2.13), (7.2.15) with ¢ — @—1, leads to 

2\/in2 — 2 
Dy_1|n,é,m) = i“ In, £—1,m) (7.3.10) 
hne 

for € £ 0 (see Problem 7.7). The —i factor in (7.3.10) is chosen for convenience 
(see, (7.2.18) and (7.3.12)). Equation (7.3.10) then gives 
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ihen 
Rnve-i(r) = Po a las i (7.3.11) 


Equation (7.3.11) provides the differential equation 


Rnw-i(r) — 


at E Cee ee bee (7.3.12) 


Vn2— 2 |do- p 20 


for #0, and R,¢_1 are necessarily normalized for normalized Rp». 
In particular, for =n — 1, (7.3.12) gives from (7.3.6) 


2 a u n—2,.—p/2 
Ran-2(r) = (=) Gn = Dion p76 9/7 |_9 +2(n—1)] (7.3.13) 


for n > 2. 
Equations (7.3.6), (7.3.12), (7.3.13) suggest developing a proof by in- 
duction in @ to find the explicit expression for Rye. To this end, as an in- 


duction hypothesis, suppose that for all 2 = k,k+1,...,n—1, for some 
k:0<k<n-1 


AN Ge PN 
= {| — ha cc ech —p/272€+1 
Rne(r) (=) (nt Dn pe Ly 1(p) (7.3.14) 
where 6 
M -me’? (d M+N.-p 
Ly (p) =p ial dp [p e€ | : (7.3.15) 


Equation (7.3.14) is easily checked to coincide with (7.3.6), (7.3.13) for @ = 
n—1,€=n-— 2, respectively. We then show that (7.3.14) holds also true for 
£ replaced by @—1. 

The L'(p) are so-called associated Laguerre polynomials, having the se- 
ries expansion 


N t 


with the normalization in (7.3.15) adopted,°® and satisfy for M > 2, the 

recurrence relation 

(N + 1)(N +) -u-2 
M—-1 N+1 


(p) = 0. 


(7.3.17) 
Upon substitution of the expression (7.3.14) in the right-hand side of 
(7.3.12), it is readily checked by using the recurrence relation (7.3.17) with 


Te] EMU) 


° Note that different normalizations are sometimes adopted for the associated La- 
guerre polynomials by different authors. 
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M = 20+1,4=k,...,n—1, for some k > 0, that Rp, e_1(p) is also given by 
(7.3.14) with @ replaced by ¢— 1. This establishes the validity of (7.3.14) for 
all €=n-—1,n—-2,...,1,0 by induction. 

All told, the normalized wavefunctions in (7.3.1) are given by 


a er =p 1)! 
Unem(r, 0, o) = (=) rare peel? L2tth 1 (p)Yem(8, ) 
(7.3.18) 


where p is defined in (7.3.7) and (7.1.8). Note that p depends on n. The 
associated Laguerre polynomials are given in (7.3.15), (7.3.16). Here n = 
1,2,...;€=0,1,...,n-1; m=-—,-€4+1,...,-1,0,1,...,@-—1,2@. Also 


ie ar f'sinoao [ “ae Wsacte 8 6,6)? =1. (7.3.19) 


To study the orthogonality of the wavefunctions in (7.3.18), we first recall 
from (5.3.51) that 


T 20 
| sin 0 d0 il do Yiim(9, 6) Yem(8, 6) = 605mm: (7.3.20) 
0 0 


Hence it remains to establish the orthogonality of Rne(r), Rne(r) for n’ £ n. 
Due to the n dependence, in particular, of p, the argument of the Laguerre 
polynomials in (7.3.18), establishing the orthogonality of Ry¢(r), Ryre(r) for 
n’ ~ n based on any orthogonality properties of (weighted) Laguerre polyno- 
mials is not obvious. One may, however, establish this orthogonality property 
directly and rather easily from the Schrédinger equation satisfied by Rpe(r), 
which by using (7.3.2), (7.2.19) and (7.2.1), is given by 


cE (5 =) e per hee +1) 


(r) = 0. 31 
ace as Rye(r) =0. (7.3.21) 


rr Qn2h2 Qu? 
Upon setting 
1 =7/a0 (7.3.22) 


and 
Une(n) = 7 Rne(r) (7.3.23) 


(7.3.21) simplifies to 


d 2 1 &e+1) 

a cater ere une(n) = 0. (7.3.24) 
Finally by writing the corresponding equation to (7.3.24) for un/e(7), and 

using the boundary condition 


[oe) 


=0 (7.3.25) 
0 


funel)  tne( a) — une even) 
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one readily derives (see Problem 7.10) the orthogonality relation 


n!2 


0= (=F) [ay uweta) unsl. (7.3.26) 
Hence from (7.3.23), we have for n’ 4 n 
ie dr Rye(r) Rne(r) = 0. (7.3.27) 
If the Coulomb potential in (7.1.1) is replaced by 
V(x) = —Ze?/ |x| (7.3.28) 


corresponding to a nucleus of charge Z|e|, then (7.2.19) simply becomes 


pZe4 
2n?h? 


fe (7.3.29) 
and 1/ag as appearing in (7.3.18) (also in p) is to be replaced by Z/ao. 

The radial wavefunctions (7.3.14) are non-vanishing at the origin only for 
£=0. To see this note that 


ENP) 7s (7.3.30) 
p= 
yielding 
Af ZS? 
2 
— — 4 oO. 1 
Rne(0) = = (2) deo (7.3.31) 


Upon using the fact that Yoo(Q) = 1/47, as given in (5.3.45), we have for 
the wavefunctions in (7.3.18), (m= 0), 


3 
IWnem(0)? = 5 (=) 5, (7.3.32) 


With the definition, 


(f(9)) = [ox H(r) [nem (x)? 


— i re dr f(r)R2,(r) (7.3.33) 


0 
where for simplicity of the notation we have suppressed the dependence of 


(f(r)) on n and £, the following expectation values are useful: 


(r) = [3n? — ee +1)] (3) (7.3.34) 
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(r?) = [5n® + 1 —3¢(6 + 1)] (Sr) (7.3.35) 
(*) = a (7.3.36) 


ar (2) RIDES een 


(3)- (2) qe eee (7.3.38) 


7.4 The Hydrogen Atom Including Spin and Relativistic 
Corrections 


In the present section, we extend the work in §7.2 to include spin and 
relativistic corrections. This may be done from Dirac’s equation studied later 
on dealing with the quantum description of the relativistic electron. We find 
it, however, more suitable to treat this topic right in this chapter for conti- 
nuity, especially since the Schrédinger equation may be readily generalized 
to a relativistic treatment as we shall see below. Needless to say, we will, 
in the process, discover the Dirac equation which will be treated afresh in 
Chapter 16. 

The time-independent Schrédinger equation for a free particle of mass 


may be written as 
2 
p 
—-E =0 7.AL 

& ) » ( ) 


providing the constraint that is imposed between the energy and the mo- 
mentum. Its interaction with an external time-independent® electromagnetic 
potential (U,A) may be then obtained from (7.4.1) by the substitutions 
p-p-tA,E-E-U. 

For the relativistic free electron, we may write 


[p22 + MPct — B] v =0 (7.4.2) 


providing the relativistic constraint between the energy and the momen- 
tum,’ where c denotes the speed of light. We multiply (7.4.2) by the operator 
p2c? + M?c* + E to obtain the more manageable equation 


° The interaction with an external time-dependent electromagnetic potential, in 
general, is easily obtained by considering the time-dependent Dirac equation — 
see for example, the appendix to this section. 

” The square-root operator in (7.4.2) of the positive operator p??+M?c4 is a well 
defined operator, for example, by Fourier transform theory. See also (4.5.34), and 
§16.1. 
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(p?c? + M?c* — E?) U=0. (7.4.3) 


Using the identity 
Oj,0; = bij + le ijk (7.4.4) 


for the Pauli matrices, we may rewrite (7.4.3) as 


(E — Mc?) (E+ Mc?) 


(o-p)(o-p) : 7 v=0. (7.4.5) 
Upon setting 
E+Me 
o-pv= eM (7.4.6) 
E-Me 
spare 2a); (7.4.7) 


Equation (7.4.5) may be rewritten in terms of the following two equations 


E 
o-p&-—W+ Mev =0 (7.4.8) 


E 
—o-p¥+—O+ Mcb = 0. (7.4.9) 


The Pauli matrices are 2 x 2 matrices, and with ®, W each as a two-row, 
one-column matrix, we may combine (7.4.8), (7.4.9) into the equation 


(2.3):9- G2) Erme(2)][f]-0 ao 


which is the famous Dirac equation to be studied later in Chapter 16 and 
we will not consider it here further, as it stands, except in the appendix 
to this section dealing with normalization problems. We are here interested 
in (7.4.8), (7.4.9) to study, approximately, the bound-state problem of the 
hydrogen atom including spin and relativistic corrections. 

We isolate the rest energy Mc? in E 


E=e+Mc (7.4.11) 
and rewrite (7.4.8), (7.4.9), respectively, as 


eW =co-p® (7.4.12) 


(e+ 2Mc’) 6 =co- pv. (7.4.13) 


The interaction with an external time-independent electromagnetic poten- 
tial may be then obtained by the substitutions p - p—£A(x), ¢ — e—U(x), 
giving rise to the equations 
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(c-U)®=co- (p = <A) ® (7.4.14) 


(2Mc? +¢-U)® =co- (p = “A) w. (7.4.15) 


We will see below, how the familiar term —u- B, where u is the magnetic 
dipole moment of the electron and B = V x A, naturally arises in the 
modified Hamiltonian as obtained from (7.4.14), (7.4.15). 

We use the identity 


1 1 U-e 
= 1+ —_ ._—___ 7.4.16 
2Mc2 +e—-—U sure ( - aca) ( ) 


and note that for —U > 0 and |e| < 2Mc?, the denominator 2Mc? + « — U 
does not vanish. Here we have in mind the attractive Coulomb potential 


U(x) =-— (7.4.17) 


and where |e| < 2Mc?. 
Using the identity (7.4.16), we may write for ® the exact equation 


® 


va [tty U-e 


e€ 
~ Me 2M? +e — al a (p _ <A) WV. (7.4.18) 


Upon substituting (7.4.18) in (7.4.14) we obtain for the latter 


e- ow =| F rae (P A) 7 ae me (P “A)] ¥ 


2M! IMe 2Mc? +e-— 


1 e e 9 
+57 |: (p- <A) o- (p- <A) -P?] (7.4.19) 
where we have added and subtracted the term p?/2M on the right-hand side 
of (7.4.19). 

As an order of magnitude estimate, we note that (« -U) ~ Me*/2h? = 
Mc?a?/2, with the latter (the Rydberg) providing the energy scale of the 
hydrogen atom. Here a = e?/hc ~ (137)~! is the fine-structure constant. 
Hence we neglect the (e—U) term in comparison to 2Mc? in the denominator 
on the right-hand side of (7.4.19) as it is suppressed by a factor a?. We will 
keep track of terms up to the order 1/c?. The vector potential A considered is 
due to the magnetic dipole moment (or the spin) of the proton and involves 
a factor 1/c (see (7.4.39), (7.4.40)). For the time being we will treat A in 
(7.4.19) as arbitrary, as it is readily identified, and assess its contribution 
later. This will save us some work in the sequel. 

With the order of magnitude sought in mind, we may rewrite (7.4.19) as 
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(e-U)U= [Po + eae PU -2)o-p| w 
a aw lo. (p- <A) es (p- <A) -p?| Uw. (7.4.20) 


In the appendix to this section, it is shown that the correct normalization 
of U is given by 


[ox wt (x) c 3 es U(x) =1 (7.4.21) 
for the accuracy required. Accordingly, we may set 
p? 
v= E - eel x (7.4.22) 
where now 
[ox x1 (x)y(x) =1 (7.4.23) 


to the accuracy needed. 
In terms of x, (7.4.20) reduces to 


2 


Pp 
(e —U)x—-(e U) ae X 
2 4 
p p 1 
=e -p(U —e)o- A2A 
x iz 16M * ae? p(U —«)o p| x (7 ) 
where 
1 é , 
Fee og |e a) -( ~<a) -p?| A2 
MT Sap Ne eee ee p’| x (7.4.25) 


With A given in terms of the magnetic dipole moment of the proton (see 
(7.4.39), (7.4.40)), F involves a factor 1/c? (see (7.4.31)). In (7.4.24), we 
have used the notation p* for (p?)?. 

The following equalities are easily established (see Problem 7.12) 


o-p(U —2)o- py = p*(U —2)x + ?(V7U)x 


+ih(VU)-pytho-[(VU) x ply (7.4.26) 


(e —U)p*x = p?(e —U) x + (V7(e — U))x 
+ 2ih(V(e —U)) + px (7.4.27) 


where p = —ifV, and note the positions of the various brackets in (7.4.26), 
(7.4.27). 
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Upon multiplying the equality (7.4.27) by 1/2 and adding the resulting 
expression to the one in (7.4.26), we obtain, to the accuracy needed, for 
(7.4.24) 


2 4 2 2 
_[pP Pp p(U—e) fh 2 
(6 —U)x = E ioe + eee + amzalY Y) 
1 /fld 
+ sya (=U) S-L+F x. (7.4.28) 


Here S = fo/2, and we have used the relations 


$-(VU xp) = (250) 8+ xp) 


= Ga S-L (7.4.29) 
rdr 
with L denoting the orbital angular momentum. For an s-state for which 
the orbital angular momentum is zero, i.e., £ = 0, it is advisable to use the 
expression on the extreme left-hand side of (7.4.29) and not of the first one 
on the right-hand side in the evaluations of the energy corrections. 
Now we combine the second and the third terms on the right-hand side 
of (7.4.28) as follows: 


2 2 2 

p p = p p p 1 

8M222 Ee ve u)| x= 3ue P oi (=)| ie 
(7.4.30) 

where we have used, in the process, the equality (7.4.28) for (« — U)y. That 

is, for the accuracy needed, the second plus the third terms on the right-hand 


side within the square brackets of (7.4.28) may be replaced by —(p*/8M%c?). 
Finally we use the operator identity 


o: (p- “A) o: (p- “A) = (p- fa) - MoV x A) (7.4.31) 


(see Problem 7.13) and the properties that V- A = 0, and that A involves a 
factor 1/c (see (7.4.39), (7.4.40)), to simplify the expression for F’ in (7.4.25), 
and rewrite (7.4.28) as 

4 


2 74 
Pp p h 2 
F rEg * gigagt 


1 1d e 
fae cee es See -L——_A-p-u-B =0. 4.32 
2M?2c?2 Geak Mc et cae aoe ne 82) 


Here 
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e€ 
u= Wee (7.4.33) 

is the magnetic dipole moment of the electron, B= V x A. 
The magnetic dipole moment wu in (7.4.33) is more precisely given by 
u = (ge/2Mc)S, where g is the so-called g-factor of the electron given ap- 
proximately® by g = 2 (1 + =) according to quantum electrodynamics, due 

to Schwinger, and 

a =e?/he~ 1/137 (7.4.34) 


is the fine-structure constant. Since B = V x A has already a factor 1/c 
(see (7.4.39), (7.4.40)) in it, one may simply take g ~ 2 as given in (7.4.33), 
(7.4,32).° 

We investigate the meanings of the new terms arising in the modified 
Schrédinger equation (7.4.32). 

Since 


2 4 
22 24)1/2 ee ae p 
(p?c? + M*c*) Mc ou aMae t (7.4.35) 


the — (p*/8M%c’) term gives the relativistic correction to the kinetic energy. 
The interaction (h? /8M 7c) V°U is non-vanishing only at the origin since 


[| 


Vv: (=) = —4763(x) (7.4.36) 


(see Problem 7.14) implies that 
V?U(x) = 4me753(x) (7.4.37) 


and contributes only to s-states. It is referred to as the Darwin term. We will 
see in §16.6, that the physical origin of this may be explained to arise due to 
fluctuations of the position of the electron (referred to as “Zitterbewegung”) 
in the atom, and the electron encounters a smeared-out Coulomb potential 
giving rise to the term V7U(x). 


The term 
1 1 1d 
sana (V0) x Pl = sarza (75,0) SE 


290): Pree xs (7.4.38) 


2M 2c? 


8 See §8.5 for a computation and observation of g. 

° One may add a term —(Ke/Mc)S - B to the left-hand side of (7.4.32), with the 
latter as obtained from the Hamiltonian of the system (7.4.14), (7.4.15), giving 
rise to the term —(ge/2Mc)S - B instead of —(e/Mc)S- B, with g = 2(1+ 4), 
«& = a/2z. For the accuracy needed here, however, this is not essential. 
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is referred to as the spin-orbit (SL) coupling.’° It may be interpreted as 
the interaction of an electric dipole moment (p x S/2M?c?), set up by the 
motion of the electron carrying a magnetic dipole moment u (or spin S), with 
the electric field (up to the charge) —WU in the proton’s rest frame where 
we will be carrying out the computations.!! 

The vector potential A due to the magnetic dipole moment pH, of the 
proton may be written as 


1 
A=-u4,xV (=) (7.4.39) 
r 
and lel ool 
e Ip 
= 7.4.40 
Me OMe ey) 


where g, is the so-called g-factor of the proton approximately equal to 5.56, 
My is its mass and I its spin. 

We have retained the last two terms in the modified Schrédinger equation 
(7.4.32) because they involve the factor 1/c? in them. This will obviously save 
us some work. One should note, however, that as far as orders of magnitudes 
are concerned, since M/M, ~ 0.545 x 1073, we have |u| ~ 10~* |u|. Accord- 
ingly, the energy corrections due to the combination of the last two terms in 
(7.4.32) are suppressed by a factor of 1073 relative to the other corrections 
in (7.4.32). For this reason, and for the subsequent analyses, we separate the 
corresponding potentials and set 


p* h2 


V; = 
. 8M32 | 8M 


‘ t. ¢ley 
(V U) apes pap jou (7.4.41) 


e€ 
Vur = A- -B. 7.4.42 
HF Me p-uU ( ) 


Because of the suppression of the contribution of Vyp relative to that of 
Ve, we first treat Vp separately, and then consider Vy as a small perturbation 
to the former. Every energy level resulting by including the so-called fine- 
structure correction due to Vr is then split due to Vy. 

The energy levels of the hydrogen atom obtained in $7.2 are of the or- 
der ~ Mc?a?. The Vp interaction provides corrections to these levels which 


1° The 1/2 factor in the spin-orbit coupling in (7.4.38), referred to as the Thomas 
factor, was automatically obtained in (7.4.32) “without tears”, which is otherwise 
obtained, by some labor, by considering successive Lorentz transformations cor- 
responding to the motion of the electron as it changes its velocity in infinitesimal 
periods of time. This in turn implies that the spin of the electron precesses with 
angular frequency involving a factor 1/c’, to the leading order (v/c)?. 

11 This may be equivalently interpreted as the interaction of the magnetic dipole 
moment wu of the electron with a magnetic field set up by the proton, not however 
due to t,, in the electron’s rest frame. 


7.4 The Hydrogen Atom Including Spin and Relativistic Corrections 377 


are suppressed by a factor a”, i.e., are of the order Mca‘, and is said 


to provide the fine-structure of the atom as mentioned above. For low ly- 
ing states n = 1, n = 2, these corrections are about ~ 1.8 x 107‘4eV, 
~ 5 x 107° eV, respectively, depending on the total angular momentum of 
the electron. On the other hand, because of the further suppression by a fac- 
tor (M/M,), discussed above, the corrections due to Vy are of the order 
~ (M/M,)9pMc?a*, where gy is the g-factor of the proton, and the Var 
interaction is said to provide the hyperfine-structure of the atom. We note 
that (M/M,) = .545 x 1073 ~ 10.2”. For the low lying state n = 1, the 
hyperfine corrections are about ~ 1.46 x 107% eV, ~ —4.39 x 107~® eV for the 
spins of the electron and proton aligned or opposed, respectively. 

The expression for the energy levels for the hydrogen atom in (7.2.19) 
may be rewritten as 


MM, Ca? 1 M Mc?a? 
M+M,/ 2n?2 — Mp 2n2 


M De, M 24 
ear SMTA Wy ear 
Qn? Qn? 


(7.4.43) 


That is, the reduced mass gives rise to a correction to the levels, when initially 
determined in the infinite proton mass limit, by multiplying these levels by 
the corrective factor (1 — M/M,). 

Since the interaction Vp dominates over Vy, the question on the contribu- 
tion of the reduced mass effect on the fine-structure arises. The corresponding 
analysis is not as straightforward as one might think as it is involved with a 
relativistic treatment of the electron and the proton as well. The same rule 
as above approximately holds again and one may multiply the fine-structure 
splittings by the corrective factor (1 — M/M,), but also an additional cor- 
rection arises which is the same for a fixed value of the principal quantum 
number n of the order ~ 107“ eV or less, and will not contribute, or more 
precisely gives negligible contributions, to the computations of energy dif- 
ferences between levels with the same n. On the other hand, the correction 
(M/M,) xfine-structure is again of the order ~ 1077 eV or less. Accordingly, 
we will not dwell on the reduced mass effect in the fine-structure, as well as 
in the hyperfine-structure in the sequel. 

The interaction of the electron with radiation (the photon) produces a 
shift, the so-called Lamb shift, of the order ~ Mc?a° In(a) which for the low 
lying state n = 2, €=0 is about ~ 4.3 x 10-% eV. 

The fine-structure of the hydrogen atom is studied in §7.5. Every energy 
level resulting by the inclusion of the fine-structure correction is also split 
further due Vyp, providing the hyperfine structure of the atom. This is treated 
in $7.6. The Lamb shift, in a non-relativistic setting for the atomic electron, 
is worked out in §7.7. 
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Appendix to §7.4: Normalization of the Wavefunction 
Including Spin and Relativistic Corrections 


Upon introducing the 4 x 4 matrices!” 


o_ (I 0 (060 
po, aes) (at 
with properties, 
(V)'=7, ye-y, Py=-y7" (A-7.4.2) 
0\2__ (10 : 
()" = a 7) (A-7.4.3) 


writing i0/Ot for E, and introducing the interaction of the electron with an 
external electromagnetic potential U(x, t), A(x, t) 


BO!) Herd 
ina, => ihe, — eU(x, t) (A-7.4.4) 
p—-p- “A(x, t) (A-7.4.5) 


providing the so-called minimal electromagnetic coupling, the time-dependent 
Dirac equation (7.4.10) becomes 


Pik O Uw 
c Ot |® 


= Ly-( iaV <A) + McI 4 *| | (A-7.4.6) 


where J, in the latter, is the 4 x 4 identity matrix. 
The adjoint of the matrix equation (A-7.4.6) is 


ath 0 [wt ot] 4° = [wt of] | (in <A) Mer eee! 
c Ot c c 
(A-7.4.7) 
— 

where we have used (A-7.4.2), and the arrow on V means that the gradient 
operates to the left. 

Upon multiplying (A-7.4.6) from the left by —(ic/h) [U1 ®'] 7°, and (A- 
7.4.7) from the right by (ic/h)y° [U ©] Fe and adding the resulting equations, 
we obtain 


a 
© (yt ' -J= 3 
5 (W'U + 616) +V-J=0 (A-7.4.8) 


12 The Dirac equation will be studied in detail later. Here we introduce the mini- 
mum to discuss just the normalization of the wavefunction W in (7.4.19), (7.4.22), 
(7.4.32) to the accuracy needed there. 
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where 
J=c([Wi of] yy | (A-7.4.9) 


is the probability current density. 
The normalization condition becomes 


[ex (WIW + O16) =1. (A-7.4.10) 
From (7.4.18), 
1 1 
= -pwv4 A-7.4.11 
Me 8 o(3) eae 
which from (A-7.4.10) implies, by integrating by parts, that 
3. wt p? 


to the accuracy needed in §7.4, where we have used the facts that of = o 
and (o- p)(o- p) = p”. Hence, to the accuracy sought, we may write 


p” 


yielding to the normalization condition 


[x xy =1. (A-7.4.14) 
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According to (7.4.32), (7.4.41) the fine-structure of the hydrogen atom is 
described by the Hamiltonian 


H = 3 + U(x) + Vex) (7.5.1) 
where 
Ve(x) = eT) ia (W?U(x))+ dane Set, (789) 
aie am 2M c? 8M 2c? = 2M2c2 \ r dr i* 


with Vp(x) treated as a small perturbation, where, using the same 
reasoning as in (7.4.30), we have replaced (—p/8M°c”) effectively by 
— (En, —U)*/2Mc? in (7.5.2) to the leading order. 

We note that H does not commute with L and S. It commutes, however, 
with J’, L?, S’, J,, where J = L+S is the angular momentum of the electron. 
That H commutes with J? follows from the fact that 
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J =1?48?428.L (7.5.3) 


and that every term on the right-hand side of (7.5.3) does. Since Be Jy =0, 

we also have the easily verified property [S - L, J,] = 0 which is a statement 

of rotational invariance of the scalar product (about the z-axis). For a radial 

function U(r), V?U(r) is also a radial function and hence commutes with 

L. Actually, V?U(r) is proportional to 5°(x) (see (7.4.36)), and 53(x) is 

essentially 5(r)/4ar? when multiplied by a function f(r) of r in an integral. 
We are thus led to consider the wavefunctions 


W5,t,m;,.n 7, 9, 9g) = (9, O| 3,4, m5) Rne(r) (7.5.4) 


where R,e(r) are the radial wavefunction in (7.3.14) with ~ — M. Also 
|j,£,m,) are expressed in terms of the states |¢,m) |s, ms) via Clebsch-Gordan 
coefficients: 


l3,2,m;)= S>  [é,m)|s,mg) (é,m,s,me|j,€,mj) (7.5.5) 


M+Ms=Mj 


and are given in the appendix to this section, the details of which, however, 
are not necessary here. The quantum number s = 1/2, and for = 0, 7 = 1/2 
and for 240, 7 =€+1/2. 

Since |j, 2,m,) is a linear combination of the states |¢,m), for different m, 
the wavefunctions in (7.5.4) are eigenvectors of the hydrogen atom problem 
with eigenvalues —Me*/2h?n?. 

Undoubtedly, a reader of quantum physics presented at this level is famil- 
iar with the elements of perturbation theory? and that the leading correction 
to energy levels due to Vp is given by the expectation value of Vp in the un- 
perturbed states in (7.5.4) ie., by the application of first order perturbation 
theory.'* We note that for a given n there are exactly 2n? |j,0,m,;) states. 
To see this, note that for n = 1, we have ¢ = 0, 7 = 1/2, m; = +1/2, and 


hence we obtain two states. For n # 1, £= 0, j = 1/2, m; = +1/2, we 
obtain two states which we have to add to the states 4 0, 7 = €41/2, 
m,; = —j,...,j- The total number of the latter states is 
n—-1 1 n—-1 1 
— 9,2 
SPs) 2-3) + = =9 (7.5.6) 


thus confirming the above result. 
We first consider s-states, i.e., for which ¢ = 0. 


'3 Perturbation theory will be systematically developed later on in Chapter 12. 

14 Here we remark that since H commutes with J’, L?, S?, J., the perturbation Vr 
is diagonal in the states |7,2,m,) and so-called degenerate perturbation theory, 
to the leading order, reduces to elementary first order perturbation theory. 
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£ = 0 States: 


For the = 0 states m; = ms, and we may write for (7.5.4) 
1 
SR 
VAr , 
From (7.3.29), (7.3.36), (7.3.37), with — M, Z — 1, we have 


a? ( Met 3 
1/2 — 2, 
- O.ma) n (Saaz) | =| 
(7.5.8) 


Ora osannlts 6,6) = (r) |1/2,ms) . (7.5.7) 


(E, —U)? 
2Mc? 


7 (1/20. 


to the leading order. 
For the Darwin term (h?/8M?c?) (V7U) we have from (7.4.37), (7.3.32), 


h? 2 a? ( Met 


Finally an € = 0 state corresponding to no angular momentum, for the 
last term in (7.5.2), we may use the explicit expression given on the left-hand 
side of (7.4.38), i-e., 


(1/2,0,m,n 


1 —ife* . ( O 7) 
—{,55:|(VWU) X p| = Ss -U 7.5.10 
2M?2c2 UVP) 2M2c?2 (a ) Oxk ( ) 
with a summation over repeated indices understood. 

The expectation value of (7.5.10) in state |1/2,0,ms,n) may be explicitly 
carried out and there is no need to “guess” on what its value is as is often 
made in the literature. To this end, by using 


O vi d 
Ont (r) 7 ql) 
Jj 
= es (7.5.11) 
0 ak @ 
Bk enol") = = By nol) (75412) 
and that 
Fok Ag. 
Joo te le (7.5.13) 
y i 
we obtain 


(1/2,0,mayn 


Sager to 
—ihe dks (sa) dak 1/2,0,1.,n) 
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Pee ; 1 a) 6) 
= —ihe* (1/2,0|S*|1/2,0) [x Rno(r) (sa) Dgk nol”) 


jk O° 7.2 
= —ihee'!* (1/2,0|S*|1/2,0) “ >| oo © Rioir) 
0 


ihe2eIk ; 4 (Me?\* 
saa’ (1/2,0|5t[1/2,0) 5°" ( =) =0 (7.5.14) 


since e’/*§J* — ( and the latter is multiplied by a finite constant. We have 
also used (7.3.31), and the fact that R29(r) > 0 for r — oo. 
Unfortunately,!° many books have the Darwin term “missing” in the cor- 
responding expression to (7.5.2) and then use (7.5.3) to infer that the expec- 
tation value of the last term in (7.5.2) is finite and non-zero for € = 0 so that 
the first plus the third terms alone in (7.5.2) give the same expression as the 
one provided from a leading order expansion of the exact Dirac expression 


(see (16.6.18), (16.6.55)). The integral in question, as seen in (7.5.14), is zero 
2 

and the fine-structure for € = 0 is provided by — ea plus the Darwin 

term (h?/8M?c?) (V°U). The fine-structure correction to the hydrogen atom 


energy levels is, for = 0, j = 1/2, then given from (7.5.8) and (7.5.9) to be 


a? (a?Me2 3 
AFn 5.0 = = ( 572 )ft ak f=), (7.5.15) 


£ A 0 States: 


For ¢ 4 0, the Darwin term in (7.5.2) gives a zero contribution from the 
property (7.3.32) and (7.4.37). On the other hand as in (7.5.8), we have from 
(7.3.29), (7.3.36), (7.3.37) 


= | bog, n| o8 I) | 5 bong. = ~ (25) | a 
DoD pare eds n\n?) |€+1/2  4n 
(7.5.16) 
for p— M. 
Also from (7.5.3), we may solve for S- L 
ore ee 
S-L=5(J’-L?-s’) (7.5.17) 


use (7.3.38) for € 4 0, to obtain 


: 1 1d 
(i.t.mj.n ae (+50) S-L 
15 Fortunately, there are some exceptions to these books emphasizing that the inte- 


gral in question is zero, e.g., Bethe and Salpeter (1977), p. 60. A regularization 
was, however, used in their analysis to conclude that this integral is zero. 


isf,mj.n) 
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_ a? ( Met \ [(9(f +1) — £2 4+1) - 3/4] (7.5.18) 
~ 2n \ 2h2n2 (f+ 1/2)(€+ 1) = 
where j = £+1/2. 
From (7.5.16), (7.5.18) we have 
a? (a?Mc? 1 3 
AEnj2= ~~ ( Bre ) - +172 = ; (7.5.19) 


Upon the comparison of (7.5.19) with (7.5.15), we note that the former 
coincides with the latter if we formally set ¢ = 0, 7 = 1/2 in the former. That 
is, (7.5.19) holds for 7 = €— 1/2, 2=1,...,n—-—1, (n 41); 7 = €4+1/2 for 
£=0,1,...,n—1 as well. 

Using the notation né;, we see from (7.5.19), for example, that the levels 
2P, /2/2P3/2 are split as well as the levels 3P,/2/3P3/2 and 3D3/2/3D5/2, 
where P, D correspond to ¢ = 1,2, respectively, The shift between 2P3/2 and 
2P,/2 is about 10.95 GHz in frequency unit. 

On the other hand for fixed n, j the fine-structure correction (7.5.19) 
shows that corresponding energy levels which differ by the ¢-value are degen- 
erate. For example, the levels 2.9) /2/2P,/2, with S corresponding to = 0, 
are degenerate, while experimentally it has been confirmed not to be so. This 
splitting is, however, quite elegantly predicted by quantum electrodynamics 
and is referred to as the Lamb shift. The latter is reduced by a factor of a 
relative to the fine-structure, and also involves a in a non-trivial way (such 
as on the logarithm of a). A non-relativistic derivation of the Lamb shift, i-e., 
with the atomic electron treated non-relativistically, will be given in §7.7. 


Appendix to §7.5: Combining Spin and Angular 
Momentum in the Atom 


In (7.5.5), we combined the spin of the electron with the angular mo- 
mentum. The states |j,2,m,) in (7.5.5) may be written down directly from 
Table 5.3, below (5.5.35), by making the substitutions in the latter: 

jivé,my 34m, m2 > ms, M> ™M;. (A-7.5.1) 

Hence the states |j,2,m,;) are explicitly given by (¢ ¥ 0): 

1 |€,m, — 1/2) Vl+1/2+m, 
pany = = (A-7.5.2) 
(2€+ 1) | lem, +1/2) (0+ 1/2—m; 
for 7 = €+1/2, 


1 — |€,m, — 1/2) J£+1/2—m; 


py) = ———— (A-7.5.3) 


(2€+1) | em, +1/2) 041/24 m,; 
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for 7 = @-—1/2. 
Accordingly, we may write 
(9,6|9,0,m;) = MOE Ty Vets 07218,9) 0 
VJl£+1/2—m,; 0 
Ye A-7.5.4 
+ (f+ 1) @,m;+1/2(9, &) 1 (A-7.5.4) 
for 7 = €+1/2, 
(9,63, £,m;) <* (20+ 1) Ye,m,—1/2(9, ) 0 
VJ£+1/2+m,; 0 
re (on 0 A-7.5.5 
al (041) 2m; +1/2(9, &) 1 ( ) 
for 7 = €—1/2. 


A relation which follows from (A-7.5.4), (A-7.5.5) and will be useful later 
in Appendix to §16.6 in an exact treatment of the Dirac equation in the 


bound Coulomb problem is the following. Let 
n=—~ =%= (sin 0 cos ¢, sin @ sin ¢, cos 6) (A-7.5.6) 


Ix! 


then 


o°X(0,6|j,C=jF 1/2, m5) =~ (6,¢|7,2=j= 1/2,m;) * (A-7.5.7) 


The spin I of the proton may also added to J, 
F=I+J (A-7.5.8) 


as done in (7.6.4) in the next section and the eigenvectors |f,7,0,m,) of 
the commuting operators F’, J?, S*, I?, L?, F,, (see Problem 7.17), where 
h?f(f +1), and hm, are the eigenvalues of F? and F,, respectively, are 
similarly constructed (see Problem 7.18, see also §5.9). 


7.6 The Hyperfine-Structure of the Hydrogen Atom 


The hyperfine-structure of the hydrogen atom is provided by the interac- 
tion term given in (7.4.42): 


e 
Var = -~—A-p-u-B 7.6.1 
HF Me p-wuU ( ) 


where A is given by 
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A=-—u,x V (=) (7.6.2) 


and B = V x A. The magnetic dipole moment uw, of the proton is defined 
in (7.4.40). The first term on the right-hand side of (7.6.1) may be written 
in various forms: 


€ ie*h n7>4( 01\ O 
EES Rats Sa 8 SE os eee ee en 
Me ~ oM2 9°" ( ) 


1 
IMM, 9? 73) "© 


ek a ee F (7.6.3) 


representing an interaction between the magnetic dipole moment pu, of the 
proton and the orbital angular momentum L of the electron in the atom. The 
—u-B term denotes the familiar interaction between the magnetic dipole 
moment u of the electron and the magnetic field set up by the proton due to 


Hy. 
We add the spin I of the proton to J and define 
F=J+I (7.6.4) 


and note that F’, J, S*, I?, L?, F, commute in pairs (see Problem 7.17). 
The following operator equality is useful 


I-J = ; [F? — 1° —J7] (7.6.5) 


and will be used in our analysis of the hyperfine-structure. 
For a given fixed pair (n,@), for € 4 0, the splitting due to the fine- 
structure between successive levels 7 = £— 1/2, 7 = €+1/2 is given by (see 


(7.5.19)) ees 
§(AEn, 36) = n ( 2n? ) (e+ 1) Kee) 


and is larger by a factor of (M,/M) relative to the hyperfine-structure effect 
as discussed below (7.4.42). Also for a fixed n, € = 0, we have only one j 
value = 1/2, with the fine-structure correction given in (7.5.15). Finally the 
differences between the energy levels in (7.2.19) for successive n’s are much 
larger than the energy splitting due to the fine-structure and the hyperfine- 
structure for the n-value in question. Accordingly, we define the Hamiltonian 
as obtained from (7.4.32) for a fixed triplet (n, J, @): 


Anje = Eye + Ane (7.6.7) 


nN. 


where 
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Hae = >> |n,5,4my) (n,5,6m;|H° 


7 , 
mi,m; 


n,j,£,mj5) (n,j,e,mi| (7.6.8) 


2 2 
p e€ 

H®°=—_-——+YV;, 7.6.9 
mM oro (e822) 


(ng§ Gi mM; |H° 


n,j,e,m';) = Om; 4m’ En,j,0 (7.6.10) 


with E,,;,¢ denoting the energy levels of the hydrogen atom (with u — M) 
including the fine-structure correction in (7.5.19). Also 


Aye = S- |7259, fi m5) Grigg m3 | Var 


- t 
mi,;mi 


n, 9, &, m';) (n, J, £, m,| (7.6.11) 


(xe ln 9,05 my) = W3,t,m;,n(1, 9, ) (7.6.12) 
with the latter defined in (7.5.4). 
One should note that (n, j,e,m; | Vee In, De, m'.) is an operator depending 
on I — the spin of the proton (see (7.6.1)—(7.6.3), (7.4.40)). 
Now we consider the states 


In, f,j,2,mz) = D> |n, J, £, m5) |¢, ms) (n,j,€,m;,1,mi|n, f,9,£, mf) 
(mit+mj=ms) 
(7.6.13) 
where 
(x|n, f,9,4,my) = (9,6|f,5,4 mp) Rne(r) (7.6.14) 


and h? f(f+1), hms, h?i(i+1), hm; are the eigenvalues of F?, F,, 1’, I,. The 
states |f,7,@,m,) are readily constructed out of the states |7, 2,m,;) given the 
Appendix to §7.5 and of the states |i,m,;) (see Problem 7.18), the details of 
which, however, are not necessary here. The important thing to note is that 
they are linear combinations of the states |j,2,m,) |¢,m:), and note that the 
states |n, f,j,4, my) are eigenvectors of Bi with eigenvalues Ey, ;,¢ as given 
in (7.6.10). 


£= 0 States: 
We consider the states |n, f,1/2,0,my,) = |n, f, my) in (7.6.13) for 2 = 0, 
where f = 1,0. 


Using the first equality in (7.6.3), we have for the matrix elements 


ie*h 
2MM,c2 °° 


ijk 
ven [ox Rnyo(r) (7) oe RG 
Ar 


(n, fms 


-s—A- pln, f,ms) = 


= (n, fm "In, F.my) 
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=0 (7.6.15) 


directly from the identical evaluation of the integral in (7.5.14). 
The second term on the right-hand side of (7.6.1) may be conveniently 
written as 


1 O 

= i_igk k 

—u-B pied agit 
0 O 1 
= ae kij .klm “ 
(u'np) ete Oxd Ox™ (=) 
= w(t \aaie Oo 
=H, €E ps, Bal Aci r (7.6.16) 


with a summation over repeated indices understood. Upon using the defini- 
tions (7.4.33), (7.4.40) of u, u,, respectively, we obtain (e < 0) 


(n, f’,m’,;|-w-B 


(n, f’,m', |S 


n, fmf) = n, fms) 


e2 
ae 
2MM,c 


cy eee vary pl a0f1 
* tn Ceol") c ™ (*) Ox? Oxi \r 


59 2 1 
n, f, mf) An 3 [ox R24(r)V? (*) 


r 


(7.6.17) 


2 
€ / , 177 
~~ 2MM,2" (n, fim; |S 


where we have used the spherical symmetry of R29(r). From (7.4.36), (7.3.31), 
with Z > 1, p — M, we obtain 


4 e2 Me2\° 
(n, f/,m's|—H- Bln, fym5) = 3n3 aria ( [ ) 


x (n, f’,m' |S +1 


n,f,myp). (7.6.18) 


From (7.6.5), or simply by the addition of two spin 1/2’s, we have 
2 


h 3 
n, fmf) = 67 5°Om'-ms 2 ie + 1) | (7.6.19) 


(n, f',m) |S -I 


with f = 1,0, and Vp is diagonal in f, my, i-e., Fy 1/2, in (7.6.11) is diagonal 
in |n, f, my) for a fixed n. 

From (7.6.15), (7.6.18), (7.6.19), we then have for the hyperfine-structure 
corrections for s-states: 


4a? (M 2M 3 
ABnY = 3 (Fz) m(* - ) [r+ - 3), £=0 (7.6.20) 


2n? 
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where f = 1 or 0 corresponding, respectively, to the spins of the electron and 
proton aligned or opposed. 

The frequency of radiation (~ 1420 MHz) arising from the transition from 
the excited state f = 1 (the triplet state) to the ground-state f = 0 (the 
singlet state) for n = 1, is one of the most precisely measured quantities 
in physics and corresponds to a wavelength of 21.1cm, well known in radio 
astronomy. 


£ A 0 States: 
We evaluate the matrix elements 
co £29; £, my | Ange 


of the perturbation H;,,, in (7.6.11) to H}),, defined in (7.6.8). 
To the above end we first note that we may rewrite Vyp in (7.6.1) as 


e? I-L afl _, 0 O [1 
= LS.I ‘Ti — .6.22 
We MMe | ee (=) ol Oat bai 6] os 


n,f,9,4 mys) (7.6.21) 


where we recall the relation (7.4.36) for V7(1/r). Since R?,(r), vanishes for 
r = 0 if € 4 0, (see (7.3.31)), we may omit 5°(x) terms arising from the 
last two terms on the right-hand side of (7.6.22). That is, for 24 0 we may 
effectively omit the second term in (7.6.22) and effectively write the last term 
as 


S-I I-x S-x 


3 +8 


(7.6.23) 
That is, for 04 0, we may effectively take for Var the expression 


7 1 


V; _ e€ 
HE 9MM,c2"" 73 


[[-L—1-S+3I-n S-n] (7.6.24) 


where n = x/|x| = (cos dsin 0, sin dsin O, cos 0). 
Now we use the important identity in (5.8.47) 


1 


VV aatand _ 
(é,m'|n'n'|é,m) = Gey aaa) 


(€,m'| (20? + 2€ — 1) 5” 


1 he _ 
Mis (L'L) + LIL") |e,m). (7.6.25) 


These matrix elements allow us to compute the _ expressions 
(n, j, £,m; |Var|n, 9, 2, m’,) in (7.6.11) by using, in the process, the expan- 
sion (7.5.5) for the states |n,7,2,m,) involving |¢,m) states for different m. 
We may then replace Vp in (7.6.24) effectively by 


2 


1 
[[-L + (3ag — 1)1-S + 3b,D] (7.6.26) 


e 
Viv = ——— = 
HF ~ 9M M229? r3 
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where 1 
D=,(1-L L-S+L-S I-L) (7.6.27) 
(20? + 2€—1) 
= 7.6.28 
oe (20 — D(2L43) wee) 
1 
by = (7.6.29) 


(20 — 1)(20 +3) 


Upon using the operator relation for S-L in (7.5.17), we note that in com- 
puting the matrix elements in (7.6.21) we may replace S-L by its eigenvalues, 
and replace D in (7.6.27) by 


pD=lig+ 1) e@+1)-5 I-L. (7.6.30) 


That is, we may replace Vy, in (7.6.26) by 


e Ip 
Vite = IMM, Aa [(3a¢ 1)I -“S4 cel - L] (7.6.31) 
vay (j +1) — &(@ +1) —3/4 
jG +1) — (e+ 1) - 
r= 7.6.32 
ee (2 — 1)(2¢ +3) eee) 
Now we use the operator identities (see Problem 7.20): 
1 2 2 a 271% iq2 4 2 2 2 
55 (9, [5?, £4] = 2 (0?z' + L's?) — 25 (J? +0? -8?) (7.6.38) 
1 2 2 4 2 oi 472 4 2 2 2 
= |3, [5?, s']] = 2 (02s' + sis?) - 27) (PP - 1 +8?) (7.6.34) 
to obtain 


1 


5 (J, [J?,1-L]| =2 (0? 1-L+1-L J?)—21-J (J? +L? —8?) (7.6.38) 


ah [5?,1-8]] =2 (0? I-S+1-8 J?)-21-5 (J? - 1? +8"). (7.6.36) 


Upon using the operator identity for I- J in (7.6.5) and taking the ma- 
trix elements of (7.6.35), (7.6.36) (see Problem 7.21) between the states 
CEagae: m,| and |f,j,£,mp) one obtains after some algebra!® (suppressing 
n) 


‘6 The equalities (7.6.37), (7.6.38) are the contents of the Wigner-Eckart Theorem 
in 85.7, §5.8 (see Problem 7.22). 
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h? GG +1) 4+ €(€+1) - 3/4] 
a ar +1) 


(f',5,2,m' |L-L 


x [FF +1) -— 9G +1) — 8/4] Op pS ms (7.6.37) 


ies aoe _ iG +1) — 441) 43/4] 


x [FF + 1) — 99 +1) — 3/4] Op Foi my (7.6.38) 


where we have used the identity 


h2 
eee 


That is H} je 18 diagonal in f, ms. 
We recall the values taken by 7 are 7 = €+1/2. Accordingly, for 7 = @+1/2 


jG+1) + C41) — 5 = e043) (7.6.40) 

a 3. 5a 
HG +I) — ME +I) + 7 = 5 (2€+ 3) (7.6.41) 
j(j+1) e+) -F=¢ (7.6.42) 

and for 7 = €— 1/2, 

jG +1) 4+ e(€41) -= (20 —1)(€+1) (7.6.43) 

- od 
JG +1)— el +1) + 5 = —5(2€- 1) (7.6.44) 
jG+1) —Ml+1)-F= (+0), (7.6.45) 


Upon substituting (7.6.40)—(7.6.42), (7.6.43)—(7.6.45) in turn in (7.6.37) 
and (7.6.38), we obtain directly from (7.6.31), for the matrix elements 
(7.6.21), in reference to (7.6.11), 


(n my j,€,m | Hn, tt Gbnp) 


n 10) (+1) 


e2 
= of Om! I 2M Myc DATAT 12 9P (ne| L|— 
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ye pe +) 55 + 1) — 3/4] 
2j(j +1) 


n,£) are given in (7.3.38), with Z > 


(7.6.46) 


where the matrix elements (n, ¢| (1/r*) 
1, u— M, for 24 0. 
All told, we obtain for the hyperfine-structure correction 


orf M. a’ Mc?\ [f(f +1) - jG +1) - 3/4] 
AEN fit = 5p Gr) Ip ( on ) (CEST CES (7.6.47) 


If we formally set @ = 0, 7 = 1/2 in the above formula, we see that it 
coincides with the expression in (7.6.20) for ¢ = 0. That is, (7.6.47) applies 
for all £=0,...,n—1. 


7.7 The Non-Relativistic Lamb Shift 


This section provides a glimpse of the fascinating world of quantum field 
theory, namely that of quantum electrodynamics. More precisely, this section 
deals with the splitting of energy levels of the hydrogen atom due to the 
interaction of the atomic electron with radiation, that is, with the photon. 

An atomic electron has an average speed v of the order of e?/h = ac, 
and is thus much smaller than that of the speed of light c. We shall therefore 
content ourselves by treating the electron non-relativistically only and not 
dwell on the relativistic regime here. We will also see below that a detailed 
treatment of radiation is not necessary for the problem at hand and the 
following introduction suffices. 


7.7.1 The Radiation Field 


The photon may be simply described as follows. Let |k, 4) denote a state 
of a photon with momentum /fik, energy fi|k|c and polarization described by 
a polarization three-vector e,(k), with taking on two possible values, say, 
1 and 2, such that 

k-e)(k) =0 (7.7.1) 


and 
e)(k) “ey (k) = Oxyd (7.7.2) 


A, = 1,2. Without loss of general of generality we consider real polarization 
vectors. 

With k/|k|, e1(k), e2(k) representing a complete set of vectors in the 
three-dimensional Euclidean space, we may write a completeness relation as 
follows a 

kik = Sy ts eed 
Oy = Ke +eje} + ese, (7.7.3) 
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i,j =1,2,3. In particular, this gives us the property 


a BR 
Sa =o" — TE (7.7.4) 
A=1,2 


which will be useful later on. 
One may introduce a completeness relation over the momenta and the 
two polarizations, specified by A, for a single photon, by 


Y fae TE |k, X)(k, A] =1 (7.7.5) 


(see also Problem 7.24). The factor 1/2|k| comes from relativity for the ever 
relativistic particle — the photon. It arises as follows. With the energy- 
momentum constraint between energy, denoted by hk°c, and momentum hk, 
we may instead integrate over k and k° with the measure (k° > 0) 


© (k) aka? 5( (k°)” — |?) (7.7.6) 


which leads immediately to the factor 1/2|k| in (7.7.5). The measure in (7.7.6) 
turns up to be relativistically (so-called Lorentz) invariant. 

We introduce a radiation (real) field ARap(x), as well as the Hamiltonian 
of the atomic electron and radiation system given by 


Pe 
[| 
where Ho Rap is the free Hamiltonian of radiation whose detailed structure, 
as will be seen below, is not needed for the problem at hand. For simplicity 


we omit the spin of the electron (see also Problem 7.27). 
The Hamiltonian H in (7.7.7) may be rewritten as 


1 € 2 
ae (p = <Apap) + Ho,Rap — (7.7.7) 


2M 


H = Ho+ Ao rap + Ay (7.7.8) 


where Hc is the Coulomb Hamiltonian 


2 2 
p € 
i Me 
°C 9M {xl ey) 
Hee A px Agee) + mee (7.7.10) 
b= = Me RAD*P tT P°* 4ARAD 9M RAD: e 


We treat Hy as a perturbation to Hc in H. 

To zeroth order in Hy, the radiation field Apap may create or destroy a 
single photon. Let |0) denote the no photon (vacuum) state. Then we may 
define the matrix element of a single photon state as 
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(0|Arap(x)|k, 4) = V4m Viice**e(k). (7.7.11) 


Due to the orthogonality of the vacuum and a photon state, the vacuum 
expectation value of Apap(x) evidently satisfies 


(0|Arap(x)|0) = 0. (7.7.12) 


Also the matrix elements of Agap between the vacuum and states containing 
more than one photon are also zero. 
Upon taking the complex conjugate of (7.7.11) we obtain 


(k, \|ARap(x)|0) = V4 Vice **e, (k) (7.7.13) 


for real polarization vectors e,(k). Here Apap(x) creates out of the vacuum 
a single photon. 

To all orders in Hj, and in a relativistic treatment, (7.7.13) is simply 
replaced by 


(k, AJARap(x)|0) pay = V4 Vic \/Z3 ee, (k) (7.7.14) 


where 0 < Z3 < 1, is a constant and is referred to as the photon wavefunction 
renormalization constant. The reason why (7.7.13) is to be modified by the 
presence of an additional multiplicative factor < 1 in (7.7.14), in the general 
case, is easy to see. In this latter situation, Apap may create out of the 
vacuum, in addition to a photon (for Z3; 4 0), electron-positron pairs and 
other particles. (See also Problem 7.25). The amplitude (7.7.13) is thus, in 
general, reduced by a factor (= /Z3), as given in (7.7.14), such that the 
probability that Apap creates all other possible particles as well adds up to 
one. 

Now we turn to the expression in (7.7.11). The factor 47 in it is in- 
troduced because of the definition of the fine-structure constant a = e?/he, 
in (7.4.34), adopted here. If the Coulomb potential in (7.7.7) is taken as 
—e?/4r|x|, then (7.7.11) does not involve the V4 factor and the fine- 
structure is defined as e?/47he. [In field theory, the latter definition is usually 
adopted.] The factor Vhic arises because of dimensional reasons. 

As far as the second power AR p(x) of ARap(x) in (7.7.10) is concerned, 
we will only encounter its (photon) vacuum expectation value to zeroth or- 
der in e in our leading order perturbative treatment of Hy. That is, only 
(0|Azap(x)|0) is encountered. The latter is formally given from (7.7.5), 
(7.7.11), (7.7.12) and the property that the matrix element of ARAp between 
the vacuum and states containing more than one photon are zero, by the 
expression 


3 
e? (0|ARap(x)|0) — ?> fase | (0|A(x)|k, A) it (7.7.15) 
r 


to the leading order. 
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This formal expression, although an infinite constant (as is easily checked) 
is common to all the energy levels and hence does not contribute to the 
computation of the energy level differences we are seeking. More precisely, 
after carrying out a so-called renormalization, as a physical requirement due 
to the ever presence of radiation accompanying a charged particle, this term 
gets altogether cancelled out in the precess as will be seen through (7.7.37)— 
(7.7.40) below. 

For the free Hamiltonian Ho. Rap of radiation in (7.7.7), we have, by def- 
inition, 

Honan |k, A) = Ailk| c |k, ») (7.7.16) 
corresponding to the energy of a photon of momentum /ik. 

We also note from the orthogonality relation of the polarization vectors 
e)(k) of a photon and of its momentum /fik, as given in (7.7.1), the commu- 
tativity relation 


p+ (0|Arap(x)|k, A) = (0|Arap(x)|k, A) + p eal 


holds true (p = —ihV) by using (7.7.11). 


7.7.2 Expression for the Energy Shifts 


Let |y,) denote formally the state of the atomic electron in the absence of 
radiation. That is, the |w,,) are the eigenvectors of Hc, with eigenvalue E,,, 
where n denotes the principal quantum number, suppressing for simplicity of 
the notation other quantum numbers. 

In the present case, we are working in a larger space due to the presence 
of radiation and the state |2,,) above, corresponding to such a larger space, 
will be written as |w,) |0) = |%n;0) emphasizing, in the mean time, that it 
involves no photons. This state is often referred to as the bare state of the 
electron in the atom. 

We assume that in the presence of the interaction Hy with radiation, the 
atomic electron may be described by some state | ¢,,), in this larger space, at 
least for small Hj, corresponding to the same n, with an eigenvalue denoted 
by En. 

One may then write 


(Ho “ Ho Rap) | Wri 0) =E, | Wri 0) (7.7.18) 


Also for the state |¢,) |k, A) = |Wnjk, A), 


(Ho + Horan) |nik, A) = (En + hw) |Win; k, A) (7.7.20) 


where 
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w = |kIc. 7.7.21 
| 


Upon multiplying (7.7.19) from the left by (w,,;0| and using (7.7.18), we 
obtain 


(En — Ew’) (Wn/30| bn) = (Wn: 0 [Eh bn). (7.7.22) 
In particular, for n’ = n, this gives 
AE, = ——— 7.7.23 
(Wns 0] On) ne) 
where 
1 ee en (7.7.24) 


On the other hand, by multiplying (7.7.19) from the left by a single photon 
state (k, A|, we generate a state corresponding to Ho only given by!” 


1 


(k, A] én) = Shes) 


k, \|Hi| bn). (7.7.25) 
From (7.7.11), (7.7.12), the fact that the matrix element of ArRap be- 
tween the vacuum and multi-photon slates are zero, and from (7.7.25), (7.7.5), 


(7.7.1), we have up to order e? in Hy, the following key matrix element plus 
the term in (7.7.27) to follow: 


e€ 
= = (Uni; AR n 
Me (Wn; 0 |p RAD(X)| bn) 
A4nhce? d°k 
M22 = lm )32|k| 


x (wn 


obtained by simply inserting 1, given in (7.7.5), between 
(p- Arnap +Arap <p) and |¢,), and making use of (7.7.11), (7.7.17) and 
(7.7.25). 

The Apap term in (7.7.10) already involves e?, and hence we may imme- 
diately write to this order 


ik-«x 1 —ik-x 
jo ey(k)eiX (he hoy k e)(k) “p 


vn) (7.7.26) 


e2 
(tn; 0 |ARap(X)| on) = Me (%n;0|ARaD(X)| dn; 0) 


TE 


ae (0|Agap(x)|0) (7.7.27) 


1” The method to follow is a slight variation of perturbation theory. The latter will 
be studied afresh in Chapter 12. 
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and is independent of n (and related quantum numbers). The formal expres- 
sion on the right-hand side of (7.7.27), although infinite (see (7.7.15)), is 
common to all the levels (see (7.7.23)), and hence will not contribute to the 
differences in the levels, and it may be omitted in the analysis.!® On physical 
grounds, however, this integral cancels out, as will be seen below, when defin- 
ing the concept of a free electron, accompanied by the ever present radiation 
in the process of renormalization as mentioned before. 

Since the expression in (7.7.26) (as well the one in (7.7.27)) is already of 
order e7, as originating from Hy, we may set (%p;0|¢,) in the denominator 
on the right-hand side of (7.7.23) equal to one. 

Making the change of notation AE, — 5E,,, for having omitted the term 
in (7.7.27), common to all levels, (and for working up to e? in Hj), we then 
have from (7.7.26) with n’ = n, and (7.7.23) 


ah? 
ohn = TMB 
d°k ik-x 1 —ik-x 
Xf Te (fb en00e Gee age ten Pn). 
r n 


(7.7.28) 


If we introduce the resolution of the identity of Hc (see (1.8.18)), then 
we may formally write 


1 1 
@.=feoka ~ lan Sate) dPu,(£) (7.7.29) 
and singularities, in general, develop for hw = E, — E > 0. For FE = Fi, 
for example, and n = 2, the system may decay to the state n = 1, with the 
emission of a photon of energy fw as just given. Accordingly, we interpret the 
meaning of the singularities arising from the denominator!? E,, — Ho — hw, 
appearing in (7.7.28), as follows. We add to it a tie term with « > +0. The 
shift then acquires from (7.7.28) a negative imaginary part, as will be shown 
later by using in the process that 
1 


ln ea 15(En — Ho — hw) (7.7.30) 


and one may write ; 

SE, = Re (SE) — so (7.7.31) 
with [ > 0. Thus a state, specified by n and related quantum numbers, will 
develop in time by acquiring an exponential damping factor 


'8 To be rigorous, one may, in intermediate steps, insert a large (so-called ultravi- 
olet) cut-off A, with h|k|c < A in the integral in (7.7.15) before considering the 
limit A — oo. 

19 See also the interpretation of such singularities of Green’s functions in Chapter 9. 
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Sy oe 
exp zt ( st) =e = (7.7.32) 


in t > 0, leading to the decay of excited states, as described above, for a 
correct interpretation of the vanishing of the denominator in (7.7.28). This 
will be taken up in §7.8. [See also §3.5.| 

The real part Re(5£,,) in (7.7.31), corresponding to the energy shift, we 
are seeking, then is obtained by taking the principal value of (FE, -—Ho—hw)~ 
and this should be understood in the sequel. 

Since 


2 2 
ik-x { P vvik-x _ (p — hk) 
e + Uclx)) e a aera Uc(x) (7.7.33) 


and for any function of [p?/2M + Uc(x)], where with Uc(x), in this case, 
given just to be —e?/|x|, we have 


1 1 


ik-x —ik-x 
aaa = . . 4 
e o-= Fu E, Cao Ug= hw (7.7.34) 
in (7.7.28). 
Thus we may rewrite (7.7.28) as 
ah? d°k 
dE, = 
An? M2 a, |k| 
« (ve forede ex) Pin) 
n "&* Xr ° te f+ 
(Bs — H+ By — BE ha) 


(7.7.35) 


This expression is not complete by itself, and before evaluating the energy 
shifts, there are various physical points that will be considered first. 

The corresponding expression to (7.7.35) in the absence of the Coulomb 
interaction, that is, for the electron in the presence of radiation only will be 
worked out first. 

In analogy to (7.7.22), we then have 


(<() - 2) (w'so|p) = (wom) (7.7.36 


and by including the term in (7.7.27), we obtain, to the leading order, 


ah? 


2 
e€ 
= sqq@ (0 |Afap(x)|0) (27h)*5° (p' — p) + Fo 
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Sat Per rar =yehP) 


2M 
(7.7.37) 


In the appendix to this chapter, it is shown that for a free electron?° for 
Hy # 0, i-e., in the ever presence of radiation, (7.7.37) gives rise to a shift in 
the energy of an electron and, in particular, its mass gets shifted from the 
mass parameter appearing in p?/2M in the original Hamiltonian. This neces- 
sitates to renormalize the theory. In simplest form, this amounts to adding 
an operator 5H, to the original Hamiltonian such that the corresponding 
expression to the right-hand side of (7.7.37) is zero when computed with the 
modified Hamiltonian and the mass now appearing in the theory represents 
the physically observed mass of the electron. Related details to these are 
spelled out in the appendix to this section. Here it is sufficient to note that 
5H, for the problem at hand, is formally given by 


a 2 3 
Site = —gonag Df Tey” ag : e,(k)-p 


ae B — SeE — he) 
TT ar (0|ARap(x)|0) (7.7.38) 


of course working to lowest order in e? in Hj, and the modified interaction 
Hi be comes 
Hi = H+ 5H. (7.7.39) 


The last term in (7.7.38) is a c-number, while p in the first term is an operator. 
With the above definition, then the electron stays stable, at least to order 
2 in Hj, retaining its kinetic energy p?/2M as directly seen from (7.7.37), 
when Hy is replaced by Hj as defined in (7.7.39), (7.7.38). By doing so, the 
mass M appearing now in (7.7.39) represents the physically observed mass 
taking the ever present radiation accompanying an electron into account. 
Also note that the c-number in (7.7.27) will automatically cancel out when 
working with the interaction Hamiltonian Hy. In the literature 5H, is referred 
to a counter-term and the physically observed mass is referred to as the 
renormalized mass. 


7.7.3 The Lamb Shift and Renormalization 


The shift in ene is now to be replaced by 


Ren d°k 
bE, = =" 7 ik (bn p: ey(k) 


20 By a free electron, of momentum p, it is meant that its energy should be equal 
to p?/2M. 
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1 
: (z Piety Bt, 2 (BAO 2 Fy By — Pap — hw 
n ct 2M 2M ) (fn 2M 
x ey(k) + p |v) 
ah? [? : 
= (up| p+ ey(k) 
iPM? 2 ik| [(8z) + ke hile (1 + HIS) 
1 
x (Ho — En) e,(k) - p|{n) 
[Ba — He + (8) - hike — alle (1+ 2S)] 


(7.7.40) 


where Ren in 5E Re" stands for renormalized. We are interested in the cases 
n= 2 in (7.7.40). 

By counting powers of |k| in the integral on the extreme right-hand side 
of (7.7.40); for |k| — oo, we note that the integral in question is convergent 
at high energies A|k|c — oo. Although this integral in (7.7.40) is convergent 
at high |k|,?! having started with a non-relativistic Hamiltonian, and hence 
in a non-relativistic setting, integration over high energies exceeding the (rel- 
ativistic) rest energy Mc? of the electron is not justified. Accordingly, we 
restrict the integration over k in (7.7.40) for energies h|k|c smaller than Mc? 
by providing an upper bound cut-off at Mc? as an upper limit to the integral 
in question and as a natural cut-off for the applicability of the non-relativistic 
treatment. The main values taken by the atomic electron |p| are of the order 
aMc, and hence (|p|/Mc) Alk|c, in order of magnitude, is suppressed by a 
factor of @ relative hlk|c. 

Accordingly, one formally replaces (7.7.40) by 


hn d*k 
bE Ren == a / ——— n . k A = En 
n A4n2 M2 ik] (Alk|c) (a |p ey( )( Cc ) 
(k|<Mc/h) 
1 


* (E, — Ho — hilk\c) ey(k) +p |~n) (7.7.41) 


or with the integration expressed in terms of the photon energy hw, we have 


Mc? 
Ren _ a . n 
bE, ~~ Aq? M22 a, d(fiw) dQ (Wn| p e)(k) (Ho E,,) 


21 The convergence has resulted by retaining the exponential factors in (7.7.26) 
which provide additional powers in 1/|k|, as seen from (7.7.34), for |k| — oo in 
examining the integral in (7.7.40). This was particularly emphasized by Au and 
Feinberg (1974), Grotch (1981). 
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1 
x CE a TT >P|Un) (7.7.42) 
which is known as Bethe’s non-relativistic approximation.?? We note that 
the values taken by (Hc — E,,) may be comparable to hw, in its integration 
range, and both terms should be kept in the denominator in (7.7.42).?° 
We may sum over A in (7.7.42) by using (7.7.4). Also to carry out the 
Q-integral, we note the following angular integration 


ab 
foo (s" _ ~~) = oe (7.7.43) 


Therefore, by finally carrying out the (Aw)-integral, we obtain 


2 a Me? 
5 pRen — Alec 
n (v Ae “(ae=m) 


~ 3m M22 

where we have replaced p(Hc — E,,) by Ip, Hc] in (7.7.44), since (W,| (Ho — 
E,,)p = 0, and noted the large energy scale provided by Mc?. 

The right-hand of (7.7.44) involves the Coulomb Hamiltonian Ho, the 
hydrogen atomic functions w,, and the eigenvalues E,,, which are all known. 

We are particularly interested in the splitting of the levels 24/2, 2P,/2. 
These levels are degenerate not only in the Schrédinger theory but in the 
Dirac theory as well, in the absence of radiation. 

The following expression, involving the above mentioned known quanti- 
ties, has been evaluated numerically by several authors?* 


vn) (7.7.44) 


(2,| [p, Ho] «In (245) P22) 
(2, 0| Ip, Hc] *p |2, 0) 


= 2.815¢.0 + 036¢,1 (7.7.45) 


where n = 2, Ry stands for the Rydberg energy equal to Me*/2h? = 
Mc?a? /2. 
Finally by using (7.4.37), it is readily shown (see Problem 7.26) that 


(n,£,m|[p, He] «pln, £m) = 2me?h? |Wrem(0)|? 


2M3 4 
ie — ade (7.7.46) 


where we have also used (7.3.32) for Z = 1. 


?2 Bethe (1947), in which the expression in (7.7.44) was first derived. 

23 Actually, Hc — En provides a lower bound cut-off to the hw-integral signalling 
the fact that the Coulomb potential cannot be considered as a perturbation in a 
perturbation series expansion, in the analysis of the Lamb shift, as seen by the 
logarithmic singularity in a in (7.7.47). 

*4 Tn particular by Bethe et al. (1950), Schwartz and Tiemann (1959). 
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Equations (7.7.44)—(7.7.46) then lead to the following expression 


7 ae 2 
SERS = as @ aM (in (3) - 2381) deo + 035, (7.7.47) 


giving for the Lamb shift 
SESE - SERS 


; = 1043.5 MHz (7.7.48) 


which compares favorably well with the experimental value of ~ 1058 MHz.?° 
There are many other small corrections which contribute to the shift of 
energy levels and a more detailed treatment will not only necessitate to use 
the full machinery of (relativistic) quantum electrodynamics but to dwell on 
some phenomenological aspects as well (see for example, Problem 7.29). 


Appendix to §7.7: Counter-Terms and Mass 
Renormalization 


By summing over 4 and using (7.7.4), the integral corresponding to the 
second term on the right-hand side of (7.7.37) becomes 


he dk 1 > (p: =) 
a p (A-7.7.1) 
4 oa | k hk)? hk ( k/2 
” ic CaF es gag hw) ik] 


multiplied by (27h)? 53(p — p’). 
Accordingly, upon integration over p’ in (7.7.37) we obtain to order e?, 


p? e 2 
e(p) — IM = Mee (0 |ARap(x)|0) 


h? d°k 1 > (p:k)? 
OT? M2 | Wa? cs: (» 5]. (A-7.7.2) 
42M? | kl (9 — pik + hu) Ik| 


The latter may be conveniently rewritten as 


e(p) = (1 Tr) as +a+0b(p) (A-7.7.3) 
where 
5M 8a [© d(fw) 
M =| (fw + 2Mc?) ere 


whose evaluation requires an ultraviolet, i.e., a high-energy, cut-off, 


5 The early experiment was due to Lamb and Retherford (1947). 


402 7 Intricacies of the Hydrogen Atom 


= gqga (lARaD()|0) (A-7.7.5) 


b(p) ah? d?k 1 
ager p pe 
~ An2M3 k| ( (hk)? 


1 ke Ke 
Sa aT ke (3% - 7“) : (A-7.7.6) 
(+ a hw) 


The moral of the calculation leading to (A-7.7.3) is the following. 
We may rewrite the first term on the right-hand side of (A-7.7.3) as 


5M p p 
(1 M ) 2M ~ 2(M+5M)° nee) 


That is, the mass parameter M one starts with in the Hamiltonian does not 

represent the physically observed mass. Due to the ever present radiation 

accompanying the (charged) electron, its mass gets shifted from M to M+ 

5M, and the latter is actually the physically observed mass of the electron 

Mpnys ~ M + 5M. It is also referred to as the renormalized mass. 
Accordingly, if rewrite the Hamiltonian as 


2 
p 
A= ——+H,+H A-7.7.8 
2Mo + 4] + Ho,RAD ( ) 
then Mp does not represent the physically observed mass. It is referred to as 
the bare mass. One may, however, rewrite Mo ~ Mpnys — 5M, to obtain 
p? p? _ 8M p? 


2Mo - 2Monys Mpnys 2M hye 


(A-7.7.9) 


Hence if we express the Hamiltonian in terms of the physical mass Mpnys, 
then an additional term arises in the Hamiltonian, referred to as a counter- 
term, which contributes in making the kinetic energy one calculates to be 
p?/2Mpnys, as it should be. The situation is, however, more complicated 
than that, in that we have also to subtract the term a+ b(p) from H, to 
ensure that the kinetic energy is just p?/2Mpnys- 

All told, we define a new Hamiltonian 


p e2 


H'= 
2M |x| 


+ Hy + Ho rap + dH (A-7.7.10) 
to replace (7.7.8), where the counter-term 5H, is given by 


5H, = —=— —a- Wp) (A-7.7.11) 
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to lowest order, where b(p) is now an operator, and M in (A-7.7.10), (A- 
7.7.11) now denotes the physically observed mass. This is what we have done 
in computing 5E®™ jn (7.7.40) with M in it denoting the physical mass. 
We note that although 5M in (A-7.7.4) is divergent, it does not have to be 
computed (as a cut-off integral) explicitly in calculating the physical quantity 
in (7.7.40). Also the term in (7.7.27), here denoted by a, is automatically 
cancelled out in the process of renormalization. 

For the non-relativistic electron, considered above, it is easily shown (see 
Problem 7.28) that 


b(p) =a P40 ( IpI ) (A-7.7.12) 


for |p|/Mc « 1, hence the last term in (A-7.7.11) is small, consistent with 
the fact that the kinetic energy of the electron is p?/2M, in the ever presence 
of radiation, where M is the renormalized mass.?° 


7.8 Decay of Excited States 


As discussed through (7.7.29)—(7.7.32), by inserting the +ice, « — +40, 
factor in the denominator in (7.7.28) giving (E, —Hco —hw +ie)~1, we obtain 
from (7.7.42) for the decay constant [ in (7.7.31), (7.7.32), 


a 87 
. 4a (=) [etn 


x (tn | [p, Hc] 5 (hw +He- En) ; Plvn) seo 
(7.8.1) 


r 
om ry Dias 


Since iw > 0, the resolution of the identity of Hc, inserted just before the 
Dirac delta in (7.8.1) will be restricted to 


Snr) dn’ (7.8.2) 


, 
n 


(E,,1<En) 


giving from (7.8.1) the explicit expression 


4 
P= sap Mw Ipltdn)[? En - Ew’). (7.8.3) 


(E,,1;<En) 
The norm of a state |¢,,¢) will involve the damping factor?’ 
26 For a mathematically rigorous treatment of renormalization theory, see: 


Manoukian (1983). 
27 See also §3.5. 
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ett (7.8.4) 


for t > 0, and if we translate t by i/T the exponential factor will be reduced 
by the factor e~+, defining h/T’ as the mean lifetime of the state specified by 
n and related quantum numbers. 
For any Hamiltonian H = p?/2M + U(x), it is easily shown that 
iM 
p= —-[H, x]. (7.8.5) 


Accordingly, we may rewrite (7.8.3) as 


_ 4a 
a 


7 (aa) Yo (En — Ent)? (nr |xIPn)/?- (7.8.6) 


, 
n 


(E,1<En) 


where € (tn |X|¢%n) is referred to as the electric dipole moment associated 
with the transition from state specified by n to n’. 

As an application of (7.8.6), we compute the mean lifetime of the state 
2P; /2. The only state to which a transition may occur in this case is the 15} /2 
state. 

Thus we are led to evaluate the matrix elements (n’, ¢’,m’|x|n, €,m) with 
nv=1,l=0,m' =0,n=2,=1,m=0,41. 

For the radial functions in question we have from (7.3.14) 


1\3? 
Rio(r) = (=) Qe77/40 (7.8.7) 
1 af? r 
Ro (r) = @ ; 7 a (7.8.8) 


The radial integration part of (1,0,0|x|2,1,m) is given from (7.8.7), 
(7.8.8) to be 
foe) 9 9/2 
| dr r? Ryo(r) Rai (r) = 8ao (=) . (7.8.9) 
) 


For the angular integration part, we use the expansions of the components 
of the vector x in terms of spherical harmonics as given in (5.3.52)—(5.3.54) 
and use the orthogonality relation of the latter to obtain 


(Yoo |= |¥im) = a (5m,—1 tile 4b 4), V26m,0) (7.8.10) 


From (7.8.9), (7.8.10) we have 


915 
(1,0, 0|x|2, 1,m)|? = 310% (7.8.11) 
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which is independent of m and is a statement of rotational invariance and 
that there is no preferred direction in space. One may then average (7.8.11) 
over m, for the initial state, with arbitrary weights. [This independence of m 
is explicitly seen in the general case as well given in (7.8.15) below.] 

Finally using the expression for the energy shift 


3 
E,—- EE, = gMc’a® (7.8.12) 


we obtain from (7.8.6), (7.8.11), (7.8.12) 


2\8 
P= @ Mea? (7.8.13) 


corresponding to a frequency of 99.75 MHz which is much smaller than the 
Lamb shift between the 2.5; 2 and 2P,/2 states. I’ is also referred to as the 
decay width of the state in question. 

From (7.8.13) we obtain for the mean lifetime of the 2Pj/2 state 


h 
p & 1595 x 10~°s. (7.8.14) 


To evaluate I’ in (7.8.6) for an arbitrary initial state |n,é@,m), we may 
use again the expressions for the components of the vector x expressed in 
terms of spherical harmonics, as given in (5.3.52)—(5.3.54), and the integral 
(5.8.36) involving the product of three spherical harmonics with €2 = 1, m2 = 
0,41, 4, = €,m; =m. A long but straightforward computation gives 


5 (rm 


which is again independent of m and is a statement of rotational invariance 
and that there is no preferred direction in space. 
As in (7.8.9), one is also led to evaluate the integrals 


Pn (ee DT) Y 
[Yim] = (20+ (041) + (+i) (7.8.15) 


Ip rae dr r? Rye (r)Rne(r) (7.8.16) 


in particular, for @’ = +1, and for ¢4 0, @ = @—1, as seen from (7.8.15). 

We note that the expression in (7.8.6) applies to other potentials U(x) as 
well, not only to the Coulomb one, where now the states |W,) correspond to 
the potential U(x), for a charged particle, of charge + |e|, interacting with 
radiation and this given potential. To this end note that the expression on 
the right-hand side of (7.8.1) follows directly from (7.7.28) by making the 
replacement FE, = E,, + ice, and neglecting the momentum of the photon Ak 
in (p — hk)?/2M in (7.7.34). The latter is equivalent to the replacement of 
the exponential factors 
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elk] . jen (7.8.17) 


in (7.7.28), (7.7.34), (7.7.35) by unity. That is, it is obtained by assuming 
that the wavelength of radiation is much larger than the spatial extension of 
the system (e.g., the atom) so that exp (ik - x) ~ 1. This is the so-called long 
wavelength approximation associated with dipole radiation. 


7.9 The Hydrogen Atom in External Electromagnetic 
Fields 


7.9.1 The Atom in an External Magnetic Field 


Quite generally, the shift of the energy levels of an atom induced by an 
external magnetic field is referred to as the Zeeman effect. We consider the 
hydrogen atom in an external uniform magnetic field B..4 such that the shift 
induced by the field is large in comparison to the one corresponding to the 
hyperfine structure but much smaller in comparison to the Rydberg — the 
energy unit of the elementary hydrogen atom. 

In view to the above application, this amounts to replacing B and A in 
(7.4.32) /(7.4.42), respectively, by Bex, and Acx,, and the latter will be taken 
as 


1 
Aext = 5 (Bext x x). (7.9.1) 


Accordingly, for the Hamiltonian, in question, we first consider the ex- 


pression 
eh e€ 
A=H Ve — ——o- Bext — —Aext ° 7.9.2 
c+ Ve — 55750 * Bext — 77pAext * P (7.9.2) 
where we have omitted the A2,, term in comparison to Acxt +p, based on an 
order of magnitude estimate given below. Here Vp is the potential in (7.5.2) 
responsible for the fine-structure splitting, and Hc denotes the Coulomb 
potential with potential energy —e?/|x|. In reference to the A. +p term, we 
also note that V + Acxt = 0. 
With the magnetic field Bex, chosen along the z- (3-) axis: Bex = 
(0,0, B), and with A = B(—y,x,0)/2, we may write 


e€ 

— —Agt> p= -——L,. 7.9.3 

Me ~~ oMe ee) 

For the magnetic quantum number m small, this term, as an order of magni- 

tude estimate, is of the order —ehB/Mc. We consider magnetic fields B such 
that 

leB\ h 

2Mc 

where Ry denotes the Rydberg unit of energy Ry = Me*/2h? = Mc?a?/2. 

For the quadratic term in A.x, we have 


< Ry (7.9.4) 
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= (2. =“ (2 +y?). (7.9.5) 


For the principal quantum number n small, we may estimate (2? + y?) by 


the Bohr radius squared aj = (h?/M e2)”. That is, as an order of magnitude 
estimate 


2 1 /ehB\? 1 Blh 
ey (5 ) oi (7.9.6) 


eo RA py 
2Mc2 * 16 \ Mc) Ry 2Mc 
where we have used (7.9.4). Accordingly, for n small, one may formally neglect 
the term quadratic in Agx in comparison to the linear term for magnetic fields 
B satisfying (7.9.4).7879 
We may then rewrite (7.9.2) as 


AHA=Heo+ Mf (7.9.7) 
with 
[en ea Ee (7.9.8) 
I— VF Me Z z od 


and S, = ho3/2, such that (7.9.4), (7.9.6) hold true for |m|, n small. 

With J? written as in (7.5.3), we note that since (L, + 2S.) does not 
commute with S-L, J* does not commute with H. But H commutes with 
L”, J, and S?. 

We are thus led to consider a linear combination of the states ~j,¢.m,.n 
in (7.5.4), for different 7, with the angular parts (6,¢|7,@,m,) as given in 
(A-7.5.4), (A-7.5.5). In detail 


/ 1 : 
Rne(r) e+ Dy} +m; Ye,m;—1/2(9, ¢) 


W4,£,m;,n(X) ee EES ( me ) 
£,M;, 2l+1 Q b 
e+ 5 —™;5 Yom;+1/2( 2 ) 


for j = £+1/2, |mj| < £+1/2 


ee Co ea [= 2+ 3 = 3 Yom, 1/216, 9) | 
case ey) ea | 
Ja+1 Ot E+ mj Yerms+1/2(9,) | 


for j = €—1/2, |m;| < €-1/2. 


(7.9.10) 


8 More precisely, we deal with principal quantum numbers n such that |eB|h/Mc 
is much smaller than the differences between successive energy levels E,,. 
°° The contribution of the e? A?/2Mc? term will be also analyzed in Chapter 12. 
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For £4 0, we introduce linear combinations 


dnem = S> agy(é,m’)j,¢,m/ n(x) (7.9.11) 


j=l+1/2 


for |m’| < 2— 1/2, and the states 
Pn,tym' (X) = Vj,’ nl je41/2 (7.9.12) 


for m! = 4+(€+ 1/2). 

For a sufficiently strong magnetic field B such that Vp is small compared 
to the second term in (7.9.8) and such that the conditions in (7.9.4), (7.9.6) 
hold true, we may consider the Hamiltonian 


ah Hoe. \: (7.9.13) 


[Later we treat Vp as a perturbation to H’.] 
The eigenvectors of (7.9.13) are given by 


vnim() (q)> boom) (7) (7.9.14) 


where Wnem(x) are the Coulomb wavefunctions given in (7.3.18), with corre- 
sponding eigenvalues 


Ey, + (m+ 2m,), ms, = +1/2 (7.9.15) 
respectively, where E,, = —Ry/n?, and we have set 
ehB 
- = 0. 7.9.16 
oMe ="? ( ) 


The Zeeman effect for a strong magnetic field is usually referred to as the 
Paschen-Back effect. 

To treat Ve as a perturbation to H’, we consider the linear combination 
introduced in (7.9.11), and the states in (7.9.12). The eigenvectors in (7.9.14) 
may be equivalently rewritten in the form of (7.9.11), (7.9.12) as follows. For 
£ #0, the first eigenvector may be rewritten as in (7.9.11) (see Problem 7.31) 
with 


£+1/2+m’ £4+1/2—m' 


se) VY beet 7 MEMO TV eT 


(7.9.17) 


corresponding to the eigenvalue E,, + 7(m’ +1/2) with m’ = m-+1/2,|m'| < 
€—1/2, while for the second eigenvector, 


€+1/2—m’! (/€+1/2+m! 
Ge41/2) = W+1 ’ Q(e—-1/2) = ya (7.9.18) 
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corresponding to the eigenvalue EF, +7(m’—1/2), with m’ = m—1/2,|m’'| < 
£—1/2. For m’ = +(€+ 1/2), the eigenvectors are given in (7.9.12), corre- 
sponding respectively, to the eigenvalues FE, + y(€+ 1), Fn — n(€+ 1). 
Since Ve commutes with i Be J,, it is diagonal not only in %j,e.m;,n but 
also in @nem’- 
The corrections to the eigenvalues of H’ are then given by 


AE nem! = (Onem! |\Ve|Onem’) . (7.9.19) 


The latter is easily evaluated to be 


+ _ 1 
AE im! = Ss +n{m' += (7.9.20) 
neem 2 2 
for |m’| < €— 1/2, 
+ 1 
AEntm =e+ +n (« x 5) (7.9.21) 
for m’ = +(€4+ 1/2), where, 
E+ = AE n, jel j=041/2 (7.9.22) 


and the AE,,,;,¢ are the fine-structure corrections in (7.5.19). 

More generally, since we are interested in the splitting of the energy level 
FE, for given fixed n, due to the interaction term Hy, assuming that the latter 
induces a small correction to E,,, we consider the matrix element of Ho + Hy 
in (7.9.7) between the state R,»¢(r), thus defining an effective Hamiltonian 


Heg(n, ¢) = | d?x Rne(r) [Ho + Hy] Rne(r). (7.9.23) 


For ¢ = 0, we have from (7.5.15), 


2 2nS. 
Hop (n,0) = En — En (1 : ue (7.9.24) 
n 4n A 


and for 4 0, we have from (7.5.16), (7.5.18), in particular, that 


(; ae z) 


1 (J? — h?l(6 +1) — 3/4) 
2h2 &(€ 4+ 1/2)(€ +1) 


ae 
Heg(n, €) = En En 
n 


7 
+ (Lz + 282). 
(7.9.25) 


These effective Hamiltonians, as the Hamiltonian in (7.9.2), commute with 
J,. Note in particular that Heg(n,@) in (7.9.23) does not commute with J’. 
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The eigenvalue problem is now formulated as follows. 
The eigenvectors corresponding to (7.9.24) are given by 


Yo,0(0, | | 0 
/ , 7.9.26 
| 0 Yo,0(9, ¢) ( ) 
with eigenvalues 
a? 3 
En Ey {1 +7 (7.9.27) 
n An 


respectively. 

From (7.9.12), for m; = m! = £(€41/2), F?0j,emn = FI(G+1)Dj,e,m,n's 
with 7 = (€+ 1/2). That is, for m’ = +(€+ 1/2), the eigenvectors of (7.9.25) 
are given by 


Yoe(9, | | 0 
, 7.9.28 
| 0 Y,,-2(9, d) ( ) 
with respective eigenvalues 
En + AEn,5,¢\j=041/2 x n(l + 1) = Eynt+ept n(l + 1) (7.9.29) 


using the notation in (7.9.22). 
For |m’| < £—1/2, the situation is more involved. To this end, we consider 
eigenvectors, as obtained from (7.9.11), to be 


‘ fb+ s +m Yo m'—1/2 
ee 
e+ 37 m! Yom! 41/2 
7 —\/l+ 5 —m’ Yom—1/2 


a 7.9.30 
w+ : veo) 
[e+ btm! Yomsap2 
The eigenvalue equation then reads 
Aeg(n, £)Xe,m! = (En + Entm’ ) Xé,m! « (7.9.31) 


Upon substitution of (7.9.30) in (7.9.31) using (7.9.25), and the definitions 
in (7.9.22), we obtain 


_, feti/2—m! (e-e_ — nlm’ +1/2) 
nny eam (ea ante +19) (7.9.32) 
e+ 1/24m! (e—e~ ~ nlm’ — 1/2) 
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where e€ stands for €negm’, and by setting e—nm’ = ec’, we obtain the quadratic 
equation 


2 
1D So 1) (E+ es) = 
e —e(e,+e_)+ (cre ‘ ) ic Say Pr a 0. (7.9.34) 


The solutions of (7.9.32)—-(7.9.34) are readily obtained, by using in the 
process that |u|? + |u|? = 1, to be 


i a ER ee (7 ehh n\2 4m’ 71 
“nem! = 2 ene ee (ae) wD re i) a 2l+1 ce) 
(7.9.35) 
where 
Aree fp. = (7.9.36) 
Se ie OES a 
and ii 
4 1 1 2m! n 
= 1 1 9. 
u a ( +7 + (Z))) (7.9.37) 
n 1 1 Im 7H on 
v Fi (1 va (1 +o (+ )) (7.9.38) 
where 
25: HAP y. Ae eG 
ae y+ (me) Ponet ae) ee 
For a strong magnetic field |7/Ae| >> 1, (7.9.35) reduces to 
+ = 1 
Epp = a a 4 n G + 5) (7.9.40) 


which together with (7.9.29), (7.9.27), coincide with the earlier solution in 
(7.9.20), (7.9.21). We also note that 


Z 0+1/2Em! # C+1/24=m 
o ye nl 20+1 aay) 


which coincide with the coefficients in (7.9.17), (7.9.18). 
For a weak magnetic field |n/Ae| < 1, 


1 
(oe J 1+ —— 9.42 
Enom! SE +m ( si) (7.9.42) 


which upon combining with the results in (7.9.27), (7.9.29), provide a shift 
induced by the magnetic field given by 


AE jem, = M595, (7.9.43) 
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where |m,;| < 


p+ 1/2 
IL = (- -f (7.9.44) 
with the latter referred to as the Landé g-factor (see also (5.7.32)). The 
expression in (7.9.44) follows by noting that for m; = +(¢+1/2),j = €4 
1/2, mj; 9;,e = £(€+ 1), while for |m,;| < €—1/2,7 = €£1/2,9;¢ = (20+1+ 
1)/(2@+ 1). 

The splitting in (7.9.43) takes the spin of the electron into account and is 
referred to as the anomalous Zeeman effect — a rather unfortunate nomen- 
clature.?° When the spin is not taken into account or if one is dealing with a 
spinless particle, the effect is usually referred to as the normal Zeeman effect, 
with m; > m,gj,e > 1. 


7.9.2 The Atom in an External Electric Field 


The shift of energy levels of an atom induced by an external electric field 
is generally referred to as the Stark effect. We consider the hydrogen atom 
in an external uniform electric field € such as the splittings induced by the 
latter are much smaller than that of the fine-structure but much larger than 
that of the hyperfine-structure. 

Accordingly, we consider the Hamiltonian 


H = Ho+ Vp - e&2, z=rcosé (7.9.45) 


where we have taken the electric field to point in the z-direction, with —e€z 
taken formally as a perturbation. 

The energy levels of Hc + Vp, with the Vp contribution taken small in 
comparison to the differences between the energy levels FE, provided by the 
Coulomb potential, is determined by the two quantum numbers n and 7 and 
are given by 


a? 1 3 
Eng = En En [— 1/2 =| : (7.9.46) 
as obtained in (7.5.19) 

We are considering shifts in E,,,;, as induced by the electric field €, for 
given fixed pairs (n,7). The perturbation —e€z commutes with J, but does 
not commute with L? due to its dependence on 6. Accordingly, we are led 
to consider linear combinations, denoted by ¢n,j,m,, of the states in (7.9.9), 
(7.9.10) with @= 7 +1/2. 

We may then introduce the effective Hamiltonian 


Hog (nj,j,m;) = S- a lt je mj) 


(=j41/20=j 


3° Historically, the theory of the normal Zeeman effect ran into problems when 
confronting experiments until the theory of the anomalous one was used based 
on the incorporation of the spin of the electron in an atom. 
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x (n,j,£,m;|H|n, j,€, mj) (n,j,€,mj|. (7.9.47) 
Now we rewrite (7.9.9), (7.9.10) in terms of j on their right-hand sides: 


J+ ™; Yj-1/2,m;-1/2 


Ry j—-1/2 
W5,0,m3,n = erie (7.9.48) 
j- mM; Yj-1/2,mj+1/2 
for €= j — 1/2, 
R46 J +1 — m3; Yj41/2,m;-1/2 
W5,L,m;.n = eo cm (7.9.49) 
G+) J +1+ mj Vj41/2,m5+1/2 
for €= 7 +1/2. 
By setting 
timima= D>. arbi pmen (7.9.50) 
l=j+1/2 
the eigenvalue equation then reads 
Heg(n, j, ™m)bj,m,.n = jum; nPj,mj,n- (7.9.51) 


To evaluate the matrix elements, (n, j, £,m, |H|n, j, &’,m,;), we recall that 
the matrix elements of (Hc + Vp) are given by Ey, j;dee, with E,,,; given in 
(7.9.46). To find the matrix elements of r cos 0, we use the identity in (5.8.41) 
to carry out the angular integration. To this ends, the latter identity leads 
for all £, €’, m, m’: 


[oo Yorm: (9, d) cos 8 Yom (6, ) 


£4+1)2—m? oe 
- Brea ebm Fy Geen fet 18mi ms (7.9.52) 


Thus for the radial integration only the product Ry j—1/2(") Rn j41/2(") 
will contribute. An application of the properties of the associated Laguerre 
polynomials in (7.3.16) also shows (see Problem 7.33) from (7.3.14) that 


i dr 7? Ry j—1/2(") Rnj+1/2(7) = — aon (2n)? a (29 + 1)?. (7.9.53) 
0 


All told, we obtain from (7.9.52), (7.9.53) 


(n, 5,9 = 1/2,m,; |r cos O|n, 9, J +- 1/2,m,;) 


SS 4 AO OFF? Say 
= <agnm; GED =M (7.9.54) 


8 
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and zero for the diagonal element ¢ = ¢’.*+ 

Upon substitution of (7.9.54) in the 2 x 2 matrix equation in (7.9.51), 
we obtain the necessary condition (the secular equation) for the existence of 
non-trivial solutions, 


EF = Ejmjin —eM'E 


n,J = 
dea ae ee =0. (7.9.55) 


That is, the energy levels are given by 


JOnP = 25 +1? 
j(j + 1) 
For an electric field € strong enough such that the fine-structure effect 
may be neglected, it amounts to taking the Hamiltonian as 


Ee ee = eg geaoe myn 


(7.9.56) 


H = Ho - e&z. (7.9.57) 


The latter commutes with J, but does not commute with J? and L?. Accord- 
ingly, the above procedure leads to considering linear combinations of several 
states for n not too small and the analysis becomes laborious. 


Problems 


7.1. Show that for the probability distribution of the kinetic energy, in 
particular, one has 


Prob [T > To] < = (T) 
To 


for any Ty > 0 and infer from (7.1.16) that for the hydrogen atom 


1 /2pe! 
Prob[T' > Tal < x ( aa ). 


7.2. For any linear combination 
w(x) — S- Gnemvnem (x) 
n,lym 
with the nem (x) denoting the eigenstates of the hydrogen atom, and 
om la@nem|" =1 
n,lym 


3! We note, quite generally, that the matrix elements for ¢ = ¢’ are zero because 
the corresponding states have a definite parity (see (5.3.43)), while z = rcos0 
has odd parity. 


7.3. 


7A. 


7.0. 


7.6. 


7.7. 
7.8. 


7.9. 
7.10. 
7.11. 
7.12. 
7.13. 
7.14. 
7.15. 


7.16. 
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(i) derive bounds for Prob [r < ro], Prob [I > To] and compare them 
with the ones in (7.1.21), and the one in Problem 7.1, respectively. 
(ii) Derive the exact expressions for Prob [r < ro], Prob [Z’ > To], for 
the hydrogen atom in its ground-state. [Hint: It may be use- 
ful to show that the ground-state in the momentum descrip- 


tion is given by ®(p) = 8/rae” [1 + p2a3/h?|~* normalized as 


3 
| eye Pwr? = 


(iii) Evaluate the left-hand side of the inequality (7.1.13) for any of 


the eigenstates |Wnem), and show that it is consistent with the 
inequality in question. 
With p, defined in (7.2.2), show that 


@) = (55422) 
(ii) [pr] = ih 
Gi) p= 4 
(w) [pd] = 
©) Wi = 3 (4). 


With the boundary conditions imposed on the functions in (7.2.10) 
for r — oo and r — 04, show that the adjoint DF of Dz is given by 
the expression in (7.2.11). 

Verify the relations in (7.2.12)—(7.2.15). 

Show that the normalized solution of (7.3.4), according to (7.3.5), is 
given by (7.3.6). 

Use (7.2.13), (7.2.15), with @ — @—1, to establish (7.3.10), for ¢ 4 0. 
Show that (7.3.11) leads to the differential equation in (7.3.12), and, 
in particular, for = n — 1, it leads to the solution in (7.3.13). 

Show that (7.3.14) coincide with Ryn-1, Rnn—2 given, respectively, 
in (7.3.6), (7.3.13) for 2=n—1,n—2. 

Use (7.3.24), (7.3.25) to establish the orthogonality relation in (7.3.26), 
(7.3.27). [Is there an orthogonality relation for n’ = n, ¢’ # ¢ for the 
Rre?| 

Derive the expectation values in (7.3.34)—(7.3.38). 

Prove the equalities in (7.4.26), (7.4.27). [Note the positions of the 
various brackets in these equations. | 

Derive the operator identity in (7.4.31). 

Derive the fundamental Poisson equation (7.4.36). 

Derive afresh the expressions for Vp, Var starting from (7.4.2) for a 
spin 0 charged particle interacting with a proton. 

Find the fine-structure correction corresponding to the one in (7.5.19) 
for the spin 0 case in Problem 7.15. 
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7.17. Show that the operators F?, J?, S*, I’, L?, F, introduced in 87.6 
commute in pairs. 

7.18. Find the simultaneous eigenstates of the operators in Problem 7.17 
in terms of the spherical harmonics and the spin eigenstates of the 
electron and proton. 

7.19. Find the hyperfine correction corresponding to the one in (7.6.47) for 
the spin 0 case in Problem 7.15. Is there such a correction for € = 0 
states ? 

7.20. Derive the operator identities in (7.6.33)—(7.6.36). Here it is worth 
recalling, in particular, the operator equality in (7.5.17). 

7.21. Show that the matrix elements of (7.6.35), (7.6.36) lead to the expres- 
sions as indicated in (7.6.37), (7.6.38). 

7.22. Derive the equalities in (7.6.37), (7.6.38) directly from the Wigner- 
Eckart Theorem in §5.7, §5.8. 

7.23. Extend as much as possible, the analyses given in §7.1—-7.3 to the 
attractive 1/r potential in two dimensional space. 

7.24. Find the coefficient of the orthogonality relation of the photon states 
(k, A|k’, 1’) x 63(k—k’)6)y, from the completeness relation in (7.7.5). 

7.25. Provide a physical explanation if the extreme case with Z3 = 0 is 
realized in (7.7.14). 

7.26. Prove the equality in (7.7.46) by using in the process (7.4.37), (7.3.32) 
for Z > 1. 

7.27. Consider the addition of the spin part — st o-Brap, where Brap = 
V x Arap to the Hamiltonian in (7.7.7). 

(i) Show that 


(0|[Braplk, A) = iV4a Vice**k x ey(k). 


(ii) Show that p-e)[]e)- p in (7.7.26) becomes simply replaced by 
n(p,k)-e,[Je,- n'(p,k) where 


ih 
n(p, k) = (p+ 37% k) : 


(iii) Show that for any operator Q which commutes with o and k, the 
following relation holds true 


k-pk he? ih 
dome, Qean! = (» ke ):28 yRQ+5o-(k x [Q,P]). 
mn 


(iv) Carry out the renormalization in the appendix to §7.7 by including 
spin as described in (i). 
(v) Investigate then the contribution of spin in the calculation of §7.7. 
7.28. By explicit integration of b(p) in (A-7.7.6) derive (A-7.7.12). 
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7.29. Suppose that the charge distribution of the proton is given by 


p(x) = sexn(-r/), r= hl, 70 
ay 

normalized to unity, with the empirical data for the root mean square 

radius of the proton \/(r?) = 0.81 x 107! m. 

(i) Find the value of ¥. 

(ii) With the Coulomb potential U(x) = —e?/ |x| now replaced by 


Ue(x) = -e? [es BED: 


Ix — x’| 
estimate the energy level shifts, with 
AUc(x) = U(x) — Uc(x) 


treated as a perturbation. 

7.30. Provide the details of the computations leading to the results given 
through (7.8.9)—(7.8.11) in determining the mean-life of the 2P,/2 
state. 

7.31. Show that the eigenvectors in (7.9.14) may be rewritten as in (7.9.11) 
with coefficients given in (7.9.17), (7.9.18). 

7.32. Derive the relations in (7.9.32), (7.9.33) and the quadratic equation 
(7.9.34) for the eigenvalues. Finally show that the solutions of these 
equations are as given in (7.9.35)—(7.9.39) 

7.33. Use the expression for the associated Laguerre polynomials in 
(7.3.16) /(7.3.15) and the relation of Rp ¢(r) to them in (7.3.14) to 
prove (7.9.53). Finally show that (7.9.52), (7.9.53) lead to the result 
given in (7.9.54). 

7.34. The Schrédinger equation, with the Coulomb potential —e?/|x|, can 
also be separated in the so-called parabolic coordinate system. The 
latter may be defined by introducing three variables £,7,¢,0 < € < ~, 
0<n<w,0< ¢< 2n, with 


n= Vijcosd, y= JEising, 2= 5(€-2). 


In particular note that r = (€ + )/2, én = r? sin? 6, @ = tan7!(y/z). 
(i) Show that the Laplacian in this coordinate system is given by 


2 4 0.0 0.0 i 7 
Vo= €é—+—n + Ze 
(§+n) LOg°0 An On} &n Ad 
(ii) Prove that the 3D volume element is given by (€ + 7) d€édnd@/4. 
(iii) Carry out a separation of variables of the Schrédinger equation, 
with the Coulomb potential, to show that the eigenvectors in the 
(€,7, ¢) representation may be written as 
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= alien’ 


W(x) cE we 


% 


where m is the magnetic quantum number, and x1, x2 satisfy the 


equations 


| 
| 


d2 (m*-1) MZ,e2 ME 
d2 (m?-1) MZge? ME 
dr? 4n? hn ape | X2(n) = 0 


with Z,, Z2 as separation constants and Z,+ Z2 = 1. [For a hydro- 
genic atom with potential —Ze?/ |x|, then Z, + Z2 = Z.] You may 
express part of the solutions of the above equations, as factors, in 
terms of the associated Laguerre polynomials in (7.3.15), (7.3.16) 
as functions of €,7. 
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Quantum Physics of Spin 1/2 and Two-Level 
Systems; Quantum Predictions Using Such 
Systems 


Spin 1/2 and two-level systems are simple enough that quite often they 
allow explicit solutions to their underlying problems and provide a wealth of 
information on quantum systems, in general. This chapter is devoted entirely 
to these structures and their intricate details. It includes important quan- 
tum predictions that are made by their direct analyses which may be tested 
experimentally. 

General properties of the above systems are studied in §8.1, including the 
exponential decay law (§3.5) in two-level systems. The Pauli Hamiltonian as a 
generalization of the Schrédinger equation of a non-relativistic spin 1/2 par- 
ticle is discussed in $8.2 and makes contact with supersymmetry introduced 
in §2.9 and further elaborated upon in §4.7, §6.5. The so-called Landau levels 
are treated in §8.3 and special emphasis is put on the g-factor of the electron. 
Spin precession and accompanying radiation losses is the subject matter of 
§8.4. A derivation of the anomalous magnetic moment of the non-relativistic 
electron in the ever presence of radiation accompanying a charged particle is 
given in 88.5. This gives a deviation of the g-factor of the electron from the 
value 2 which is remarkably quite accurate. Density operators (§1.5) and the 
scattering of spin 1/2 particles off spin 0 and spin 1/2 targets are studied 
in 88.6. The analysis carried out in §1.10 on probability and measurement 
is extended in much detail in 88.7 emphasizing the role of the environment, 
surrounding a physical system and a measuring device, and the so-called 
quantum decoherence as well as the quantum superposition law in the light 
of classical notions of measurements. The Ramsey oscillatory fields method, 
based on the Ramsey apparatus, is introduced in §8.8 which provides inter- 
esting applications to interference phenomena, spin flips, and in monitoring 
spin, in general, as a particle moves in different magnetic field zones. The 
role of the superposition law for macroscopic systems, such as Schrédinger’s 
cat, and quasi-macroscopic (mesoscopic) systems, often referred to as kittens, 
and the important role of quantum decoherence, due to the environment, are 
studied in 88.9 as extensions of the work in 88.7. Bell’s test together with 
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the analysis of basic processes in the light of Local Hidden Variables is the 
subject matter of 88.10. Quantum teleportation and quantum cryptography 
which rely on fundamental and mysterious aspects of quantum theory are 
treated in §8.11. The rotation of a spinor by 27 radians, initially introduced 
in §2.8 ((2.8.68), (2.8.70)), is analyzed from a practical dynamical point of 
view in 88.12. The theoretical foundations of geometric phases are developed 
in §8.13. There has been much interest in recent years in geometric phases 
both experimentally and theoretically. Finally we provide an analytical quan- 
tum dynamical treatment of the Stern-Gerlach effect for charged as well as 
neutral spin 1/2 particles in §8.14. 


8.1 General Properties of Spin 1/2 and Two-Level 
Systems 


Much has been developed in the previous chapters on spin, in general, and 
spin 1/2, in particular. Here we discuss some of general aspects of spin 1/2 
which are useful in describing its quantum dynamics and treating problems 
associated with quantum measurement. Some pertinent aspects of two-level 
systems, in general, are also investigated. 


8.1.1 General Aspects of Spin 1/2 


Under a c.c.w. rotation of a coordinate system about a unit vector n by 
an angle y, a wavefunction, in the coordinate description, transforms as (see 


(2.7.42)) . 
(x) = (exp (Gn-3) wo (8.1.1) 


where J is the total angular momentum. 

The spin S arises (§2.7, §5.4), by rewriting (8.1.1) in terms of the coordi- 
nate labels x’ = Rx, with R the rotation matrix defined in (2.1.4), (2.1.24), 
via (see (2.7.43)) 


vita) = (ex (Fn-8) #00. (8.1.2) 


As an angular momentum operator, the spin components satisfy the com- 
mutation relations (see (2.7.10)), 


[S*,.$7] = ihe *5*. (8.1.3) 


An elementary way of obtaining a representation of the spin operator S 
for spin 1/2 is the following. With the spin quantization along the z-axis 
(§5.4), the eigenvalue equations 


1 Cf., §2.7, §2.8, §5.4, 85.10. 
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°(-20: °Q-40) 


for spin along or opposite the orientation of the z-axis, provides the repre- 


= € =) (8.1.5) 


On the other hand, the commutation relations (see (5.10)), 


sentation 


[S?.S2) Shoe (8.1.6) 
where 
SG eS" (8.1.7) 
give, 
O01 00 
s.a0(22),  s2n(29) oe 
from which, 
h (01 h (0-i 
i ior. t 
eA), ta8(), fs) 


Upon writing S = ho/2, the so-called Pauli matrices 01, 72, a3 as the 
components of o, satisfy the important relations, 


OjOk = jk + IE ;KIOI (8.1.10) 
from which, or directly from (8.1.3), 
(0; , on] = 2i€;RI01- (8.1.11) 
The transformation rule in (8.1.2), in terms of the components 7)“ of the 
spinor ~ (§2.8), reads (see (2.8.1), (2.8.7)) 
laf l\ __ YP b 
w(x!) = [exp (iZn-o)| v(x) 
= (cos £ gah isin * [n- o"”) w(x). (8.1.12) 
The matrix exp (ip n:o/ 2) is given explicitly by 


cos . + ing sin . (ing + nz) sin _ 
exp (ifn. c) = (8.1.13) 

2 ; _ Oe tas 
(in, — ng) sin 5 008 5 — ingsin 5 


2 


where n = (m1, na, n3) : 
For the spin S) along an arbitrary unit vector 
N = (sin 6 cos ¢, sin @ sin @, cos 6), 
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cos@ sin6 ei? 
N-S= SnN= = (8.1.14) 
sin@ ee? —cosd 


with eigenstates |1/2,N), —1/2,N), 


h 
Sn |£1/2,N) = 5 |+1/2,N) (8.1.15) 
6... 
Cos a e7i¢/2 — sin — e7i¢/2 
|1/2,N) = , |-1/2,N) = . (8.1.16) 
sin 5 ei¢/2 cos 5 eb /2 
In particular, 
0, 1 G- 5 0 
_ * a—id¢/2 « / nid/2 
|1/2,N) = cos 5° (4) +sin5 e (7) (8.1.17) 


and with z a unit vector along the z-axis, 


|1/2,2) = |+z) = (6) , |-1/2,2) =|—-z) = (?) (8.1.18) 


one obtains the amplitudes 


(+z|1/2,N) = cos ; gel? (8.1.19) 
(-z|1/2,N) = sin ; eiP/? (8.1.20) 
Similarly, 
(+z|-1/2,N) = -sing gone (8.1.21) 
(-z|-1/2,N) = cos ; eels) (8.1.22) 


For example, for a particle with spin initially prepared in the state |+z), 
which goes through a filtering process represented by a selective measurement 
|1/2,N)(1/2, N| symbol, the amplitude of a spin flip, i-e., that of the spin 
of the particle to be found in the state |—z) is given by (§5.4) 


(=z|1/2,N) (1/2,N|+z) = sin ee (8.1.23) 


9 . 
This gives a probability of (sin? @)/4 of a spin flip, with a maximum proba- 
bility of 25%, for @ = 7/2. 
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For an apparatus which allows both components [+1 12: N) to go through, 
the spin flip amplitude is reduced to zero 


(=2| [|1/2,N)(1/2,N] + |-1/2,N)(-1/2,N] | |+2) 


= (—2|1|+z) = (-z|+z) =0. (8.1.24) 


As another example suppose a measurement of spin of a particle is carried 
out along a direction specified by a unit vector n; = (sin 8, 0, cos 01) followed 
by a measurement of spin along ng = (sin 02,0, cos 02), where 6, 4 6. If the 
spin of the particle is initially prepared in the state |+-z), then the probability 
that a measurement of spin along n, is found to be parallel to n; followed 
by a measurement of spin along ng is found to be, say, parallel to ng as well, 
is given by 


0, —0 0 
[+n +m) (4m | +2)? = 0s? ( : 5 :) cos” (F) (8.1.25) 


where we have used the notation |+1/2,m) = |+n). On the other hand for 
the reverse process of measurement of spin along ng, and found to be parallel 
to ng, followed by a measurement along nj, and also found to be parallel to 
nj, is given by 


0, —0 0 
(+m; |+my) (+n2|+z)|? = cos? ( : 5 2) cos” (2) (8.1.26) 


demonstrating, in particular, the inequivalence of the orders in which the 
measurements are carried out. 


8.1.2 Spin 1/2 in External Magnetic Fields 


Consider a neutral particle of spin 1/2, such as the neutron (§8.2), with 
magnetic moment yu < 0. Restricting for simplicity to the dynamics of the spin 
only,” the Hamiltonian for the interaction of spin with an external uniform 
time-independent magnetic field B = |B|n is given by 


H =—p|Bln-o =|pBln-o. (8.1.27) 
Hence from (8.1.13), the time evolution operator is given by 
we . . wt : . wt 
cos ing sin (in, + ng) sin — 
i 2 2 2 
U(t) = exp Gua = 
A (i ‘i wt wt. . wt 
iny —N2)sin-> cos > + ing sin > 
(8.1.28) 


? The more complete description, where the kinetic energy of the particle is not 
neglected, or more precisely is not assumed to be negligible in comparison to the 
spin part, will be studied in detail in §8.14 in the Stern-Gerlach effect. 
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with w = 2|4B|/h. For a particle initially prepared in the state |+z), the 
amplitude of a spin flip at time t > 0, is then 
wt 


(—z|U(t)|+2) = —(iny — ng) sin a (8.1.29) 


giving a probability of spin flip of (n? +73) sin? wt/2. For a magnetic field in 
the z-y plane and at time t = 7/w, the spin flip probability reaches a 100%. 
Consider the time-dependent Hamiltonian 


H(t) = —yBR(t)-o (8.1.30) 


for the interaction of spin with a time-dependent magnetic field BR(t), where 
R(t) is the unit vector 


R(t) = (sin @ cos wt, sin # sin wt, cos 8). (8.1.31) 


As an initial state |y(0)) of spin, we choose it to be the eigenstate of H(0). 
That is, 


(0)) = (8.1.32) 


up to a phase factor. 
Using the notation —u~B = hw /2, the Hamiltonian in (8.1.30) may be 


rewritten as i 
@ sind e™ 
hw cos 
H(t) = — (8.1.33) 


sind el! —cosé 
By setting 
I(t) = (8.1.34) 
a_(t) 


the Schrédinger equation leads to the simultaneous equations 
7 = . WO . > —iwt 
a4 = —i> |cos daz+sind ea_ (8.1.35) 


aiwt 


yes iv [cos 8 a_—sinfe as| (8.1.36) 


With the initial condition in (8.1.32), the solutions of (8.1.35), (8.1.36) are 
readily obtained (see Problem 8.2) to be conveniently written in the form 


OfQ+wW-up voy Q-wt+wo _vor.. 
ay(t) = 0085 [Foe alr a8 ae ue (8.1.37) 
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ree ae Q-—w-wo vo Q+w+wo _; 
“(*th= iwt i(Q—w)t/2 i(Q+w)t/2 
a_(t) =sin=e | 50 e + pg 
(8.1.38) 


where 


Q= je? — 2wwy cos 6 + we. (8.1.39) 
This solution will find an interesting application in §8.13 on geometric 
phases. 
As an alternate initial condition to (8.1.32), consider the following one 


a 
cos 5 
wor=| - (8.1.40) 
sin mY 
where? 
ind 
sina = “08 (8.1.41) 
Q 
Wo cos 8 — w 
COO (8.1.42) 


with © defined in (8.1.39). The solution then takes the particularly simple 
form 
cos Or ev iwt/2 
[w(t)) = : ae (8.1.43) 
«it /2 
sin 5 e / 
An application of this solution will be made in §8.13 on geometric phases. 
Numerous additional applications of spin 1/2 devoted to concrete situa- 
tions will be given in the remaining sections of this chapter. We next consider 
the dynamics of spin in a general time-dependent magnetic field. 
By absorbing the magnetic moment ys of the particle in question in the 
magnetic field, we consider the Schrédinger equation 


ad 
iia, |O(t)) = H(é) l()) (8.1.44) 
with time-dependent Hamiltonian 
A 
H(t) = gKM -o (8.1.45) 
where K(t) is a time-dependent vector. 


3 Such an initial condition is also considered in Lin (2002) and references therein. 
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We prepare the spin of the particle in an initial state |(0)) as an eigen- 
state of ng - o, where no is an arbitrary unit vector, 


No = (sin 99 cos do, Sin Oo sin go, cos 90) (8.1.46) 
with 
no + 0 |4(0)) = |(0)). (8.1.47) 
Hence |7(0)) may be chosen to be 


Oy. 
cos — e~i¢o/2 
2 


\v(0)) = (8.1.48) 


Oo 
sin — ei?o/2 
2 


up to a phase factor. 
Let n(t) be a unit vector satisfying the equation* 


n(t) = K(t) x n(t) (8.1.49) 
with initial condition 
n(0) = no (8.1.50) 


then, as is easily shown below, the solution |4(t)) of (8.1.44) is an eigenstate 
of n(t) - o, that is, 
n(t) +o |(t)) = |¥(¢))- (8.1.51) 

To establish (8.1.51), we partition the interval [0,¢] into infinitesimally 
small sub-intervals. For t = 0, (8.1.51) coincides with (8.1.47) since n(0) 
satisfies (8.1.50). Therefore, it is sufficient to show that (8.1.51) is true for 
infinitesimally small At to complete the demonstration by induction. 

To the above end, it is readily checked that 


d 


cn [n(t) - o|d(t))] = -3K(i) +o [n(t) -o|v(t)) | (8.1.52) 


where we have used (8.1.49) and (8.1.44). Hence for At ~ 0 
n(At) + 6 |y(At)) = m(0) + 6 |9(0)) ~ 5K (0) + [(0) + }y(0))] At 
= |1¥(0)) — $K(0) - 6 |4(0)) At 
= |W(At)) (8.1.53) 


4 Cf. Lin (2002), and references therein; Wagh and Rakhecha (1993); Feynman 
et al. (1957). 
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as following from (8.1.44), (8.1.47), (8.1.52). That is (8.1.51) is true for t = At 
as well and hence for all t by induction. 
Since the unit vector n(t), may be parameterized as 


n(t) = (sin O(t) cos ¢(t), sin 0(t) sin d(t), cos 6(t)) (8.1.54) 


the solution of the Schrédinger equation (8.1.44) may be written from (8.1.51) 
as 
cos a e 16(t)/2 
|w(t)) = ; (8.1.55) 
sin ©D gotwr2 


up to a phase factor. 


8.1.3 Two-Level Systems; Exponential Decay 


Suppose that transitions occur between two levels, which we denote by 
|0) and |1) with corresponding energies Eo and £1, with E; > Ep. We may 
introduce creation at, and annihilation ap operators connecting these two 
levels 


at, |0) = |1) (8.1.56) 


ap |1) = |0). (8.1.57) 


For transitions that are restricted to these two levels, we have (al.)? = 0, 
(ap)? = 0, for example, and the operators ap, al, satisfy anti-commutation 
rules (cf. (6.4.2), (6.4.3)). 

By representing the states |0), |1), for example, by (0 ite (1 Oy we may 
write the Hamiltonian, in the absence of interaction, i.e., the free Hamiltonian 
as 

@ :) fo+ki | Fi —£o 
Ay = = 


eg ee (8.1.58) 


where o3 is a Pauli matrix. 
For the interaction Hamiltonian causing transitions between these two 
levels, we consider the simple structure 


Ay = & a) (8.1.59) 


where V may be time-dependent. 
Thus upon writing the solution of the time evolution problem as 


b(t) = e MEot Fit /2h ES (8.1.60) 
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the Schrédinger equation for |:b(t)) leads to 
wi (8) Alo) + (P24) (88) een 


Bi = Ey =e (8.1.62) 


and we have the normalization condition 


where 


|a(t)|? + |b(t)|? = 1. (8.1.63) 


We may introduce the unit vector 


n(t) = (a(o" + a* (t)b(t), ifa(t)b*(t) — a*(t)d(t)], Ja()|? - [o(t)|?) 
(8.1.64) 


and an angular frequency vector 
1 
O=5(V+V*i(V—V*), hw) (8.1.65) 


depending on V, and it may, in general, be time-dependent. It is then readily 
checked directly from (8.1.61) that it may be rewritten in the form® (8.1.49), 
ie., 


d 
qn) =x n(t). (8.1.66) 


It is easily verified, that with the parametrization a(t) = 
cos (0(t)/2) e19/?, b(t) = sin (0(t)/2) &?/?, n(é) in (8.1.64) coincides 
with the expression in (8.1.54). The system in (8.1.61), in particular, will be 
considered in §8.12 in analyzing the fundamental problem of the rotation of 
a spinor, corresponding to a two-level system, by 27 radians. 

We close this section, by considering the following Hamiltonian to describe 
transitions in a two-level system® 
E, — Eo 


er ee ae apo Meal teeth (8.1.67) 


Hy = ah So Ag be tar 5) dL (8.1.68) 
k k 


where al, ar are the creation, annihilation operators introduced in (8.1.56), 


(8.1.57) and may be represented as 


al, = ( i) =ot (8.1.69) 


5 Feynman et al. (1957), see also see this paper for other related details. 
° The energy levels may be considered to denote the mid-points of the correspond- 
ing energy linewidths. 
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i « . =o7 (8.1.70) 


satisfying anti-commutation relations, and bf, by are creation, annihilation 
operators of a photon of energy fw, satisfying commutation relations. 

Unlike the interaction in (8.1.59), where V is a priori given external po- 
tential, the interaction in (8.1.68), which may also cause transitions between 
the two levels, is due to photons which are treated dynamically though in a 
simplified manner. 

Suppose that the two-level system is initially in the state |1), we then in- 
vestigate the nature of the survival probability (§3.5) for the two-level system 
to stay in the state (1 0)". 

To do this, we consider the Schrédinger equation 


d 

ih, WW) = HId@) (8.1.71) 
write 

|W(t)) = et#o/* | b(t) (8.1.72) 


where Ho is the free Hamiltonian part in (8.1.67), to obtain (the so-called 
interaction picture) 


in |@(t)) = H(t) |9() (8.1.73) 
where 


A(t) = eitHo/h Fy, @itHo/h 


=o" S- Ng by ewe -W)E 4 ge S- Ne of el eRmeye, (8.1.74) 
k k 


Hence upon writing 
|A(t)) 
|e(t)) = (8.1.75) 
|B(t)) 


in (8.1.73), we have 


: d —i(wp—w)t 
ihe Gn = 2oelns (we) | B(t)) (8.1.76) 


: d * i(wp-w 
ins |B) = OM bt eilea—“)# | A(z) (8.1.77) 
k 


where we have used the expressions for the matrices in (8.1.69), (8.1.70). 
Upon multiplying (8.1.77) by the operator b, and using the commutation 
relations of b,;, bi, we obtain 
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in (Olbde| BU) = Ax eilwe—w)t (Q| A(t)) (8.1.78) 


where we have also used the fact that (0| bh = 0, and |0), here, denotes the 
‘no-photon’ state. 
Equations (8.1.76), (8.1.78) then yield 


at 7 OlAW) = =p LD fare iwe-w)™ (Q| A(t—7)). (8.1.79) 


The survival probability of the system to stay in the state (1 0)" is then 
P(t) = |(0| A(t))|?. (8.1.80) 
To see this note that if we write |q(t)) = (a(t) a(t)", then 

(0| a(t) |A(t)) 

(0| B(t)) |B(t)) 

ew But/P (0) A(t)) 
= (8.1.81) 

e-iPot/h (0| B(t)) 


and |(0|a(t))|? = |(0|.A(t))|?, where we have used the fact that (0|b! = 0 as 
arising from the application of the free photon Hamiltonian to (0]. 

Using a continuous variable extension of the photon energy, replacing the 
sum over k in (8.1.79) by an integral over w’, and setting 


(0| A(t)) = F(t) (8.1.82) 


we may rewrite (8.1.79) as 


“F = -i2 fe [A(w") |? n(w ») far ei)" P(t— 7) (8.1.83) 


where A(w’) is a continuous variable extension and n(w’) denotes the density 
of such states. 
We use the relation 


Lesa d ev i(w’—w)r —1 
—i(w'—w)r _ 
e en | (8.1.84) 


and integrate by parts over 7 in (8.1.83) to obtain 


< F(t)  F@) (10 i < RIF: a) (8.1.85) 
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oo —i(w’—w)t _ 
I(t) = 55 [ du! |A(w")|? n(w’) en (8.1.86) 


—i(w! — w) 


R[F; t] = = [ow |A(w’) |? n(w’) 


0 
t £' —i(w’—w)t __ 1 
x at’ [rar - 
0 0 —i(w! — w) 


Equation (8.1.85) gives for the survival probability in (8.1.80)/(8.1.82), 


La Ft ny}. (8.1.87) 


P(t) = C(t) exp|—G(t)] ‘ 


G(t) = [ee [2@’) + 1° (t*)] (8.1.88) 


and 
C(t) = |exp(-RIF; t)|’. (8.1.89) 


The time-integral in (8.1.88) is easily evaluated to yield 


sin(w’ — na 
(w! — w)/2 


ob2)] obe2) [ee 


The function sin a/a peaks at the origin, and is concentrated mainly in the 
region |a| < a. We make the Markov approximation’ by assuming that 
|A(w")|? n(w’) is a slowly varying function around the point of resonance 
w’ = w and hence for wt/2 > a, it may be taken outside the integral in 
(8.1.90), evaluated at w’ = w, with increasing accuracy for t >> 1/w, thus 
obtaining 


C0) = ae fe WIP nw’ | 


2b fF° 
= dx 


- ne —wt/2 


G(t) ~ = |A(w) |? r(w) [Le =| (8.1.91) 
and for t > 1/w, 8 
2 
G(t) ~ o ONC (8.1.92) 


which allows us to set 


” The Markov approximation will be discussed further in §12.7 in terms of so-called 
correlation functions. 
8 Note that for iw expressed in &V, t>> 1/w~ (1 WV/hw) x 107" s. 
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ZS IMw)P nw) = 9 (8.1.93) 


For 1/y > t, but t >> 1/w, it is not difficult to show that if we formally 
replace (dF(t’ — r)/dt’) /F(t’) by 7, then C(t) ~ exp [O(772?)]. 

Accordingly, in the above mentioned time limits, we obtain for the survival 
probability P(t) the exponential decay law 


P(t)y~e % (8.1.94) 


with y defined in (8.1.93). For ¢ short enough, this defines a weak coupling 
limit by 1/7 >> t. The time t, however, is taken large enough so that t > w. 
In particular, the latter condition implies that we must have y < w which is 
a well known property relating decay widths [ = hy and energy shifts hw. 


Remarks 1 

1. The exponential decay law in (8.1.94) holds for t not too small and t not 
too large, as discussed above and is in the light of the general analysis of 
decay in §3.5. 

2. If one formally replaces |X(w’)|? n(w’) by a constant in (8.1.83) and ex- 
tends the w’ limit of the corresponding integral to —oo, thus obtaining a 
Dirac delta 5(r) in the integrand, then this equation may be readily in- 
tegrated to yield the exponential decay law in (8.1.94). Such a procedure, 
although used by some authors, involves, according to the above analysis, 
the implicit assumption of a limit set on the time variable, and the ex- 
ponential decay law (8.1.94) cannot hold true exactly for all t. Otherwise 
one would run into a contradiction with the general analysis of decay in 
83.5. 

3. Equation (8.1.83) may be also used self-consistently to investigate the 
behavior of the survival probability P(t) in the truly asymptotic limits 
t > 0, t — co by basing the analysis on some sufficiency convergence 
conditions to be satisfied and will be left as an exercise to the reader (see 


Problem 8.8). 


Another derivation of the exponential decay law (8.1.94) will be given in 
812.7 by working directly with the density operator. 


8.2 The Pauli Hamiltonian; Supersymmetry 


8.2.1 The Pauli Hamiltonian 


The non-relativistic quantum dynamical equation of a spin 1/2 charged 
particle of charge e in an external vector potential A and scalar potential U 
may be derived in the following manner. 

One may start with the Schrédinger equation for a free particle 


8.2 The Pauli Hamiltonian; Supersymmetry 433 
2 
p ae) 
— —ih— = 2.1 
& ; x) ee =O ee) 


which upon using the identity (7.4.4) leads to® 


- o)(p: 0) 
IM 


8 
ing | |v) =0. (8.2.2) 


Now one makes the so-called minimal coupling substitutions 


e€ ae) oO 
p—p- o™ ins so iha, —U (8.2.3) 


to obtain, by finally using the identity (7.4.31) (see also Problem 7.13), 


1 e.\2 ge = 0 
E (p <A) 78 -B+U] Ih) = ins Iw) (8.2.4) 
known as the Pauli equation, where S = ho/2, g=2, B=VxA. 

The so-called g-factor of the electron, for example, is given approximately 
byg= 2(1+a / Qn), rather than 2, according to the leading order correction in 
quantum electrodynamics, where a = e?/hc is the fine-structure constant.'° 
A computation of g within the realm of quantum mechanics for the inter- 
action of a non-relativistic electron with radiation will be given in §8.5. Its 
observational aspect will be also discussed there. 

By formally replacing geh/4Mc by w in (8.2.4) and then taking the limit 
e — 0 in the latter equation, one also obtains the Pauli equation for a neutral 
spin 1/2 particle interacting with the magnetic field given by —u- B, and 
ut = po denoting the magnetic dipole moment. This will be used in the study 
of the spin precession of the neutron in §8.4, where the radiation loss due to 
the interaction of its spin with the magnetic field set up by radiation is also 
investigated. 

An application of (8.2.4) was given in §7.9 for the interaction of spin 1/2 
with a constant sufficiently strong magnetic field B along the z-axis!! such 
that the potential Vr, responsible for the fine-structure of the hydrogen atom, 
is small in comparison to the magnetic field contribution, where U is the 
coulomb potential due to the proton. The Pauli equation in (8.2.4), as it 
stands, provides an approximation to the more precise treatment, including 
relativistic corrections given in §7.4, for the hydrogen atom. 


° See also §7.4, in general. 

10 For a phenomenological treatment of the g-factor, one may formally add a term 
—(ke/Mc)S - B to the left-hand side of (8.2.4) giving rise to —(ge/2Mc)S - B, 
where now g = 2(1 + #0 2)5 for the interaction with the magnetic field. This, 
however, is no substitute to a dynamical treatment of g as done in 88.5 for the 
electron. 

4 See (7.9.13) and below it, where the magnetic field is not too strong so that 
quadratic term in A may be neglected (see (7.9.6)). 
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For B a uniform time-independent magnetic field, say, along the z-axis, 
with U = 0, the eigenvalue problem corresponding to (8.2.4) will be studied 
in the next section, and is also discussed below to some extent, leading to 
the so-called Landau energy levels. Equation (8.2.4) will be used in §8.14 to 
study the quantum dynamics involved in the Stern-Gerlach effect. 

For a purely time-dependent magnetic field B(t), the contribution of the 
spin involves an overall multiplicative factor in the solution of (8.2.4) as 
follows. By setting, 


1o(o) = (ex |i f i'w Bt)]). 1o(t)) (8.2.5) 


where (-)+ denotes the time-ordered product (see Appendix to §2.5), then 
|p(t)) satisfies the simpler equation 


Er (p <A) u| |9) = in |6) (8.2.6) 


where w= po, pp = geh/4Mc in (8.2.5). 


In particular, for a constant magnetic field B = Bn, where n is a constant 
unit vector, (8.2.5) gives (see (8.1.12)), 


I) = [eos (2) ihalndiy (=) fic o| 6). (8.2.7) 


8.2.2 Supersymmetry 


In §4.7 through (4.7.33)—(4.7.38), we have seen that for a magnetic field 
B along the z-axis, one may define supersymmetry generators Q, Q' (see 
(4.7.34), (4.7.37)) such that in 2D and 3D, the Pauli Hamiltonian in (8.2.4), 
with U = 0, has the formal structure (see §2.9) of a supersymmetric Hamil- 
tonian 


1 
HS = i 2. 
2 {0.1} (6.28) 
taking the forms (a = p — eA/c) 
me + 12 eh 
oe B >: 
9M 4Mc*"3 29) 
and ‘i 5 ¥ ‘ 
_ TW +79 + 73 e 
H= an 9B (8.2.10) 


respectively, where g = 2, B= (0, 0, B), for a particle of mass M. 

One of the most attractive features of the supersymmetric Hamiltonian 
in (8.2.8) is the formal non-negativity of its spectrum (see (4.7.5)). 

In particular, for a constant magnetic field B = (0,0,B), B > 0 with 
vector potential in the gauge 
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B 
A= y (-¥ 2:0) (8.2.11) 


with motion restricted in 2D in the z-y plane, we will see in the next section, 
that the eigenstates are given by 


bnm,o(t) = Crm (ee By (p”) exp imd exp (—p”/2) i) 


o,-1 
(8.2.12) 
where n = 0,1,...; m=0,+1, +2,..., 
Mw \eB| 
2_ Mw o = 
a ae OS re (8.2.13) 
r=(a,y), w=rcosd, y=rsing (8.2.14) 
1/2 

Im|+m \y Im|=m \y 

Mw \ 1/2 (n+ \i(n4 Wy 
Cig == é é (-1)r+lmi+m (8.9.15) 

Qh (n+ |m])! 


for e <0 (as for the electron), and L!"!(p?) are the associated Laguerre 
polynomials introduced in (7.3.16) now of argument p?. 
The eigenstates satisfy the normalizability condition 


: Pr [bame(t)|? = 1. (8.2.16) 
The eigenvalues are given by 
1 
Enymyo = fw (n+ ml oe 5 + ta.) (8.2.17) 


fore <0,n=0,1,...; m=0,+1,..., and we have deliberately kept g in the 
expression for Ey, m,¢ in (8.2.17). 

For g = 2, the ground-state energy (which is infinitely degenerate) is zero, 
as it should be by supersymmetry, corresponding to n = 0,m = 0, —-1,—2,.... 
As a matter of fact a ground-state vector, corresponding to m = 0, or —1, or 

.., and n = 0, is given from (8.2.12) to be 


bom,-1(t) = yan (=) NO Nils (-Me? +09) (9). 
(8.2.18) 


On the other hand from (4.7.34) 


q=y2( (5+) + (+E V0) (8.2.19) 


for e < 0, satisfy the supersymmetry relations (4.7.6), (4.7.7). Condition 
(4.7.7) is trivially satisfied. Condition (4.7.6) is readily checked by explicitly 
applying Q in (8.2.19) to (8.2.18) giving 
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Qvo,m,-1 = 0 (7) (8.2.20) 


as expected for g = 2. 


8.3 Landau Levels; Expression for the g-Factor 


In this section, we derive the expression for the Landau levels quoted in 
§8.2 through (8.2.11)—(8.2.18), and also obtain a useful relationship between 
the g-factor and such levels. The latter will be used, in the process in §8.5, 
to compute the anomalous magnetic moment of the electron. 


8.3.1 Landau Levels 


Consider the Hamiltonian H in (8.2.9) with the electron (e < 0) restricted 
to a plane with a transverse constant magnetic field to it and vector potential 
in the gauge given in (8.2.11). 

The eigenvalue equation for the Pauli matrix 03 may be conveniently 


written as ‘ ‘ 
o,t1\ _ o,4+1 peices 
03 ay =o cs) « (ofS EI (8.3.1) 
where we note that 03 commutes with H. 
Let r 
TT +7 
Hy = 82 
oo (8.3.2) 
and hence H = Ho + |eB\higo3/4Mc, where 
Mw Mw 
™ =~Px-—-Y,  T2 = Py + ——2 (8.3.3) 
2 2 
(71, 72] = —iMhw (8.3.4) 
and w is the so-called cyclotron angular frequency given by 
w = |eB|/Mc. (8.3.5) 
Upon defining the operators (see also (4.7.34)) 
A=———(m+im), At=—2—=(m-im) (8.3.6) 
= ——(72 + im), = 12 — in 13: 
J2mhw A : 2mhw : ? 
we have 
[A, At] =1 (8.3.7) 
Hy = hw (A'A + 1/2) (8.3.8) 


[Ho, A] =—fwA, [Hpo, At] = +hwAl (8.3.9) 
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and the following commutation relationships with the third component L, = 
LPy — yp, of the orbital angular momentum 


[L., A] =—hA, [L,,A'] =+hAt (8.3.10) 


[L., Ho] =0. (8.3.11) 


The unit of energy fw is sometimes referred to as the Landau-Larmor energy. 

The commutation relations (8.3.7), (8.3.9) are just those of the harmonic 
oscillator problem. Hence, by using in the process of the commutativity of 
L, with Ho in (8.3.11), we have simultaneous eigenstates |N,m) of A‘ A, Lz, 


A'A|N,m) = N|N,m) (8.3.12) 
L,|N,m) = hm|N,m) (8.3.13) 

and by using (8.3.9), (8.3.11) and (6.1.30) 
Al |k,q) = Vk+1|k+1,¢4+1) (8.3.14) 


using a general notation of the states |k, q) for convenience. 
For the ground state energy of AtA, we have 


A'A|0,—q) =0 (8.3.15) 
Lz |0,—q) = —hq|0, —4) (8.3.16) 
A|0,—q) =0 (8.3.17) 


and, as we will see below, the square-integrability of a state |0,—q) requires 
that the integer g to be non-negative, i.e., g = 0,1, 2,.... 

From the definition of A in (8.3.6), (8.3.17) leads to the following differ- 
ential equation for (x, y|0,—q) = Wo,-4(#, y) 


| (> 1) (= ed ee (8.3.18) 


This equation together (8.3.16), then give the normalized solution 


(q+1)/2 
vo,—q(2,y) = a (=) (x—iy)? exp (- (a? + ?)) (8.3.19) 


and q= 0,1,... for square-integrability of Wo,_,(r), 
pa [o,-g(r) |? = 1 (8.3.20) 


where r = (2, y). 
Upon setting « = rcos¢, y = rsin®@, (8.3.19) may be rewritten as 
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VMal2ah (Mw .\v M 
= a ( ar’) exp(—ig@) exp (-42") . (8.3.21) 


4h 
For the purpose of obtaining all of the eigenvectors, it is convenient to set 


¥o,-q(t) = 


—_r=p. (8.3.22) 
In terms of the variables p and ¢, At in (8.3.6) may be rewritten as 


: OF WT @ 
At = pel? | ap RS EE (8.3.23) 


From (8.3.21), and (8.3.14) with k = 0, q > —4, 


(r|0,-—q) = - ee els (a [(0)"] exp(—igd) exp (—p"/2) (8.3.24) 


Vaio! 
w/2th | 9) (1-a)/2 2\q-1 2 
(e|1,1 4g) = EE (py [a - 09) 
x exp(i(1 — q)@) exp (—p?/2) 
= EE (02)? expla — 9)0) exp (97/2) 
x (1) [es ? (=) (Ayteso(-r) | (8.3.25) 


Therefore, as an induction hypothesis, suppose that 


cine a= J Mw /2rh 


Vgkt 


(jyee” exp(i(k — q)@) exp (—p?/2) 


k 
x (-1)* [- (=) (p?)2 exp -* (8.3.26) 


for some k > 1, which is obviously true for k = 0,1, then we show that (8.3.26) 
is true for k replaced by k + 1 as well. This directly follows by explicitly 
applying Al, in (8.3.23), to (8.3.26) and then using (8.3.14). 

It is not difficult to verify that 


p d ‘ 2\q,—-e° | _ —q : (=i) (p?)! 
f (gz) re [=cut CS Giese oi a, * 


8.3 Landau Levels; Expression for the g-Factor 439 
for k>q (8.3.27) 


and 


See : (-1)! (p?)! 
=(0") We Gig =O) ae 


for q>k (8.3.28) 


which are polynomials in p? of degree g. From (7.3.16), we here recognize 
the sums over | to denote [(|k —q|+7)!]~' times the associated Laguerre 
polynomial L}"~@!(p?), of argument p, where 


n = min(k, q) (8.3.29) 
using the normalization adopted in (7.3.16). 
That is, 
k m m m\|—m 
ar (a) (p)%e-#] = CUE A RIM lg?) 
dp? (Im| +n)! " 
(8.3.30) 
with 
m=k-—q (8.3.31) 
(see also (8.3.26), (8.3.13)). 
We note that (8.3.29), (8.3.31) allow one to write 
q=nt mmm = (8.3.32) 
page ea (8.3.33) 


All told, we have from (8.3.1), (8.3.2), (8.3.12), (8.3.13), (8.3.26), (8.3.29)- 
(8.3.33) for k = N, that the eigenvectors Wnm,o of H are given in (8.2.12) 
with eigenvalues in (8.2.17), where 


BR 


1 
ns (w Heck a 


5 ) (8.3.34) 


with N = n+ (\m| + m)/2 as given in (8.3.33) for k = N. [For a spin 0 
charged particle with e < 0 simply replace g by 0.] 

In particular, for the supersymmetric case (§8.2) g = 2, and the ground 
state energy 0 corresponds to n = 0, 0 = —1, and m= 0,1, —2,.... 

The results derived in this section, as they stand, are also valid for e > 0 
if the choice B < 0 is made, i.e., if the magnetic field is taken to be along the 
negative of the z-axis.!? 


12 In the literature the choice eB > 0 is sometimes made. 
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8.3.2 Expression for the g-Factor 


We derive a convenient expression for the g-factor, as arising from the 
Landau levels, which will be used in the process of a computation of the 
anomalous magnetic moment of the electron in §8.5. 

To the above end, it is more convenient to consider the problem in the 
momentum description, and work in the gauge specified by a vector potential 
(B > 0) 


A = B(—y,0,0). (8.3.35) 
In the momentum description, the Hamiltonian in (8.2.10) is given by 
1 /ilel\B, Oa > ph ope elk 
H= h B 3. 
2M ( Obs. :) 9M ' 2M‘ 4Mc?"$ eae) 


where now g is left arbitrary, and we note that p;, p3 are just c-numbers. 
We may then consider the limiting case with p,,p3 — 0 and work in one- 
dimension along p2. 
Upon setting 
c 
Q= jeBM (8.3.37) 
one is then led to solve the equation 


Ro? MO, 


wv =eWV 8.3.38 
2m Op a P2 (p2) = eV (p2) ( ) 
where for the energy E we have 
1 go 
E= 8.3.39 
M202° * 420 ieee 
o = +1, and the eigenvalues € are given below. 


From the harmonic oscillator problem, the eigenvalues in (8.3.38) are given 
by én = AQ(n+ 1/2), n =0,1,..., and hence we may write for E in (8.3.39) 
_ le|Boh 
E(n,0) = a7 [(4n + 2)o +g] (8.3.40) 
where we have used the fact that (7)? = +1. This leads to the following 
useful relationship upon differentiation with respect to B, 
2Mc O 


_ 97? 
= ae (8.3.41) 


B=0,p1,p3=0 


For the ground state corresponding to (8.3.38), we obviously have 


ma\*4 Ma 
Wolo2) = (4) exp (-32) (8.3.42) 


normalized with respect to the measure dp2. With a proper normalization, 
we will see in §8.5, how the physical 3D problem may be reduced to a com- 
putation in 1D for the anomalous magnetic moment of the electron. 
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8.4 Spin Precession and Radiation Losses 
We consider the Hamiltonian of a neutron, an uncharged particle of 
spin 1/2, in a uniform and time-independent magnetic field 
2 


Pp 


H = —— 
2M 


—pyo-B (8.4.1) 
where the magnetic moment [y is about 
[in = —9.66 x 107?" J/T. (8.4.2) 


The commutator of the spin S = io /2 and the Hamiltonian H is easily 
worked out to be 


[S, H] = 2ip,S x B. (8.4.3) 
From the time-development of an operator O(t) = e#/? Oe—@#/", we 
have és 5 
Ln 
—S(t) = — S(t B. 4.4 
< s(1) =H s(t) x (8.4.4) 
In particular, for components perpendicular | or parallel || to B, 
a? 4y2 B? 
S S(t) =0 (8.4.6) 
Taking expectation values of the above yield the solutions 
(S.(t)) = acos(wnt + 6) (8.4.7) 


(S\(6)) = (8)(0)) (8.4.8) 


where a is a constant vector, showing the precession of the spin about the 


magnetic field B, with angular frequency 


with the projection (S)(¢)) along B being constant in time. 
Let B = (0,0, B), B > 0, then due to the precession of the spin, the 


neutron, from an excited state ({). for |p| — 0, with energy 
BE, =(|p.|B (8.4.10) 
may fall to the ground-state (0 13 for |p| — 0, of energy E_ = —|p,|B 


by the emission of a photon of energy Aikc = 2|u,|B. The mean lifetime 
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of such a transition is, however, too large for all practical values of B. For 
B~1T, the mean lifetime 7 is ~ 10° s, and hence such radiation losses are 
not significant, in general, unless one is dealing with huge magnetic fields of 
millions T encountered in astrophysics. 

Due to the uncharged nature of the neutron the mean lifetime 7 is easily 
estimated. To this end, the total Hamiltonian of the neutron in the magnetic 
field B, in the presence of radiation is given by (see also 87.7) 


A, = + Ao, RAD — Mn03B — Uno “(Vv x Arab) (8.4.11) 


oie 
where Apap is the radiation field (87.7). 

go} we have, in a similar manner as obtaining (7.7.28), 
(7.7.37), to second order in the radiation field, due to the presence of radia- 
tion, the energy shift Ac, (see also (7.7.35)) 


Ac (2m)? 5°(p — p’) 
3 a5 
= p2(4rhe) 1D feara (5) i (io x k)-e 
1 


(p! — hk)? 
2M, 


For a state |p) (5) 


x 
p- 
2M, 


v(t <1) 8; (5) Ip) (8.4.12) 


where the +ie is to account for the decay under study. 


+ |imlosB — hlk|e + ie 


\Ln|B 


Let 
p? p? p’ - Ak h2k? : 
pe Alk = Ad 
i OM a oe18) 
then ; 

Se 

1 - D — 2\|py|B 8414 

—|Pn|B-|pnlosB : 1 eee 
D 


Also using (see (§7.7)) 


S° [(o xk) - ea] [-..] [(0 x k)-e,] = (0 X k)- [...](0 x k) 


» 


= (dank? — ka ky) oal-- Joo. (8.4.15) 
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From (8.4.14), (8.4.15), the expression for (8.4.12) becomes 


Aé(p) (27) 5°(p — p’) 


2 3 2 2 2 2 2 

os LUNE as Maes 9: Qk? 2 kR kB 
oie at) SAB BY a D "D—2QpalBS 
(8.4.16) 


Upon integration over p’, taking the limit p — 0, and performing the 
k-angular integration in (8.4.16), we obtain 


2h, 4 1 
Ae(0) = — Hae rar = 
w 3 Peed h°k ic 
= 
“OM 
2 1 
#3 ae (8.4.17) 
(nes ou 7 2|tn|B- i) 


The first term in the curly brackets is independent of B. For a neutron, in 
the absence of an external magnetic field B, in the ever presence of radiation 
interacting with its spin, the shift in energy in (8.4.17) should be zero. This 
amounts to carrying out a renormalization (see also §7.7, §8.5) by subtract- 
ing off the corresponding expression to the one within the curly brackets in 
(8.4.17) evaluated at B= 0.'8 


Accordingly, 
2 wh 1 
AeRen (0) a a [reat are 
T 
hk — 2|u,|B-i 
( c+ or [in| ie] 
1 


Pe (8.4.18) 
(nie + aes i) 


By taking the imaginary part of the above (see (7.7.30)—(7.7.32), §7.8), 
we obtain for the decay constant!4 


'3 Actually this term does not contribute to the decay rate (or the mean lifetime) 
we are seeking. 

14 Decays due to precession of the spin in a magnetic field was also previously 
considered, by different methods, cf., Stump and Pollack (1998). 
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el a fe Ak? — |p| B hk? 
pas fe dk {o(k+ sar. Fe b(k+ sae} p- (8-419) 


For most practical cases (see (8.4.2)), |un|B << Mc?, and (8.4.19) gives 


32 [unl B® 
~ 3 A833 


(8.4.20) 


where the second term in (8.4.19) does not contribute. 

For B ~ 1 T, the mean lifetime is t = h/T’ ~ 107° s, which is incredibly 
large and not significant unless huge magnetic fields are considered within 
the realm of astrophysics of millions T. 

The real part of (8.4.18) gives rise to a very small correction (contribution) 
to the magnetic moment ji, (see Problem 8.9, 88.5). 


8.5 Anomalous Magnetic Moment of the Electron 


In this section, we consider the measurement as well as an explicit com- 
putation of the deviation (g — 2) of the gyromagnetic factor of the electron. 

We first consider the Hamiltonian of an electron in interaction with a 
uniform and time-independent magnetic field B, 


7 eg 


2M 2Mc ee 
where . 
m=p——A (8.5.2) 
with 
B=VxA (8.5.3) 
and 
[7 2k] = “Rk (8.5.4) 
Cc 
FoF — gil Bl (8.5.5) 
[$7 538"| Sahel 6" (8.5.6) 


The following commutation relations with the Hamiltonian are then read- 
ily derived 


(x, H| = —71xB (8.5.7) 


Sx B. (8.5.8) 
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From the time-development of an operator, O(t) = e#/" Oe-“4/", we 
then obtain for expectation values 
< (n(t)) = << (net) x B (8.5.9) 
dt ~ Me ~ 
< (s(n) = = (S(0) x B (8.5.10) 
dt ~ 2Mc , > 
In particular for components parallel || or L perpendicular to B 
d d 
a (7 (t)) = 0, a (S\(t)) =9, (8.5.11) 
d? e? B? 
ap (mL) = — Fae (1) (8.5.12) 
d2 e2g2B? 
ap (Si) Se (S(t) (8.5.13) 
giving rise to the solutions (|B| = B) 
eB 
(7(t)) = by cos (Sales) +be (8.5.14) 
egB 
(S(t)) = ¢ cos (Ee t+ vs) +9 (8.5.15) 


where b,, bg, ci, C2 are constant vectors and 61, d2 are phase factors. That 
is, (7(t)), (S(t)) would precess with the same angular velocity only if |g| = 2. 

We next consider the measurement of (g — 2). Its computation will then 
follow this study. 


8.5.1 Observational Aspect of the Anomalous Magnetic Moment 


With 7t/M denoting the velocity operator, we determine the commutation 
relation of S-7t with H. As we will see below, this commutator is proportional 
to (g — 2) and hence would vanish if g were equal to 2. 

A direct evaluation, leads to 


[Sat | = Jet (25itckmn — gel? 5°™) n™ BS! (8.5.16) 


from which we obtain (|B| = B) 
ihe 
2Mc 


[S-7,H] =— (g-2)(S x m)-B (8.5.17) 


[(S x m)-B,H] = Re (9 _ 9)(By? (5 7 7) Sinl (8.5.18) 


~ 2Mc 2 
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and 
(ehB)? ot, . EBINS e 4 
[[s +7. HT] H| = a (9 - 2)? (8% - = | inl, (8.5.19) 
We note that OF 
. BBI\ .. 
(s ae ) Siti =S, omy (8.5.20) 


corresponding to components perpendicular to B, and from (8.5.16), 
[S\ - =, H] =0. (8.5.21) 


According to (8.5.19)—(8.5.21) we then have for expectation values 


_ (S(t) . 7m (t)) pee e ae 2) (Si (2) A 7m, (t)) (8.5.22) 
< (S\(¢) : m™\(t)) =0 (8.5.23) 


giving rise to the solution 


|eB(g — 2)| 

(S(t) - (t)) = co cos ( oMe t+ 5) + do (8.5.24) 
with co, do denoting constants. 

Hence for g 4 2, (8.5.24) shows a periodic behavior in time with angular 
velocity 

_ leBlg — 2)| 
a SOME 

and period T = 27/w,. For a given B and the observed period, the anomaly 
(g — 2)/2 is readily determined?’ and is consistent with the value(g — 2)/2 = 
a/2m evaluated first by Schwinger using (relativistic) quantum electrody- 
namics. Actually, this anomaly has been evaluated to higher orders in the 
fine-structure constant and the agreement between theory and experiment is 
quite impressive. 


8.5.2 Computation of the Anomalous Magnetic Moment 


The expression for the eigenvalues in (8.3.40), allowed us to write the 
deviation (g — 2)/2 in the convenient from (8.3.41): 


—E(n=0,0 = -1) =- (8.5.25) 


(see also 8.3.1). 


'® Cf. Wilkinson and Crane (1963). 
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As a result of the interaction of the electron with the ever present radiation 
accompanying it, the value of g deviates slightly from 2 and this correction 
is of the order a — the fine-structure constant. A formal derivation of this is 
given below. 

The total Hamiltonian for the interaction of an electron with a uniform 
magnetic field B = (0, 0, B), B > 0 and the radiation field Apap (see also 
87.7) is given by 

Hy =HA+ Ho, RAD + Ay (8.5.26) 


where H is the Hamiltonian in (8.4.1) in the presence of the magnetic field 
B, and 


le| 7 2 
ips 2 Re ra -A meeey.< 
I= 5 a RAD*P+P RAD) ar Me *RAD 
le|B lelhg / 
"9M x2\e|ARap1 + AME (V x Arap) + bH. (8.5.27) 


taking the spin of the electron into account, and where 5H? is a renormaliza- 
tion counter-term whose nature is specified below. 
Let 


|n = 0) G = |¥o) (8.5.28) 


corresponding to n = 0, o = —1 and suppressing for simplicity the depen- 
dence on 74, p3. 

We work out the correction (g — 2)/2 to the order a as arising from the 
presence of radiation. 

To the above end, we denote 


0) |0) = |; 0) (8.5.29) 


0) KA) = |0; KA) (8.5.30) 


as eigenstates of H + Ho pap in (8.5.26), with |0), |kA) denoting, respec- 
tively, a no-photon state and a single-photon state with momentum Ak and 
polarization specified by A (see §7.7). 

Finally, let |¢o) denote the state corresponding to n = 0, o = —1 for the 
electron in the magnetic field B, in interaction with radiation, 


Hr |b0) = Eo |¢o) (8.5.31) 


such that if Hy is formally replaced by zero, then |¢9) = |q9;0) involving no 
photons. 
Following the derivation of (7.7.23), we may write 


(v0; 0|Ai|¢0) 


EO ec ia) 


(8.5.32) 
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for the energy shift in the presence of radiation, where 
Eo = E(n=0,0 =-1). (8.5.33) 


To second order e? in radiation, we then have from (8.5.32), (8.5.27), in a 
similar manner as in (7.7.28), 


ah? 1 


An? M? (wo | Vo) 


a ~/F (bol ma a ny Io) 
a lke Eo — H — hilk|c A 


e (Wo; 0|5H2|¢0) 
+ —— (0|Agapl0) 4 “ 8.5.34 
magee 0lAnaDl) + C5, 07d) ren 
where 
e|B ih 
7 = p-e)(k) — 8 eyi(k) + (6 x k) - e,(k) (8.5.35) 
and to second order e? in radiation, we may set g = 2 in (8.5.34). 
In detail, 
ikex ry—ikx _ (P—fik)? — |e|Bao 
H = hk 
? . 2M ie PP) 
lelRBg_  e? B? as 
4Mc ° | 2M 
“Ak hk? B 
a a; _ Ban (8.5.36) 


2M Mc 


It is convenient to work in the momentum description, and take the limits 
D1, p3 — 0. We set g equal to 2, Fy = 2 in (8.5.34) (see also (8.5.25)). For the 
wavefunction, as a function on its dependence on p2, we have 


May\/4 MQ 
(p2|Wo) = (=) exp (-»2 m) = Wo(p2) (8.5.37) 
Cc 


With the normalization factor (Wo|Wo) for the three dimensional motion, as 
initially appearing in (8.5.34), all reference to pi, p3 which are formally taken 
to go to zero, disappears. 

To make the dependence of AF on B explicit, we make the change of 
variable pp — €, with 
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el|hB 
p2= eee u &: (8.5.39) 


Then for an operator F'(p2, 0/Op2) 


[2 Vo(p2) F (os. i ) Vo (P2) 


_ 22 dé —€?/ \e|AB Cc 0 —¢?/ 
-[ Ze *#(y - g, ane ve} ° 2 (8.5.40) 


where we note that Wo(p2) is properly normalized with respect to the measure 
dpo. 
Accordingly, (8.5.34) becomes 


ah? dk f® dé 
NS CMe pcos 
ee fels 


: 
—€?/2 0 ee eee + (0 —€?/2 
° f (i) harem (2) 


(wo; 0|5H2|¢0) 


+ 0/A2.5|0 8.5.41 
rire (lARaDI) + rohan oon 
where 
2 
y= \e|AB O | \e|AB 4 lelfi 38 
2Mc 0€% 2Mc 2Mc 
ene hke nk? ene hk, . 0 
\/ 8.5.42 
i 2M * M "OE ( ) 
a 1 /leAB . A ih 
h= fe2— 5 ening + 5 (oxk)-e,. (8.5.43) 


With the parameters M, e appearing in (8.5.41) taken to de- 
note renormalized, i.e., the physically observed ones (see also 87.7), 
(40; 0|5H/|¢0) /(Wo;0|¢0) is so chosen such that there is no energy shift 
(AE = 0) for B = 0. Physically, this is a condition of the stability of the elec- 
tron in vacuum in the ever presence of radiation accompanying the electron. 
In the literature, the c-number (49; 0|5H{|¢0) / (40; 0| Go) is referred to as a 
contact term. 

Hence 

AE =A\E+A,E+A3E (8.5.44) 


where 
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ah? eine d3k dae /O\. 
A\E= en /2 
: An? M? » ic [k| G 
x Qr —— Qx (°) e€7/? (8.5.45) 
H’ + hilk|c 1 
1 
Qr = err — 5 ea ix (8.5.46) 


[| eine | iy [O\ 
AsE = e & /2 
: iP aa [k| (:) 
1 —ih 
k)- 
* lO. ame ( 2 Sis )-e 
ih 1 0 —€7/2 


ah? dk f° d& _-2)5 (0\" 
= -€/2 
O88 =~ FMP a use(s) 


x zi (o x k)-e : : 
2 “| A’ + hlkie tk Ak? 
Iklc+ Say 


x (+) (o x k)-e, ) rac (8.5.48) 


In a non-relativistic setting, starting from a non-relativistic Hamiltonian, 
integration over h|k|c beyond the natural cut-off Mc?, corresponding to the 
rest mass energy of the electron, is not justifiable. Accordingly, we cut-off the 
k-integrals in (8.5.45), (8.5.47), (8.5.48) by imposing the restriction fi/k|c < 
Mc?. The very welcome damping in |£| by the Gaussian function in these 
integrals should be also noted. 

Since we are interested in the dependence of AF only up to the first 
power of B, we immediately obtain, by using the completeness relation of 
the photon polarization vectors in (7.7.4), and the elementary averages over 
the angular integrations in (7.7.43), 


A\E= 


ah? |e|hB 8m ee kdk 
4n?M* cc 3 Jo hk? 
hk 
| c+ aT 
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Zz a) (Si) (+) In (3) (8.5.49) 
linear in B. 


The computation of the linear dependence of A3F on B is also relatively 
easy. To this end, we note that 


(k+ (ox k))” 
7 


S (0 x k) +e, (0 Xk) +e, = (0 Xk)” =0 (8.5.50) 


= (i de x k)+e, 03(0 X k)-e) (7) aia) oe 


According to (8.5.50), (8.5.51), we may effectively make the replacement 


\e|AB 
1 1 2M °° 
8.5.52 
H’ + hlk\c h2k? 2k? 2 ( ) 
in (8.5.48). 


From (8.5.51), (8.5.52), we then have 


ah? (/\elAB\ (h2\ 8x fM@e/h kak 
A3E = 
4n? M? \ 2Mc 4) 3 Jo h2k2 


D) 
[nk + aT 


- () (a) : hn (5) | (8.5.53) 


for the linear part in B. 
The first term within the square brackets in (8.5.47) is effectively 


(ik) (RY [lelhB a) 
(2) (az) Ve (Gee getge) 


x Gz + ky ix) (o x k) “ey (8.5.54) 


multiplied by [A|k|c+ hk? /2M] ae By summing over A, using (7.7.4), noting 
the fact that the k-angular integration leads to the rule spelled out in (7.7.43), 


452 8 Quantum Physics of Spin 1/2 & Two-Level Systems 


and anticipating the spin average to be performed in (8.5.47) with the simple 
observations that 


0; (1) = , g=1,2 (8.5.55) 


G is (7) af4 (8.5.56) 


we may effectively replace (8.5.54) by 


ih h leJAB 4n,5f ..0 iO 
k 8.5.57 
(3) Gaz) os (ae + oar) raee 
after carrying out the k-angular integral. 


Similarly, the second term within the square brackets in (8.5.47), after 
carrying out the k-angular integration, may be effectively replaced by 


ih h le|hB 4x, /i,dO .O 
(3) Gar) V ae Ce ae) oe 


multiplied by [hlk|c + hk? /2M]~°, giving for the expression within the 
square brackets in question the effective net result 


2 
o\" 0 — ene Ark? 
foo = = Z 5 (8.5.59) 
1 1 h2k? 
rikle+ sar | 


in (8.5.47) after the angular integration. 
After carrying the elementary €-Gaussian integral, we have from (8.5.59), 
the following expression for A2F in (8.5.47) 


Ase = (2) lelAB\ ( h4 Me/Rh k3dk 
2" \On) 2M) \M?) Jo Reh? 
[rte + S| 

2M 


= (=) (a) 4 hn (3) = ;| (8.5.60) 


for the linear part in B. [We note that there is no term proportional to B!/? |] 
All told, (8.5.44), (8.5.49), (8.5.53), (8.5.60), give 


AE = (=) (2) EF 21n(5)| (8.5.61) 
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for the linear part in B. 
From (8.5.61), (8.5.25), we may finally infer that 


y—*) = (=) + 21n (3)| (8.5.62) 


which is quite accurate since 
16 3 
| —2In (5) ~ 0.97 (8.5.63) 


and compares well with the fully quantum-electrodynamic value a/27.1° 


8.6 Density Operators and Spin 


Detailed accounts of density operators were given in §1.5, 81.6 and §5.4 
with the latter section dealing with spin 1/2 and arbitrary spins as well. Here 
we study general aspects of such operators as applied to concrete physical 
situations for spin 1/2 particles. 


8.6.1 Spin in a General Time-Dependent Magnetic Field 


As a first application, we consider the spin Hamiltonian in (8.1.45) for 
the interaction of spin 1/2 with a general time-dependent magnetic field: 


H(t) = —K(t)-o. (8.6.1) 


The initial state |7(0)) at time t = 0 is prepared to be an eigenstate 
of no - o for some vector nop = (sin 49 cos %o, sin Op sin go, cos 00). With such 
an initially prepared state as given in (8.1.48), the time t = 0, the density 
operator (0) works out to be 


p(0) = }(0))((0)| = 5 [1 +0 + 6] (8.6.2) 


where 1 is the 2 x 2 unit matrix. 
The time t > 0 density operator p(t) is then given by 


roe ; [1 + P(t) «o] (8.6.3) 


where for the average of spin at time t > 0, divided by h/2, we have 


'® There is a long history of the computation of (g — 2)/2 within non-relativistic 
quantum mechanics. For other attempts, cf. Arunasalam (1969), Grotch and 
Kazes (1977). Arunasalam also deals with Landau states. 
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Tr[op(t)] = P(t) = n(t) (8.6.4) 


and we have identified the latter with the unit vector n(t) in (8.1.54). P(t) 
is referred as the polarization vector. For t = 0, P(0) = n(0) = no. 

The equation for the time development of the polarization vector P(t) was 
already given in (8.1.49). Here it is more instructive to derive this equation 
from the equation 


ae) 
in< a(t) = (H(t), elt) (8.6.5) 
for the density operator p(t), which from (8.6.3), (8.6.1), leads to 
0 
ae) -o = (K(t) x P(t)) -o. (8.6.6) 


Upon multiplying the latter by o, and taking the trace gives 


0 


ay P(t) = K() x P(t) (8.6.7) 


consistent with (8.1.49) as expected. 

As another application, we consider the scattering of a spin 1/2 particle 
off a spin 0 target. Later on, we will treat the problem of the scattering of a 
spin 1/2 particle off a spin 1/2 target. 

8.6.2 Scattering of Spin 1/2 Particle off a Spin 0 Target 


For particles initially prepared to be polarized,!’ say, along the x-axis, 
the initial state may be taken, up to a phase factor, to be 


ie = (;) : (8.6.8) 


Similarly, for polarizations a long the y-, z-axes, we have respectively, up to 


phase factors, 
1 /1-i 1 


These states lead, respectively, to the following initial density operators 


mk fe ie glee 10 
Gree Ne 
a @ ae D € a G s) e010) 
with corresponding polarization vectors 
P® : (1,0,0), (0,1,0), (0,0, 1). (8.6.11) 


1” Polarized along the z-axis means, that the polarization vector as an average 
value has only an x-component. 
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The derivation of the expression for p“, P® for an arbitrary initial state 
is straightforward (see Problem 8.11). 

On the other hand, for an unpolarized beam, the density operator p©) 
may not be presented in the form as |7);)(;| for some |7;). Such a density 
operator defines a mixture (see also §1.5) rather than a pure state some of 
which are given in (8.6.10). For an initial unpolarized beam, it is easy to see 
that the density operator is given by 


;) 1/10 
(i) = 
Put 5 G ? (8.6.12) 


PO=Tr lont?| =0 (8.6.13) 


giving as expected, 


with the letter u standing for ‘unpolarized’. 

To find the general expression for the density operator p after the scat- 
tering process, suppose that the initial and final momenta of the projectile 
are, respectively, hk’ and hk. To this end, if |w) denotes the final state, we 
define a 2 x 2 matrix M such that 


be) = M |pi). (8.6.14) 


That is, for initially polarized, and also for initially unpolarized projectile, 
the final density operator p may be obtained from 


p=M p® Mt. (8.6.15) 


For the problem at hand, the general structure of M is readily obtained. 
To this end, using the fact that 1, and the Pauli matrices 01, o2, a3 constitute 
a complete set of matrices in the vector space generated by 2 x 2 matrices, 
we may write 


M=al+n-o (8.6.16) 


where n is a unit vector, a, 3, are in general, appropriate function of k’, k. 

If we assume the invariance of the theory under space reflection, i.e., under 
parity transformation, then using the fact that o (spin, angular momentum) 
is an axial vector (§2.6), n is to be chosen as an axial vector, constructed out 
of k’, k and a, @ are invariant under the transformation k’ > —k’, k > —k. 
That is, in particular, 

/ 
ane (8.6.17) 
|k' x k| 

(for |k’ x k| 4 0). a, 6 are, in general, functions of the scalar product k’ +k, 
Ik’|, Ik. 

With n given in (8.6.17), it is convenient to choose the coordinate system 
as shown in Figure 8.1, with n along the z-axis. 

With the choice of the coordinate system in Figure 8.1, the matrix WM 
takes the simple form 
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Fig. 8.1. The coordinate system for the scattering problem is conveniently 
chosen with k’, k in the z-y plane with k’, say along the x-axis. The unit 
vector n in (8.6.17) is then along the z-axis. 


0 a-®B 


For an initially polarized beam along the y-axis for example, we have from 
(8.6.18), (8.6.10), 


Tf late -i(a+8)(a* — 6°) 
p= (8.6.19) 
i(a* + B*)(a — B) ja — 5)? 


for the final density operator. This gives the probability density F,(v), at 
scattering angle ¢ = ¥ (see Figure 8.1),'® 


ma (*4? ‘ ); (8.6.18) 


F,(0) = Tr[p] = ; (la + BI? + |a — Bl’) = Jal? +] 6) (8.6.20) 


indicating, for simplicity of the notation, only the scattering angle V of the 

projectile. For the final polarization we have 

(2|a||4| sin 6, Jal? — |8]?, 2)a|| cos 5) 
la? + [BP 


Tr[op] 
Trp] 


where we have written 


=P, (8) = (8.6.21) 


18 F,(0) is related to the so-called differential cross section. Differential cross sec- 
tions will be studied in detail in Chapter 15. 
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a = |ale®, B= |Ble®, ef = ei(b2-41) (8.6.22) 


With the definitions in (8.6.22), the matrix M in (8.6.16) may be rewritten 
as 


M =e'* (all + |Ble*n - c) (8.6.23) 


and the knowledge of |a|, |G], e® determines the scattering problem. The 
overall phase factor exp[id;] in (8.6.23) does not contribute to (8.6.15). 

Of particular interest is the case of an initial unpolarized beam. The 
reason is that upon scattering, the beam becomes polarized along the z-axis. 
To this end, po in (8.6.12) gives, from (8.6.18), the final density operator 


1 la + 6)? 0 
pu=5 (8.6.24) 
0 ja— pl? 
with final probability density F,,(0) at scattering angle v, 
Fu(9) = Jal? + |8)? (8.6.25) 
and polarization vector 


(0, 0, 2|a||B| cos 6) 
lo? + [8]? 


P,(v) = (8.6.26) 


With an initially polarized beam along the z-axis, we have similarly, 


la + Bl? 0 

pz = ¢ F,(9) = |a+ Bl? (8.6.27) 
0 0 

P.(9) = (0,0, 1). (8.6.28) 


For a given angle J, probability densities such as F,(0), Fy(—¥), for spe- 
cific initial conditions given by p“, and polarization vectors, such as P,,(v) 
in (8.6.26), the numbers |a|, ||, exp[id:] defining the general matrix M in 
(8.6.23), up to an overall phase factor, may be determined from the underly- 
ing dynamics. 

Before discussing the scattering of particles of spin 1/2 off a target of 
spin 1/2, we recast the above problem in a more general form. 

We denote the 2 x 2 identity matrix 1 by oo, and write 


3 
p= So Cons (8.6.29) 
u=0 


Upon multiplying the latter by o, and taking the trace, one obtains 


1 F 1 i 
es @) a= (i) 
Cy= 5 Tr loup =5 (ov) (8.6.30) 
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where (a9) = 1 with Tr [p] normalized to unity. This leads to the expres- 
sion 


i 1 i 
pi = 3 Sey ae; (8.6.31) 
u=0 
and for the final density operator 
3 
1 i 
p=5>, (o,)% Mo, Mt (8.6.32) 
u=0 
giving 
3 
1 
Tr[o,p] = 5 S- (o,)% Tr [o, Mo, M?] : (8.6.33) 
p=0 
In particular, 
3 
1 : 
FO) =5 S- (ou) Tr [M 0, M"] (8.6.34) 
p=0 


for the probability density at angle 7, and with 


Trlovp| é. 
ia (o,) (8.6.35) 
we have 
3 
F(0) (o,) = ; S- (ou) Tr [o, Mo, M1}. (8.6.36) 
p=0 


Equations (8.6.34), (8.6.36) give the final expectation values of o, in terms 


of the initial expectation values of the o,, operators. 


For an initial unpolarized beam, (o,)% = 6,0, hence (8.6.34) gives 


1 
F,(0) = 5 Tr [M M"| (8.6.37) 
and (8.6.36) reduces to 


Fil) (on), = sik [o, MM"). (8.6.38) 


For a beam initially polarized along the z-axis, (o,)% = dn0 + du3 (see 
(8.6.11), 


1 1 
F,(9) = 5Tr [M M"] + 5a [M03 M"). (8.6.39) 
From (8.6.18) one explicitly has 


Tr [M o, M'] = Tr [Mt o, M| (8.6.40) 
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which by using (8.6.37), (8.6.38), (8.6.39) leads to the interesting result 
P,(8) = Fa(8) [1 + (08) (8.6.41) 


showing the modification of the probability density due to the initial polar- 
ization of the beam over the density with initially unpolarized beam. We note 
that F,(0) is expressed in terms of expectation values for the final state of 
the scattering problem with the initially unpolarized beam. 

The sign of (o3),, in (8.6.41) determines an asymmetry in the number of 
particles with spin along or opposite the orientation of the z-axis as arising 
from an initially unpolarized beam. Such an asymmetry is then reflected in 
the observed probability density F’,(v). 


8.6.3 Scattering of Spin 1/2 Particles off a Spin 1/2 Target 


T) 


We introduce Pauli matrices o(®), o(™ corresponding to the beam and 


target, and unit matrices 18) — oP) 17) = of) operating in their respec- 
tive spin spaces. 

The initial density operator p“ may be quite generally expanded in terms 
of the sixteen components!? 


og (T) Sie ght), “vey of Vg) (8.6.42) 


3. 3 
=" Cwa, os”. (8.6.43) 
u=0 v=0 
Upon multiplying the latter by of B) (tT ) , taking the trace and using the fact 
that the initial polarizations of the beara and the target are independent, we 


obtain 
1 (B) (7) g(t) Le py ee) 
One = 3 Ts [of Ve z (a8 oO; ) (8.6.44) 


for a normalized density Tr [pe] =. 
The final density then takes the form 


at (B)y (7) Mo® gD yt, (8.6.45) 


p=0 v=0 
For initially unpolarized beam and target, (8.6.45) gives 


F,(9) = 7a [MM"] (8.6.46) 
and 


1° These sixteen matrices may be rewritten as 4 x 4 matrices, cf. Fernow (1976). 
See also this elegant paper for other details. 
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F,(d) (of) S rik [omit (8.6.47) 
F,(0) Cay = rik [oem (8.6.48) 


and as a correlation of the polarizations of the beam and target 
1 
F,(8) (oo) = zr [ook emt . (8.6.49) 


For initially polarized beam and target along ppz, prz, respectively, where 
z is a unit vector along the z-axis and pp, pr take on the values +1, 


(i) 
(o®af™) = 5209v0 + PBOpn39v0 + PT 00v3 + PBPTOu30v3 (8.6.50) 


where again we have used the fact that the initial polarizations of the beam 
and target are independent. Equation (8.6.50) gives the probability density 


1 i 
F (0; pp, pr’) = i [MM*) + PR Tr [Mof?) Mt] 


1 1 
+ Ree Tr [MoS Mt] + qPBPT Tr [Mo§? of" ai ’ 
(8.6.51) 
For a process which is invariant under space reflection and time reversal, 


the matrix M(k’, k, o®), o‘T)) expressed as a function of the unit vectors k’, 
k and the Pauli matrices, satisfies 


M(k',k, o®), 6) = M(-k’,-k, o®, 6) (8.6.52) 


under space reflection and 


M(k’,k,o® 0) = M(—-k, —k’, -0® , 6) (8.6.53) 


under time reversal. 
Below we will see later that under the constraints (8.6.52), (8.6.53), the 
following equalities follow, 


Lemma 1. 
(i) Tr [M o®)() Mt] = Tr [Mt o®)() My] 
(ti) Tr [mM og Mt] = Tr [ui oP) gh) M| 


From (8.6.46)—(8.6.49) and the above two equalities give for (8.6.51) 


F(sprd,pr@) = Fy(0)]1-+pe (0), px (oS?) 
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+ PBPT (of?o$") | ; (8.6.54) 


This equation shows how the probability density is obtained, with the beam 
and target initially polarized along pgz, prz, respectively, from that with ini- 
tially unpolarized beam and target. The first term within the square brackets 
on the right-hand side of (8.6.54) corresponds to the case that neither the 
beam nor the target is polarized (from initially unpolarized ones), while the 
second, third and fourth terms correspond, respectively, to the cases where 
only the beam is polarized, only the target is polarized and finally both the 
beam and target are polarized. 

The proof of the above lemma is straightforward, and easily follows by 
finding, for example, the general expression for M under the constraints in 
(8.6.52), (8.6.53) as done below. 

The matrix M may be written as 


M = A1® +B. o®) (8.6.55) 


where in addition to k, k’ the matrices A and B may depend on o?). 
Consider the three orthonormal vectors 
k—k k+k k xk 
= KX roe) b=— ar OO = 
|k —k | |k+k | |k x kj 


(8.6.56) 


for non-zero denominators, where c is a unit vector along the z-axis. Since 
o 7) is an axial vector, A is of the form 


A=ay1™ + age- 0, (8.6.57) 
On the other hand, the vector B may, in general, be written as 
B= fiat Bob + 3c. (8.6.58) 


Now it is a simple matter?? to show that the constraints (8.6.52), (8.6.53) 
imply that 


By =aga-o', By = ayb- oo) (8.6.59) 
83 = aol ® + ase- 6” (8.6.60) 
where ao,...,@5 are scalars. 


The expression for M in (8.6.55) then gives explicitly 
Tr [Mo{ ut| = 21 [Api] +210 [B, Al] - 2ieg TY [B.B}] (8.6.61) 


20 Cf. Goldberger and Watson (1964), p. 391. 
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where Tr’ corresponds to tracing over the target variables only. Using, in 
the process, the orthogonality of the vector in (8.6.56), and the structure of 
B in (8.6.58)—(8.6.60), this shows that the last term in (8.6.61) is equal to 
zero. This establishes part (i) of the lemma for o) upon the comparison of 
(8.6.61) with Tr [Mt] . The proof of the part (i) for o( is similar. 

The proof of part (ii) of the lemma easily follows (see Problem 8.14 (ii)) 
by noting, in the process, that 


Bs = 83 = 0917 + aso$” (8.6.62) 


which depends on the component of?) of o) only. 
Incidently (8.6.55), (8.6.56)—(8.6.60) give the following general expression 


for M: 


M =qQ11®1™ + n0%-ao™-a + mo®-b o™-b 


+ no®)-c of «6 + m4 (oP) 1” +1) 97) = 


ae ne (om = 19) g) Le. (8.6.63) 


8.7 Quantum Interference and Measurement; The Role 
of the Environment 


In §1.10, an example was given of the interaction (see (1.10.29)—(1.10.31), 
(1.10.35)—(1.10.38)) of a spin 1/2 physical system and an apparatus, also de- 
scribed as a spin 1/2 system, and studied the role of quantum interference 
(see (1.10.52), (1.10.53), (1.10.54)) in the presence of the apparatus allowing 
imperfection for the latter, specified by some parameter «, in the measure- 
ment process. We have seen that up to small corrections corresponding to 
a small «, how this example provides a model for the disappearance of in- 
terference (see (1.10.56)) by the mere presence of the measuring and unread 
apparatus. The limit « — 0, defines an ideal apparatus for which a perfect 
correlation occurs between the physical system and the apparatus as given 
in (1.10.31). 

In this section, we show how the above process may be formally im- 
plemented by an interaction Hamiltonian, involving the system and meter 
variables, leading to the unitary operator spelled out in (1.10.35)—(1.10.39). 
We also provide another illustration of the interaction of the spin 1/2 system 
with an apparatus with the latter described by a harmonic oscillator in a 
coherent state. We will see how this example may be used to provide a model 
with an almost perfect correlation occurring between the system and the ap- 
paratus with a built-in imperfection in the latter naturally arising from the 
non-orthogonality of the coherent states. 
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Finally, we consider the role of the environment, surrounding a quan- 
tum system (e.g., as a response of excitations generated in some medium 
nearby,...), as part of the measuring process, on the meter readings. The 
environment, coupled to the meter variables, consists of everything else mon- 
itoring the observables being measured and provides, as one may argue, the 
different alternative readings of the meter being sought. This gives a natural 
way of producing classical correlations between the system and the detector 
(meter) eliminating the coherence between different states thus destroying 
quantum superpositions. The destruction of the superposition, referred to as 
quantum decoherence, makes sure that the system is in one of its alternative 
states rather than in a superposition of them. The information thus obtained 
on the system by the meter, “hooked up” to the environment, can be then 
described in usually perceived classical terms. 


8.7.1 Interaction with an Apparatus and Unitary Evolution 
Operator 


To describe the apparatus in §1.10 as a spin 1/2 system, we introduce 
its spin variables operating in the apparatus vector space. To this end, we 
introduce the spin operator hX/2, with 


= f 4 4 ee i} a yi a= F | (8.7.1) 


which are the Pauli matrices operating in the vector space in question. To 
the three matrices in (8.7.1), we adjoin the unit matrix 


‘= E 4 (8.7.2) 


The initial state of the apparatus, given in (1.10.28), is then simply 


: fae (8.7.3) 


which is the eigenvector of the component No: 


0 —i] 1 J1—i 1]1-i 

f ‘ 2 fe mane fe ee) 
with the “needle” of the apparatus initially pointing at 0 in Figure 1.16 along 
the positive direction of the y-axis. 


The initial state of the system and apparatus in (1.10.29) may be also 
simply written in matrix form as 


or (2) 5B] ers 
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with the variables of the system and the apparatus operating in different 
vector spaces using different notations for the matrices for the two systems 
with square ones for the apparatus matrices. 

We consider the following pair of the apparatus variables 


ee LaT-aLaLy es 


ee 
1 1j11 1 fi] 1 fi 
A fy ee = —_ . we 
0+=)= alae lil ab enn 
with [1 ila /V2, [1 i /V2 denoting, respectively, the apparatus spin 


states along the negative/positive x-axis. 
These matrices, apart being Hermitian, have the following interesting 


properties 
1/1 41/1/71 41 1/1 41 
2 B 1 | 2 E 1 ~ 9 Ee 1 (8.7.8) 
and ; ‘ 
1 -1 11 00 
2 E 1 | 2 i 1 ~ E 4 eo) 
As a Hamiltonian, in spin space, we consider the following simple one 
1/1 -1 1/11 
H= ross E 1 _ AqGo15 i Hl (8.7.10) 


where 01, 03 are Pauli matrices pertinent to the physical system, A is a 
coupling parameter and q, as we will see below, is a measure of imperfection 
of the apparatus related to « in §1.10. The coupling parameter is not 
arbitrary and is specified by the time of operation of the apparatus until the 
measurement is completed and the correlation between the apparatus and 
the physical system, of the type given in (1.10.30), required is achieved. The 
parameter q, however, may be controlled, taking the value 0 in the limit of 
an ideal apparatus, and should, in general, be small. 

Due to the property in (8.7.9), the two parts of the Hamiltonian in (8.7.10) 
automatically commute, although o; and o3 do not. 

The construction of the time evolution operator from (8.7.10) is straight- 
forward. 

From the commutativity of the two terms in (8.7.10), the evolution oper- 
ator may be written as the product of two factors 


OU Gales 8 fag DEE V4 
U(T) = ex (Frans & 1 |) ex ( 7 ns lt il) (8.7.11) 


Since (73)? = 1, we have from (8.7.8), for the first factor in the product 
in (8.7.11) 
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ee en en 


=1+ (isin (+) o3 + (cos (+) _ 1) 1) ; E a . (8.7.12) 


The second factor in the product in (8.7.11) is similarly handled, giving 
for U(T) the expression 


U(T) =1+ (isin (+) een (cos (=) _ 1) 1) : ei "a 
+ (isin (=) ot (cos (*) i) 1) . f dl (8.7.13) 


where we have finally used (8.7.9) to carry out explicitly the product of the 
two factors in (8.7.11). 

The implementation of measurement by the interaction of the apparatus 
with the physical system, means that a perfect correlation between these two 
systems occurs for q = 0. For an ideal apparatus, for which g = 0, we must 
have for a perfect correlation, 


|b) = U(T) |o) = e+ (5) aI +c (1) Hl (8.7.14) 


which states that if the spin is up, then the “needle” of the apparatus is also 
up (see Figure 1.16), and similarly, vis-a-vis the second term in (8.7.14), if 
the spin is down so is the “needle” of the apparatus.?! Hence the correlation 
to be reached between the apparatus and the system by the measurement 
process, as given in (8.7.14), dictates that AT’/h in (8.7.13) cannot be chosen 
to take any arbitrary value. It is easy to see that to achieve this correlation 
condition in (8.7.14), as implied by measurement, we may choose?” 


\T/h = 1/2. (8.7.15) 
The unitary operator (8.7.13) then becomes 


1 In the state (8.7.14), neither the state of the physical system nor the state of 
the apparatus is well defined, and this state is referred to as an entangled one 
(see also §8.10). Later on below, we will see how one may argue, by taking into 
account of the environment, surrounding the combined system above, either the 
first state (coefficient of c;) or the second state (coefficient of c_) in (8.7.14) is 
selected in a measurement, by a process referred to as quantum decoherence, in 
conformity with one’s classical notions of a measurement. 

Such correlations measurements restrictions are typical in dynamical investiga- 
tions of measurement theory, cf. Perés (1986), Yurke and Stoler (1986). Such 
conditions are realized experimentally, cf. Itano et al. (1990). 


22 
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1 -1 
Sa ieie on) E 1 cos (34) — 1. isin (3) 
0 3) aa ee 2 
isin (34) cos (44) —1 
(8.7.16) 
which for g = 0 leads to (8.7.14). The unit matrix in (8.7.16) is given by 


1= (39) [9]. fran 


Upon comparison of (8.7.16) with (1.10.35)—(1.10.38) it is easy to see that 
the parameter «, as a measure of the imperfection of the apparatus, is related 
to q by the simple relation 


u(r) =1+( 


k= 7q/2. (8.7.18) 


The final state, as obtained from (8.7.16), (8.7.5), quite generally, is given 
by 


(cos (28) - Je. 
m= [er(o) +3 fein (2) oy. 4 (con (#2) — 1) a 
(3) ) 


(cos 
BO (+3 isin (7) cy + (cos (44) — 1) e- 


As expected, this coincides with the expression in (1.10.30). 

The relevant probabilities for the above combined systems are now 
straightforward to compute and the reader may refer to §1.10 for these expres- 
sions. [Here (1 0)", (0 os in (8.7.19), refer, respectively, to the +2 directions 
there. | 

In the above illustration, T in (8.7.11) denotes the time of operation of 
the apparatus. To be rigorous in the treatment of the “switching on” of the 
apparatus at time t = 0 and “switching it off’ at time t = T, we may replace 
the formal discontinuous expression 


Aq 
2 


qd 


E(t) = O(T — t) — O(t) (8.7.20) 


reflecting this property, by a smooth function 


0, -0o <t<0 
— exp(—</t), 0<t<e 
&(t) = 41, egt<T-e (8.7.21) 
Fy Pl ETD). Pe ecier 
0, T<t<o 
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where € is an arbitrarily chosen small number much less than T. This re- 
markable function is not only a continuous function of t but may be also 
differentiated an arbitrary number of times. It vanishes for t < 0, t > T, and 
&(t) =1 fore <t < T —€ as given above. 

The coupling 2 is then to be replaced by Ao€(t), where Xo is the coupling 
parameter between the system and the apparatus during the interaction pe- 
riod, and X, now, is an effective coupling related to Ao as spelled out below. 

To the above end, we integrate €(t) over ¢ to obtain 


[ase =e (1 ef Te) 


= T —0.8073¢ (8.7.22) 


recognizing the exponential integral in (8.7.22). We may then set 
Ao(T — 0.8073 ¢) = AT (8.7.23) 


fora givn0<e<T. 


8.7.2 Interaction with a Harmonic Oscillator in a Coherent State 


As another illustration we consider the interaction of a spin 1/2 with 
an apparatus described by a harmonic oscillator, in a coherent state, with 
Hamiltonian 


H =—)o3a'a + hwata (8.7.24) 


omitting, for simplicity the zero point energy. The initial state of the system 
plus the apparatus is taken to be 


|®o) = oS) |—iao) (8.7.25) 


where |—iao) is a coherent state (§6.6, (6.6.21), (6.6.15), (6.6.28), (6.6.4)), 
providing a very close description of a classical state of the apparatus. For 
convenience, we choose ap to be real (and positive) so that iao is pure imag- 
inary and hence in configuration space, the initial state (see (6.6.50)) cor- 
responds to a Gaussian distribution centered at the origin. ag will be taken 
to be a large number. As a matter of fact, since ag = |A|(mw/2h)!/? (see 
(6.6.15), (6.6.4)), it will be quite large for macroscopic values taken by | A], 
vies 


23 For example, for |A| ~ 10-3 meters, m ~ 107? kg, one has the estimate 
ao ~ 10!” (w- gyit. with w in s~'. During a short time a microscopic par- 
ticle interacting with an apparatus, w will be also not small due to the rapid 
response of the oscillator (apparatus) to the particle in the short time. An upper 
time limit of response of the oscillator is also set up by its decay time induced 
by the environment. 
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If a4 = iap exp(id+), where exp(id. 
then we recall from (6.6.38) that 


.) are phase factors, i.e., 64 are real, 


(a4. la_) = exp (-o8 (1 a el(--44))) (8.7.26) 


and the states jaz) are optimally almost orthogonal for large a2, if, in par- 
ticular, 


6. -d, =7. (8.7.27) 
The final state ® is easily obtained from (8.7.25), (8.7.24), (6.6.27) to be 


b=c, el |-i a ey, +o @ =i OF —Mag) (8.7.28) 


The two coherent states, on the right-hand side of (8.7.28) are not orthogonal. 

The state in (8.7.28) corresponds, however, to an almost perfect corre- 
lation, as in (8.7.14), between the physical system and the apparatus as a 
successful measurement, for (see (8.7.27)) 


2AT/h= nr. (8.7.29) 


This gives the almost perfectly correlated state?* 


1D) =c, @ lay pe @ 2a) (8.7.30) 


with a@ = ap exp(—iwT), and |—iap) denoting the neutral state of the appa- 
ratus before its interaction with the physical system. Equation (8.7.30) is to 
be compared with (8.7.14). 

The density operator corresponding to (8.7.30) is given by 


p= les? (fo) la (al + eI? (92) [=2) (-al 


ie G 7 aapecte G ) aar (8.7.31) 


If one is not interested in reading the apparatus, and thus the latter is 
unread, one may introduce the so-called reduced density operator prep by 
taking the trace over the coherent states in (8.7.31). This gives the remarkably 
simple expression 


10 00 
ue ( 3) Pi ¢ ‘ 


24 A correlation will be also achieved for other values of \T'/h. The “tuning” condi- 
tion in (8.7.29), however, provides an optimum one as dictated by the definition 
of a measurement with the (almost) perfect correlation occurring between the 
apparatus and the physical system as given in (8.7.30). 
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+ leset G Hg es Gj 5) exp(—2a@) (8.7.32) 


and the non-diagonal terms in (8.7.32) are practically equal to zero for macro- 
scopic large values of a% as discussed above. Thus we obtain a mixture (see 
§1.5) of spin states for practical purposes. 

Upon writing 


6 7 = |+1/2, 2) (41/2, 21, é i) = |-1/2,2) (-1/2, 4 


6 = |+1/2, Z) (-1/2, Z| , e ) = |-1/2,2Z) (+1/2,z| (8.7.33) 


in the notation of §1.10, we note that the apparatus has a built in imperfection 
for measurement (see also Figure 1.17). In particular, the probabilities of a 
spin flip or a non-flip, corresponding to the experiment in Figure 1.17 (d) in 
the presence of our new apparatus are, respectively, 


2 2 
sin“ 9 _ sin B 202 


7.34 
; ; (8.7.34) 


(1/2, 2|PRED| ia 1/2, z) = 


a) 
(+1/2, z|prep| + 1/2, z) = sin*(@/2) + cos*(G/2) + sn Betas 
where we have used (1.10.44), (1.10.57), (1.10.58), and the interference terms 
(see (1.10.52), (1.10.53)) initially present in the experiment in part (c) of 
Figure 1.17, up to the exponentially damped terms involving exp(—2a@), 
disappear. The expressions in (8.7.34), (8.7.35) should be compared with 
those in Figure 1.18 (d). 


(8.7.35) 


8.7.3 The Role of the Environment 


Macroscopic systems, such as meters associated with the above two ex- 
amples, in the real world are never in isolation from the environment. The 
latter, coupled to a meter’s variables, consists of everything else monitoring 
an observable being measured and may include additional degrees of free- 
dom associated with the apparatus itself, providing, as one may argue, the 
different alternative readings on a meter being sought.?° Modellings of the 
environment, as applied to the above two examples, will be described and 
shown as to how quantum decoherence may set in producing classical cor- 
relations between the physical system into consideration and the detector 
(meter) in conformity with one’s classical conception of measurements. 

Consider the pure correlated state? between the physical system and 
the meter given in (8.7.14). Referring to such a state, it is not clear what 


5 Zurek (1991). 
6 Such a state, with cy 4 0, c_ #0 is referred to as an entangled one. Entangled 
states will be considered in detail in §8.10. 
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the different alternatives being measured. For example, for cz = +i, and by 
using the expansions, 


1\_ 1+i 1/1-i\ , 1-i 114i 
(6) = 2 G3) are 573) Pete) 
0) 1-i 1/1-i\ , 14+i 1fiti 
Ce 2 5033) es (075) ere) 


and similarly for the meter’s variables, we may rewrite the same state |w) in 
(8.7.14), with cy = +i, as 


1/1-i\) 1/1-i 1/1+i\) 1 J1+i 
Ww =-3 (G4) stad te Gta aia (S88) 
This raises the ambiguity as what spin components are actually being mea- 
sured — the z-components or the y-components. On the other hand, the 


density operator as obtained from the state in (8.7.14) is given by 


p=les? (4) 9) [6] ol le-PG) 0 [f] (on 


+ epc% Gc) (0 1) aI [0 1J+ che (7) (1 0) Hl [1 0]. (8.7.39) 


| 2 


The coefficients |c;|?, |c_|? are interpreted as classical probabilities, and 
if the non-diagonal terms, proportional to c,c* , c_c4,, were absent, one may 
infer that the detector and the system are either in the spin up or spin down 
states, in conformity with one’s classical perception of measurements, and not 
in superpositions of these states, and the set of alternatives to measurement 
of spin components along the z-axis have been selected. 

Now we provide a straightforward illustration of the quantum decoherence 
as induced by the environment. One may consider the measurement process, 
involving the physical system (S), the meter (M) and the environment (E), 
as a two-step?’ one. In the first step of the measurement, a correlation is 
established between the system and the meter, as given in (8.7.14). In the 
second step, the environment becomes correlated with the meter, as a result 
of its interaction with the latter. If we develop an elementary modelling of 
the environment as a two-level system, one may then consider, in the process, 
the state |), in (8.7.14), to be replaced by 


1 fli 
|W) 5 {1 i i} (8.7.40) 
leading to the final state of the system/meter/environment: 


27 Zurek (1991). 
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women (DEE ORE a0 


using the notation {-} for the environment states. 

Since one is interested in the state of the system and the meter reading 
only, we may obtain the reduced density operator pg), concerning the system 
and the meter, by tracing over the environment states obtaining 


PsM = Tr (|Ysme) (YsmE|) 


eee (4) (10) 4| [1 0] +|e_[? a (01) H fou] (8.7.42) 


transforming the quantum superpositions into statistical mixtures (see also 
§1.5) readily interpreted in classical terms. As mentioned in the introduction 
to this section, this reduction of the density operator from a pure to a mix- 
ture is referred to as quantum decoherence. Also (8.7.42) implies that the 
interaction between the meter and the environment has actually led to the 
selection of the alternatives being measured — the components of spin along 
the z-axis. Also the basis [1 0] tes [0 1] " in the meter’s vector space, referred 
to as the pointer basis, have been selected. This removes the ambiguity as to 
what the alternatives being measured are. 

The interaction between the meter and the environment may be imple- 
mented (in analogy to the one in (8.7.10) with g — 0) by the Hamiltonian 


10/1/41 -1 
Hur = —Ao E zi 5 i l \ (8.7.43) 


which for Agr = 7/2, with 7 referring to the interaction time, leads from the 
state in (8.7.40) to the state in (8.7.41) (see Problem 8.17). 

A more realistic and more interesting modelling of the environment may 
be given, pertaining to the second example in (8.7.24), where the environment 
is represented by a collection of harmonic oscillators involving infinitely many 
degrees of freedom. As the interaction of the meter and the environment, we 
consider the model?® interaction Hamiltonian 


Hyp = al S~ Agbe +a” AB, (8.7.44) 
k k 


where bz, at denote the annihilation, creation operators associated with the 
various degrees of freedom of the environment (also referred to as a reservoir). 
The environment is taken initially in its ground-state. This interaction will 
be considered in detail in 12.7. 


8 This is reminiscent of the interaction of a charged particle with the ever present 
electromagnetic field surrounding it with corresponding physical consequences 
and that of renormalization (see §7.7, §8.5). 
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We set ‘ 
us 
= se |A(w)|?n(w) (8.7.45) 


where A(w) is a continuous variable extension of A, replacing the summation 
over k in (8.7.44) by an integral, n(w) denotes the density of such states and 
w is introduced in (8.7.24). The explicit expression of + is not important for a 
qualitative discussion of quantum decoherence. Then for weak coupling and 
a time t short enough such that the oscillator, representing the meter, has 
not changed much given by 
t 

+ <1 (8.7.46) 
but for ¢ much larger than the correlation time of the environment, the re- 
duced density operator, pertinent to the system and meter only, is given by 


(§12.7) 


10 = z 
psm(t) = lcy/? ( ) lac ie (ae we 


+P (22) eae) (oe 


. fol = = = 
+ cep.c8 t i lac me) (—ae 2) exp(—2a8(1 —e 7) 


+ ¢_c4 (; i) |-a0"/?) (ae-7#/?| exp(—2ag(1 — e~7*)) 
(8.7.47) 


evolving from the density operator in (8.7.31), up to phase factors which are 
unimportant for discussing quantum decoherence, with a a complex number. 
The condition (8.7.46) implies that 


exp(—2a3(1 — e~7*)) ~ exp(—2a?4t) (8.7.48) 


and for ag, associated with the meter (see (8.7.25)) taking on macroscopic 
large values such that 


1 
ao > = 8.7.49 
>a (8.7.49) 
the non-diagonal part in (8.7.47) will be washed away relative to the diagonal 
one, demonstrating how quantum decoherence may arise destroying quantum 
superpositions. Decoherence then occurs exponentially on a decoherence time 
scale ~ 1/(ya2). 
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8.8 Ramsey Oscillatory Fields Method and Spin Flip; 
Monitoring the Spin 


8.8.1 Ramsey Apparatus and Interference; Spin Flip 


The Ramsey separated fields method,” in its simplest form, consists of 
two oscillatory magnetic fields each acting for some time 7 separated for some 
time T with no oscillations. More precisely, the magnetic field as a function 
of time may be taken to be of the form 


(B cos wt, B sin wt, Bo), OSES F 
B(t) = 4 (0,0, Bo), T<t<Tr4+T (8.8.1) 
(Bcoswt, Bsinwt, Bo), T+7T <t<274+T 


where By > 0, B > 0 are constants. We note that for time T,, the amplitude 
of oscillations is reduced to zero as is shown pictorially in Figure 8.2. 

It is remarkable that an interference pattern arises as a consequence of 
having the two separated field zones, as will be seen below, for the spin 
intensity distributions for a beam of particles, with spin, entering one Ramsey 
zone and finally leaving the other Ramsey zone. 


B m Ka» 
B 
5 ee ae, 


RAMSEY RAMSEY 
ZONE ZONE T —>\< re at T > 


oe T 


Fig. 8.2. Two separated oscillatory fields zones, each acting for a time 7, with 
zero amplitude for a time T in between. [More than two oscillatory fields zones 
may be also utilized.] 


Consider an uncharged particle of spin 1/2 and magnetic moment pt = p10, 
yu <0. The interaction of the magnetic moment with the magnetic field B(t) 
is given by 
hwo /2 |p| Bo" 
H = |p\o- Bit) = (8.8.2) 
|u|Bel”? —hw/2 


2° Ramsey (1990), based on the 1989 Noble Prize in Physics Lectures. For an earlier 
related classic paper by Ramsey, cf. Ramsey (1950). 
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where 


In (8.8.2), B is taken to be zero for 7 <t<74+T. 
From (8.8.2), we are led to solve the system 


hwo —iwt 
het 2.04 (t) + ||Be“™*a_(t) 
ia th) = (8.8.4) 
% —Agoa_(t) + |u| Bea, (t) 
Upon setting y 
a(t) = Bs (t)eFivo*/? (8.8.5) 
we obtain the two equations 
F 2 B2 
By +i(w — wo) Bs + —F—B4 =0 (8.8.6) 
2 B2 
B_ —i(w — wo) B_ 4 SS B_ =0. (8.8.7) 


After a lengthy but straightforward manipulations of (8.8.4)—(8.8.7), we 
obtain the following expression for the unitary evolution operator from some 
time t, to t within a zone 


Uirlt, ty) Ui2l(t, ty) 
U(t,ty) = i LayUst |) (8.8.8) 


where 


wW— Wo 


Uir(t, ty) = cos a(t = ty) +i ( ) sin a(t = n) e  iw(t—t1)/2 (8.8.9) 


|| B 


Ura(t,t1) = iF sin a(t — te (+41)/2 (8.8.10) 
B : 
Ue (t,t1) = —i lu sin a(t — t,)e(@+42)/2 (8.8.11) 


a 


Ualtstr) = | cosa t1) (+ sina(t—n)]ee-0v? (8.8.12) 


2 2 pR2 
WwW B 
i OF) Lb 
2 h? 
For the intermediate time range T < t < 7+ T, the unitary time evolution 
operator is simply 


1/2 


(8.8.13) 


e-iwol/2 0 


U(r +T, T) = Uo(T) = : (8.8.14) 
0 eiwoT /2 
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For the full process, from 0 < t < 7+ 77, the time evolution operator is 
given by 


Up(27 + T,0) = U(2r +T,7 + TU o(T)U(r, 0) (8.8.15) 


where U, Up are, respectively, given in (8.8.8), (8.8.14). 


In particular, for a particle initially prepared in the state (1 0)", the 
amplitude of a spin flip after the particle has gone through the process in 
Figure 8.2 is given from (8.8.15) to be 


Aia(spin flip) = (1) ; Up(2r + T,0) (5) 


~ilulB 
= He sin(ar)e 


x {ce cos at +i (=) sin ar| 
a 


pee ooze cos at —i (- 5, *) sin ar| \ (8.8.16) 
a 


This gives for the probability of a spin flip 


iw piwT /2 


Ay? B? T 
Prob[spin flip] = “F sin? ar| cos (w — wo) — } cos at 
ha? 2 


= (- =) sin ( = 05 | sin ar| j (8.8.17) 


where a is defined in (8.8.13), and wo is given in (8.8.3). 
At resonance w = wo, a = |u|B/h and (8.8.17) reduces to 


2|\B 
ProbJspin flip], econace = Sin (A) (8.8.18) 


and a 100% probability for a spin flip is formally attained for 


wR 


ee 8.8.19 
a B any 


To investigate the nature of the interference occurring in (8.8.16), (8.8.17), 
we first determine the amplitudes for a spin flip or of a non-flip of spin after 
the particle has gone through the first Ramsey zone before it enters the second 
one. These are respectively given by 


Ai (spin flip) = @ : Uo(T)U(r, 0) G 
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_ —ilulB 


; sin(ar)ei#oT/2eiv7/2 (8.8.20) 
a 


A, (non—flip of spin) = (a) : Uo(T)U(r, 0) (4) 


= e WoT /2_-iwr/2 cos at +i (=) sin ar| : 
a 
(8.8.21) 


Similarly for a particle entering the second Ramsey zone, the amplitude 
of a spin flip or of a non-flip of spin, and ending up in the state (0 ae at 
the end of the experiment, are, respectively, 


A2(spin flip) = (?) : U(2r+T,7+T) G 


—ilu|B ‘ 
= SUB sin(ar) er +2T)/2 (8.8.22) 
a 


# 
A2(non—flip of spin) = (7) U(27+T,7+T) a 


= lcosar — i (=) sina e7/2 (8.8.93) 
a 


From (8.8.16), (8.8.22), (8.8.23), we see that the amplitude for a spin flip 
for a particle going through the whole process in Figure 8.2, may be rewritten 
as 


Aoi(spin flip) = [A2(spin flip) Aj (non—flip of spin) 
+ A2(non-flip of spin) A; (spin flip)] 
= A(—++)+ A(- -— +) (8.8.24) 


We note that (8.8.24) is a statement of completeness as it follows from the 
insertion of the identity between Uo(T) and U(27 + T,7 + T) in (8.8.15): 


U(Qr + T,r + T)Uo(T)U(r, 0) 
=U(2r+T,7+T) { (5) (10) + (7) (0 )} Uo(T)U(r,0) (8.8.25) 


leading immediately to the two terms on the right-hand side of (8.8.24) upon 
taking the matrix element (0 1) (-) (10) ' of (8.8.25). 
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The probability of a spin flip in (8.8.17) may be then rewritten as 


Prob[spin flip] = |A(— + +)|? + |A( +)? 


+ (A*(—+ +)A( +) + A(— + +)A*( +)) (8.8.26) 
exhibiting an interference term 
[A*(— + +)A(— — +) + A(— + +) A*(— — 4+)]. (8.8.27) 


Such terms are observed and are responsible for providing narrow resonance 
curves about w = wo for the transition probability in corresponding experi- 
ments. 

It is interesting to insert the apparatus (a meter) of §8.7 in (8.7.10), de- 
scribed as a spin 1/2 system, in the intermediate stage between the two 
Ramsey zones in Figure 8.2, the interaction Hamiltonian in that region be- 


comes 
10 1 -l 
B st , TKt<rstt 
H(t)= (8.8.28) 
10 
o3|1|Bo ap T+tp9<t<7T4+T 


restricting for simplicity only (see also Problem 8.20) to an ideal apparatus. 
Here we use the notation to = 7h/2X (see (8.7.15)) for the time of operation 
of the apparatus, assumed to be finite, to reach the correlated state given 
in (8.7.14), rather than T, with the latter reserved to denote the time of no 
oscillations of the magnetic field between the two Ramsey zones. T’ is chosen 
such that T > to. 

The unitary time evolution operator between the two Ramsey zones in 
this case, then follows from (8.8.28), (8.8.14), to be given by 
e iwoT' /2 0 . ih aa 
Uo(T) = {1 + (io3 _ 1)5 b 1 \ (8.8.29) 


0 eivoT /2 
instead of (8.8.14), and for the full process we obtain 


Up(2r + 7,0) = U(27 + T,7 + T)Uo(T)U(r, 0). (8.8.30) 


1\ 1 j1—-i 
lo) = (3) 5 : Fe (8.8.31) 
of the particle-apparatus system, the amplitude of a spin flip of the particle 
is readily worked out from (8.8.28)—(8.8.31) to be 


Given the initial state 
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1 
AGE) a eee A (8.8.32) 
This gives for the probability of a spin flip, the simple expression 
|A(— + +)|? + |A(- — +)? (8.8.33) 


showing the disappearance of the interference term in (8.8.27) by the mere 
insertion of the meter between the two Ramsey zones. 


8.8.2 Monitoring the Spin 


The insertion of a meter between the two Ramsey zones, as just discussed, 
leads us to investigate the fate of the spin of a particle as it goes through 
the Ramsey apparatus, as described by the magnetic fields in (8.8.1), when 
the spin of the particle, as it comes out of the first zone, is monitored by 
the meter. We prepare the particle to be initially in the state (1 0)" before 
entering the first zone. 

A machine, as a composite system consisting of a filter, which accepts a 
particle in the state (1 0)" only, and a Ramsey apparatus, in the absence 
of a meter, may be represented by the matrix (see also Figure 1.7, (1.3.24), 
(1.3.25), (1.3.5)) 


M =U(2r+T,7r+T)Uo(T)U(r,0) € i) (8.8.34) 
which may be rewritten as 
— (Mi 0 
M= Ge i) (8.8.35) 


with 
Mi — e Mol 2TH (Br + T, ae te T)Uw(7, 0) 


+ eT /2U7, (27 + T, r + T)Un (7,0) (8.8.36) 


My, = eT /2Y, (27 + T, 7 + TU (7,0) 


4 eT 2075 (27 + T,7+T)Uo(7,0) (8.8.37) 


(see (8.8.8)—(8.8.12)). 
The amplitude that the particle comes out of the machine with no change 
in its spin state is then given by 


(10) M (6) =My (8.8.38) 
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giving the probability (see also (8.8.17)) 


4p? B? T 
Prob[same state] = 1 — a sin? ar cos (w — wo) 5) COs aT 


= (: =) sin (w = “)5 | sin ar| (8.8.39) 


In the presence of the meter, the corresponding machine may be repre- 
sented by the matrix 


Mo = U(2r +T, 7 + T) Uo(T) U(r, 0) ( ) (8.8.40) 


where U(T) is given in (8.8.29). For the apparatus in the initial state, ap- 
pearing tin (8.8.31), we have 
—iwoT /2 0 0 0 
Pores a en : 1 
2\|1+i 0 0 0 


leading from (8.8.40) to 


| (8.8.41) 


0 eiwoT /2 


2p2 © 2B? 2-2 
Probg|same state] = ( sin? ar) (1 aa sin? ar) 
iar! a h2a 
(8.8.42) 

corresponding to the probability in (8.8.39) now in the presence of the meter, 
as indicated by writing Probg for such a probability. 

At resonance w = wo, the expressions for the probabilities in (8.8.39), 
(8.8.42) simplify to 


2|u|B 
Prob|same state] = cos” (24"") 
B B 1 2|u|B 
= cos* @: =) + sin# (4 *) 5 sin’ ( ul "| 
(8.8.43) 
(see also (8.8.18)), 
B B 

Probg|same state] = cos* (4) + sin* (4) (8.8.44) 


respectively. 

Upon the comparison of the probabilities in (8.8.43), (8.8.44), we learn, 
under the above experimental situation, that monitoring the spin of the par- 
ticle by the meter, as the particle comes out of the first zone, suppresses the 
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probability of change of state, i.e., reduces the probability of decay (unless 
2|u|Br/h = na, n=0,1,2,...). 

It is interesting to consider the more general case of a system involving N 
Ramsey zones. As before a meter is inserted between each zone to monitor the 
spin of the particle as it comes out of a zone, with a finite time of operation 
of each meter to = wh/2X, so that the correlated state with the system is 
reached with T > to. The situation is depicted in Figure 8.3. 


(a) 
$ 
RZ RZ RZ.—sORZ 
(b) 


Fig. 8.3. A spin 1/2 (uncharged) particle going through N Ramsey zones 
(RZ). (a) An apparatus is set between each zone with the “needle” of each 
pointing initially towards the neutral 0 direction. The initial (a) and final (b) 
directions of the spin of the particle are the same in this experiment. Part (b) 
shows a possible configuration of the “needles” directions after the particle 
has gone through N — 1 zones. 


The time evolution operator of the combined state of the j*” meter and 
the spin of the particle coming out of the j” zone, 7 = 1,..., N—1, is of the 
form in (8.8.29). The expression for the time evolution operator within each 
Ramsey zone may be read from (8.8.8)—(8.8.12) with the appropriate values 
for t; and t in them. For the first zone, for example, t; = 0, t = 7, and the 
(1,1) element of U(r, 0) is (8.8.9) and so on. 

From (8.8.41), the state of a j‘” meter may be either [1 o]; or [0 ee In 


order that the final state of the spin of the particle be (1 0)", ie., be in the 
same state as the initial one, at the end of the experiment, the particle may 
have only an even number (0,2,...) of spin flips as the particle moves from 
the initial spin state to each of the meters between the Ramsey zones and 
to its final state. Also due to the orthogonality of the states [1 Ol [0 ie 
for each j, the relative phases of the elements of the U operators in (8.8.8)— 
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(8.8.12) are unimportant. Here we note that we have N such U operators 
corresponding to the N Ramsey zones. 
All told, (8.8.8)—(8.8.12) lead, in the presence of the meters 


N 2 2 x 2 2 N-« 
B B 
Probo/same state] = s ) oe sin? or) (1 - ma sin? ar) 


x2=0 
(x~=even) 


= PAY) [same state] (8.8.45) 


which is a generalization of the expression in (8.8.42) with the latter given 
for N = 2. 
At resonance w = wo, (8.8.45) reduces to 


N x N-« 
(N) = N\ (. 2 HB 2 BB 
P, [same state] = S- ) (si ir) (cos ir) . (8.8.46) 


x2=0 
(xw=even) 


In the absence of the meters and at resonance w = wo, the corresponding 
probability to (8.8.46) is simply given by (see Problem 8.21) 


NpB 
PO) {same state] = cos? (“,) : (8.8.47) 
For N = 3, for example, 
B B B 
P\) [same state] = 4cos® (Sr) —6cos* (47) +3 cos” (Hr) (8.8.48) 


and 


P®)[same state] = P [same state] 


4249 (1 cos? (42-)) (5 cos? (42-)) cos? (4) (8.8.49) 


and one learns that for 


B 
0 < cos? (47) <1/2 (8.8.50) 


monitoring the spin, in the above experimental situation, enhances the prob- 
ability of decay of the system (i.e., change of state), and for 


B 
1/2 < cos? (42) <1 (8.8.51) 


it reduces the probability of decay. Similar considerations may be given for 
N > 3. 
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The z = 0 term in (8.8.45), given by 


N 
——> sin“ at (8.8.52) 


corresponds to the physically interesting case where each time the meter 
“encounters” the particle its spin is always found in its initial state. For 7 
finite, i.e., non-zero, and, in general, for sin? at 4 0, the expression within 
the square-brackets in (8.8.52) is necessarily less than one. Accordingly for a 
system involving a large number of Ramsey zones (i.e., N large), the prob- 
ability that the spin of the particle is found by the meter, as the particle 
emerges from a zone, always in its initial state becomes negligibly small.*° 
The above analyses show that measurements made on a system during the 
course of its time evolution lead, in general, to situations where the probabil- 
ity of decay of the system (change of state), at the end of the experiment, may 
be either reduced or enhanced. Such conclusions depend, however, very much 
on the experimental situation considered and on the variables involved.*+ 


8.9 Schrédinger’s Cat and Quantum Decoherence 


The “Schrédinger cat” paradox arises when one considers the interpre- 
tation of the superposition principle of quantum physics as is extended to 
macroscopic systems. The classic example of this is the one dealing with 
Schrédinger’s 1935 thought experiment consisting, in a simple description, 
of a vessel containing a live cat coupled by a lethal device to a radioactive 


30 If ry denotes the total time of oscillations of the magnetic fields in the N zones, 
one may write tT = 7)/N. On using the elementary inequality sin? x < x”, one 
obtains the following lower bound [1 — p?B?7)/h?.N?]* to the probability in 
(8.8.52). One is then tempted to infer that the latter probability approaches 1 
for large N. Such a conclusion, however, cannot be true for finite 7. Also in 
an experiment, the total time Jo of no oscillations is finite, and for the time to 
of operation of a meter we have to < To/(N — 1). The limit N — oo, would 
then imply the unrealistic condition of an instantaneous operation time of an 
apparatus. 

There is a long history of the role of measurements (continuous, frequent,...), 
made on a system during the course of its evolution, on its decay, cf. Khalfin 
(1990); Degasperis e¢ al. (1973); Misra and Sudarshan (1977). The name “Zeno” 
effect, as their effect, was coined by the last two authors, and their work has 
led to numerous investigations. Cf. Nakazato et al. (1995); Koshino and Shimizu 
(2004) and references therein, and many other investigations by several authors 
with variations in its definition and different experimental situations with varied 
consequences following from them. In generally, one may refer to the suppression 
of the decay of a system noted as achieved by measurements made on it during the 
course of its evolution as a “quantum Zeno effect”, while refer to the enhancement 
of its decay as a “quantum anti-Zeno effect”. 
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substance. If a decay in it occurs, this triggers the device to release a deadly 
gas and the cat dies. On the other hand, if no decay in it occurs, the cat lives 
(see Figure 8.4). The radioactive decay law obeying the probabilistic rules of 
quantum physics, a decay may or may not occur within a given time speci- 
fied by the “experimentalist” and it depends on the half-life of the radioactive 
substance. 


(b) 


Fig. 8.4. (a) Vessel containing an alive cat in the presence of a radioactive 
substance in which no decay has occurred. (b) A decay has occurred which 
in turn has triggered a device to release a deadly gas and the cat dies. Unless 
one looks into the vessel, should one assume that the cat is in a superposition 
of cat alive/cat dead states? 


According to quantum theory, the cat may be then found in a superpo- 
sition state of being alive/dead correlated with the radioactive substance in 
which a no decay /decay has occurred. A state of this form may formally have 
the structure 


1 
|W) = Fa [|no decay, cat alive) + | decay, cat dead) | (8.9.1) 
where for simplicity one may assume equal amplitudes for both configuration 
states. Such a state is called an entangled one as it may not be rewritten as 
the product of two states (see Appendix to §8.10). Also in (8.9.1), neither 
the substance nor the cat is in a definite state. 
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In our usual perception of the world, the cat is either alive or dead and not 
in a superposition of the two states. Such “cat” states are not meaningful in 
the macroscopic world and one faces the question as to what the significance 
of the superposition of macroscopic states, in general, is. As done at the end 
of §8.7, one may then argue that the environment, surrounding a macroscopic 
system and interacting with it, will destroy such superpositions thus inducing 
naturally superselection rules preventing them to be observed. Such quantum 
decoherences should then occur in such short times for us to perceive. 

A cat is a very complex system consisting of a very large number of de- 
grees of freedom with its different parts evolving in time in complicated ways 
whether the cat is dead or alive. Admittedly, it is also rather too simplis- 
tic to introduce a state for a cat either alive or dead in a meaningful way. 
There are, however, so-called mesoscopic states which have macroscopic and 
microscopic features and have been actually prepared in the laboratory.°? 

In this context, the state of the “cat” as being alive or dead is replaced by 
some classical notion such as simply as the position of a particle. 

For example, the Monroe et. al. experiment involved in preparing an atom 
in a superposition of two spatially separated but localized wavepackets, thus 
creating a state F 

V2 
where |x), |v2) refer to wavepacket states corresponding to separated posi- 
tions of the atom, and |f), ||) refer to internal states of the atom. The ex- 
tension of the wavepackets was about 7 nm, and the separation between the 
wavepackets was not smaller than the rather macroscopic distance of 80 nm 
which is large in comparison to the atomic dimension of the order 0.1 nm. 
In the Brune et. al. experiment, the coupling of a two-level atom with a few 
photon coherent field in a cavity was considered generating a Schrédinger 
cat-like state of radiation and the quantum decoherence in a measurement 
process was observed for such a mesoscopic state. The decoherence is consid- 
ered as being due to dissipation corresponding to absorption by the cavity 
walls. Mesoscopic states are often referred to as kitten-like states. 

One may model the latter experiment by considering the coupling of a 
spin 1/2 system and a harmonic oscillator to generate a Schrédinger cat 


entangled state 
1) = (4) le) + (2) I-09] (8.9.3) 


where (1 ce (0 1)" correspond to two states of an atom, while the co- 
herent states ja), |—a) correspond to two configurations of radiation. In 


|?) [e1) |T) + lee) 11] (8.9.2) 


Schrédinger’s cat thought experiment, (1 0)" / (0 1)" correspond to the ra- 


82 Monroe et al. (1996); Brune et al. (1996); Brune et al. (1992); see also Gerry and 
Knight (1997). 
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dioactive substance in which no decay/a decay has occurred, while |@) /|—a) 
correspond to the cat alive, dead, respectively. 

By considering an interaction of the harmonic oscillator with the envi- 
ronment of the form given in (8.7.44), we may infer from the analysis given 
through (8.7.44)—(8.7.49), that quantum coherence will disappear exponen- 
tially on a decoherence time scale ~ 1/(y|a|*) (see end of §8.7), where y, for 
example, is defined in (8.7.45). We also recall from (6.6.24) that |a|? = 7, with 
n denoting the mean number of the oscillator quanta. That is, the larger 7 is 
the more rapidly coherence is destroyed. For macroscopic systems, 7 is very 
large and decoherence occurs too rapidly to be observed, while for mesoscopic 
ones, with m not too large decoherence is expected to set in slowly. 

A simple model that generates the Schrédinger entangled state in (8.9.3) 
may be readily given. Consider an uncharged spin 1/2 particle with magnetic 
moment yz. As an initial condition, we take the spin state (1 0)". 

As a first stage, we consider the interaction of the spin of a particle with a 
constant magnetic field B = (0,1,0), B > 0, » < 0, for a time t = hr/(4|p|B), 
with Hamiltonian H = —u-B. This generates the state 


exp Giese B)) (5) = S (3) i (7)| . (8.9.4) 


As a second stage, we consider the interaction of the spin, in the “initial” 
state (8.9.4), with a harmonic oscillator taken in an initial coherent state 
|—iag). This interaction will be taken to be proportional to the simple form 


(0) ays @ (0 | a'a (8.9.5) 


with a Hamiltonian (A > 0) 
H = hwata— dogzata (8.9.6) 


for a time t = fi /2A. This generates the state (see (8.7.28)) 


exp ( zt) = (7) ag =| a exp (Fin = Ayala) tag) 
i (7) ee Gin + Ayala) I-iao) | 


-pl()<()-2] ae 


a=e%a, d= —-. (8.9.8) 


where 


This second stage mimics an interaction which finally establishes the correla- 
tion between the radioactive substance and the cat as discussed below (8.9.3), 


486 8 Quantum Physics of Spin 1/2 & Two-Level Systems 


for formally the interaction of an atom in a superposition of two of its levels 
with a coherent field in a cavity (see also Problem 8.22).] 

As the third stage, one may also generate a pure superposition of the cat 
(kitten) states as follows. By applying an identical magnetic field as in stage 
1, for the same time t = fiz/(4|u|B), we generate the state 


3 (o) [le*’o2) —|-%e)] +3 (4) [le a0) + [ea] 0.0 


for a phase ¢’. 
Finally, as the fourth stage, one may perform a selective measurement, 


selecting the state (0 1)" component of the spin, and generate, in the process, 
the superposition state 


N [|e a0) Hs |-e-#'e0)| (8.9.10) 


where WN is a normalization factor. A coupling of the form in (8.7.44), where 
the harmonic oscillator in (8.9.6) interacts with the environment, represented 
by a collection of harmonic oscillators involving infinitely many degrees of 
freedom, leads to a destruction of such a superposition exponentially on a 
decoherence time scale ~ 1/(7|ao|?), where ¥ is defined in (8.7.45).°3 


8.10 Bell’s Test 
8.10.1 Bell’s Test 
Consider the commutation relations for spin 
(Si, 95] = Rese Se. (8.10.1) 


Let nj, ne be any two unit vectors specifying two different directions. Since 
the components X; of the position operator commute, and also commute with 
spin (see (2.7.8)), we may multiply (8.10.1) by the components n1;, n2; and 
sum over 7 and j to obtain 


[Sni, Sno] = ihjm1 X no|Sn (8.10.2) 


33 As pointed out in the footnote to the interaction in (8.7.44), quantum decoher- 
ence arising from the non-isolation of a measuring (or detection) system from 
the environment and hence from its interaction with it, is reminiscent of the in- 
teraction of the non-isolated charged particle with the ever present electromag- 
netic field surrounding it with corresponding physical consequences and that of 
renormalization. [It is interesting to note that the electromagnetic field is also 
essentially represented by harmonic oscillators with infinitely many degrees of 
freedom. | 
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where S, = n-S, N = (n; X no)/|ni X ng], and n; X ng is not the zero 
vector. 

For spin 1/2, the explicit expression for S, is given in (8.1.14), and for 
any unit vector n, Sy has the eigenvalues +f/2. The commutation relations 
(8.10.2) imply, in particular, that the components of spin along any two 
different directions cannot be defined simultaneously. As a matter of fact, 
if one argues, incorrectly, that there may be a simultaneous eigenvector for 
Sn, and Sy,, for which ny X no £0, then from (8.10.2), one would run into 
a contradiction that this eigenvector is also an eigenvector of Sn with zero 
eigenvalue! 

Suppose a pair of spin 1/2 particles are prepared in a singlet state 
(cf. (5.5.39)) 


1 
|p) = i (In), |—m)2 — |—m), |m)2) (8.10.3) 
where we have used the notation |+1/2,n) = |+n) (see (8.1.16), (8.1.25)), 


and the two particles move freely with momenta in opposite directions and 
cease to interact. The quantization axis in (8.10.3) was chosen, arbitrarily, 
along a unit vector n (see also Problem 8.24). 

An actual process will be discussed below giving rise to a singlet state as 
in (8.10.3). This state is not factorable as the product of two states and is 
referred to as an entangled state.** According to (8.10.3), if the measurement 
of spin of one of the particles is found to be along, say, n, then one would 
infer rather instantaneously, with probability one, that the component of 
spin of the other particle is along —n. This together with the fact that all the 
components of spin cannot be simultaneously defined, as implied by quantum 
physics, has led Einstein, Podolsky and Rosen (EPR) in 1935°° to a serious 
criticism of quantum mechanics.°° 

The EPR argument, tailored to the problem at hand, is of the following 
nature. 


‘Devil’s Advocate Argument 


From the measurement of spin of one of the particles, call it particle 1, 
and found, say, to be along n, one may conclude instantaneously, because of 
the correlation implied by (8.10.3), that the component of spin of the other 
particle, call it particle 2, is along —n without ever disturbing this latter 
particle. With no such disturbance, one may invoke locality, as a no-action at 
a distance, to infer that the value of the component of spin found indirectly 
for particle 2 must have existed prior to a measurement done on particle 1. 


34 The mathematical aspect of entangled states is given in the appendix to this 
section. 

3° Kinstein et al. (1935). 

36 Actually their criticism was reformulated by D. Bohm in terms of spin, while the 
original EPR argument was based on positions and momenta of particles. 
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Since n was arbitrary, one may also infer that all the components of spin of 
particle 2 were known to begin with. That is, all the components of spin of a 
particle are definite in clear contradiction with quantum mechanics and the 
underlying theory of the latter is incomplete. 2 

The above led to the belief that perhaps quantum mechanics is a limiting 
case of a more complete local theory which, involves, so-called, hidden vari- 
ables. Such theories are referred to as Local Hidden Variables (LHV) theories. 
In 1964, and in subsequent years, John Bell?” has put such theories to a test. 
Several tests have been also proposed in the literature by various authors. 
We refer to all such tests as Bell-like tests. We will discuss one originating 
from the work of Clauser, Horne, Shimoney and Holt (CHSH).°° 

To the above end, and in view of applications to a system of two particles, 
as described below (8.10.3), and other similar processes, we consider the 
following in the light of LHV theories. 

Let A denote collectively the random variables expected to be relevant to 
the system under study with corresponding probability density or probability 
mass function dp(X) normalized as 


; dp(A) =1 (8.10.4) 
A 


summed over the set A of all values that \ may take on. 

One is interested in determining coincidence and single counts obtained 
in the measurements of the spins of the particles, after emerging from the 
process, making angles, say, x1, Y2 with some given directions. 

Suppose that the system is in a state specified by A. We may introduce 
the following probabilities of counts: 


BxigXai Als Pas Ime De axa) (8.10.5) 


correspondingly, respectively, to coincidence counts when measurements are 
made on both particles’ spins, to a count when a measurement is made on only 
one particle (call it particle 1), and, finally, to a count when a measurement 
is made on particle 2 only. 

In such a framework, one makes the key assumption that if the system 
is in any given state specified by A, the probability count obtained from 
measurements performed on one particle is independent of the probability 
count corresponding to the other particle after they have emerged from the 
process. That is, the probability counts are necessarily factorable, 


P(X1,X23 A) = P(X1, - A)P(—, X23 A) (8.10.6) 


for all in A, implying their independence, with all determined in the same 
state X. 


37 Bell’s insight has been of great significance in science, in general. Many of his 
contributions to this problem have been collected, e.g., in: Bell (1989). 
38 Clauser and Horne (1974); Clauser and Shimony (1978); Clauser et al. (1969). 
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Now we use the fact that for any four numbers 0 < 21, 2%2,24,25 <1, we 
have the following elementary inequality 


-1<ajet2- 21954 v2. + 225-2 — 22 <0 (8.10.7) 


as established in the appendix to this section. 
Accordingly upon setting 


© = p(x1,—-3A) 


2 = P(—, X23 A) 
(8.10.8) 


Lo = P(—, X93) 


for four angles, v1, x2; X41, X9, and using the fact that probabilities, as in 
(8.10.5), necessarily must fall in the range [0,1], we have from (8.10.7) upon 
multiplying the latter by dp(A) and summing (integrating) over A: 


—1< p(x1, x2) — p(xX1, x2) + (Xd x2) + (Xd X2) — P(X4, -) — D(-) x2) < 0 
10 


where 


Cte i ROCCE enero 


= f do()p(xa x25») (8.10.10) 


etc., where we have used, in particular, the factorization assumption in 
(8.10.6). 

The inequality in (8.10.9) is expressed in terms of probability counts which 
may be determined experimentally putting LHV theories to a test. p(x1, x2) 
denotes the joint probability count for measurements of both spins, while 
p(x1,—),; pP(—,X2) correspond to probability counts with measurements of 
only one of the spins. 

In the sequel, the probabilities computed from quantum theory corre- 
sponding to p(x1, x2), p(X1,—),; p(—, X2) will be denoted, respectively, by 
P(x1,X2); P(x1,—), P(-, x2) with a capital “P”. 

In order to obtain a violation of the inequality in (8.10.9) experimentally, 
it is sufficient to choose any four angles x1, x2, x4, x4 that do the job since, 
according to the LHV reasoning, (8.10.9) must be true for all angles. Experi- 
ments show violation of the inequalities and are consistent with the quantum 
mechanical predictions. Experiments of optical nature have been performed 


490 8 Quantum Physics of Spin 1/2 & Two-Level Systems 


and a classic one involving two photons with measurements made on photon 
polarization correlations is one due to Aspect et. al.3°:4° 


8.10.2 Basic Processes 


Now we investigate the nature of the inequality (8.10.9) with the probabil- 
ities appearing in it as computed from quantum theory for specific processes. 


The Process: 


ee —eEee- 

We prepare a pair of electrons, with one spin up and the other spin down 
along the z-axis and initial momenta p and —p along the y-axis (see Fig- 
ure 8.5). There is a non-zero amplitude*! that the scattered electrons move 
along the z-axis, as shown in Figure 8.5. In this particular case, the proba- 
bilities P(v1, x2), P(x1,—), P(—, x2) take quite simple forms.*? 

Prior to the computation of the above probabilities let us investigate 
their precise physical meanings by analyzing the possible outcomes of an 
experiment. 

In the above experiment, depicted in Figure 8.5, a measurement of spin 
of one of the particles emerging from the process is measured along a unit 
vector n;, making an angle yx; with the z-axis, while the spin of the other 
particle is measured along a unit vector ng, making an angle x2 with the z- 
axis. The outcomes of such an experiment are shown in Figure 8.6. Since we 
are dealing with spin 1/2 particles, a spin measurement along a unit vector 
n gives only two possible answers, the spin is either along n or in opposite 
direction to n. That is, the particle’s spin is in the state |+n) or |—n) in our 
earlier notation in (8.10.3). 

That is, there are four possible outcomes of the experiment, as shown in 
Figure 8.6, with corresponding probabilities of occurrence 


Pl{+n1, +ngl, Pl{=n1, +n], Pi+n, No], Pi nj, No]. (8.10.11) 


3° Aspect et al. (1982). 

4° For many other experiments of different nature, cf. Clauser and Shimony (1978); 
Chiao et al. (1994); Bell (1989). 

41 That there is a non-zero amplitude for the process may be shown to be true from 
quantum electrodynamics, for example, to the leading order in the fine-structure 
constant, see: Manoukian and Yongram (2004). 

42 For the scattered electrons moving along other axes than the x-axis these proba- 
bilities turn out to have complicated dependences on the initial speed of the elec- 
trons relative to the speed of light, in general, as discussed below, see: Manoukian 
and Yongram (2004); Yongram and Manoukian (2003), for such details. 
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Fig. 8.5. A possible configuration of the process ee — e e , where the ini- 
tial spins are prepared to be one up and one down along the z-axis, and initial 
momenta p and —p are along the y-axis. There is a non-zero amplitude that 
the scattered electrons move with momenta along the x-axis as shown. This 
particular configuration yields to the simple probabilities given in (8.10.24), 
(8.10.26), (8.10.27) with spin measurements along directions specified by the 
unit vectors ni, Nz. There are only four possible outcomes of the spin mea- 
surements of the emerging particles along the unit vectors n1, ne. These are 
spelled out in Figure 8.6. 


Here, for example, read P[—n,+ng] as the probability that a measurement 
of spin of a particle, call it 1, along n, is found in its opposite direction, while 
the spin of particle 2, measured along nz is found to lie in the same direction. 
With the physical meanings of the probabilities in (8.10.11) made clear, we 
have 

P(x1,X2) = Pl+ni, tno] (8.10.12) 


P(x1 +7, x2) = Pl-nj, +no] (8.10.13) 
P(x1,xX2 +7) = Pi+m1, —np] (8.10.14) 
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Le he 


P[+n1, +n] Pini, +ny] 


be be 


P{+n1, —no] P[-n,, —ny] 


Fig. 8.6. The four possible outcomes of spin measurements (of the two emerg- 
ing particles) along the unit vectors ni, nz in the process depicted in Fig- 
ure 8.5. The corresponding probabilities of occurrence appear under each 
possible case. Some of these probabilities may be zero for some ni, nz and 
these may be read off from (8.10.24) in conjunction with (8.10.12)—(8.10.15). 


P(x + T,X2 + 1) = Pl-n, —ny| (8.10.15) 


corresponding to only four possible outcomes. 
For any angles v1, x2, the normalization of probability reads as 


Pods xa) +P Oat axe) + PO xott) + Pia ea xe +7) = 1, (8.1016) 


If a measurement of spin is made only on one particle, say, particle, call 
it, 1, then with P[+n,, —] denoting the probability that a measurement of its 
spin along n, is found in the same or opposite direction to n;, respectively, 
we have 


Px Py, =| (8.10.17) 
Pixita=) = finn (8.10.18) 


and the normalization of probability in this case reads 
Poa) + Por ta-) = 1. (8.10.19) 


Similar expressions are given for P(—, x2), P(—,x2+7). 

Now we go back to the process depicted in Figure 8.5, to compute the 
probabilities P(y1, x2), P(x1, —), P(—, x2) and the corresponding ones with 
X1,X2 7 X1 +7,X2+7. From the conservation of total angular momentum, 
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and the indistinguishability of the electrons, obeying the Fermi-Dirac statis- 
tics, that is, being described by an anti-symmetric state under the exchange 
of the two electrons, imply that the latter initial state, in spin-space, is a 


singlet-one, 
»-310,0,-0,0) a 


With measurements of spins along the axes 
n, = (0,sin x1, cos x1), No = (0,sin x2, cos x2) (8.10.21) 
(see Figure 8.5), and corresponding final states 
eit/4 cos x1 /2 ei7/4 cos x2 /2 


(8.10.22) 


e'7/4 sin yx, /2 e'7/4 sin y2/2 


1 2 


(see (8.1.16)), we obtain the amplitude 


(c'"/4 cos x1/2  e7i"/4 sin y1/2), (e'*/4 cos x2/2 7 i*/4 sin x2/2), |®) 


_ 1. fxi- xe 
= Rae ( 5 ) (8.10.23) 


giving the joint probability? 


1 = 
P(x1, x2) = 5 sin? (2) = P[+m,, +n9]. (8.10.24) 


The probability P(x1,—) may be obtained from the square of the norm 
of the state (see (8.10.20), (8.10.22)): 


(ci"/4 cos x1/2  e7i*/4 sin x1 /2), |®) 


= — |e'”/* cos — +e" sin — (8.10.25) 
V2 2\1/, 2 \0/, 


giving simply 
and similarly 


Upon defining 


43 This coincides with the quantum electrodynamics calculation (Manoukian and 
Yongram (2004)), to the leading order in the fine-structure constant as obtained 
for the process depicted in Figure 8.5. 
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8.10.28 


for four angles x1, x2, X41, X$, we note that for 
1 = 45°, Yo S00", 3, Sse 3 SH 180° (8.10.29) 


for example, we obtain S = —1.207, which violates the LHV theories inequal- 
ity (8.10.9) (from below), according to the quantum mechanical computations 
of the underlying probabilities. Such violations are in conformity with exper- 
iments and are consistent with the quantum theory predictions. 


Remarks 2 

1. The joint probability of spin correlation in (8.10.24), whose physical 
meaning is spelled out in (8.10.12)-(8.10.15), see also Figure 8.5, is 
what is technically called a conditional probability. That is, given that the 
process in Figure 8.5 has occurred, then (8.10.24) gives the joint proba- 
bility of spins correlations of the emerging particles. 

2. The expression in (8.10.24), corresponding to the process in Figure 8.5, 
is valid even in the relativistic regime where 3 is not small. Here 3 = u/c 
denotes the speed of any one of the incoming electrons and c denotes the 
speed of light. On the other hand for the scattered electrons moving along 
a different axis, than that of the x-axis, the corresponding conditional 
probability of the joint spin correlations has, in general, a complicated 
dependence on (3.44 Only when the formal limit 3 — 0 is taken (the 
non-relativistic regime) then these conditional joint probabilities of spin 
correlations for these different scattering axes coincide with the one in 
(8.10.24) as well. 

3. From (8.10.24), (8.10.26), (8.10.27) we note that, in general, P(y1, x2) 4 
P(x1, —)P(-, x2) showing the dependence of the two events correspond- 
ing to the spin measurements of the two particles. 


The Process: 


ete — Vy 


A very investing process, relevant to the above analysis, is the one of 
positron-electron annihilation into two photons. We consider the process de- 
picted in Figure 8.7. The electron, positron are prepared with spins up and 
down along the z-axis, and with momenta p and —p, along the y-axis, re- 
spectively. Again there is a non-zero amplitude that such a process occurs. 
The created photons move with opposite momenta and, in here, along the 
z-axis. For this situation, the (conditional) probabilities P(vi, v2), P(vi,—), 
P(—, x2) take particularly simple forms. 


44 Such details, based on quantum electrodynamics calculations, have been investi- 
gated in Manoukian and Yongram (2004), Yongram and Manoukian (2003), and 
are beyond the scope of the present analysis. 
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+ 


Fig. 8.7. A possible configuration of the process e 
of the electron, positron are prepared to be up and down along the z-axis, and 
with momenta p and —p along the y-axis, respectively. There is a non-zero 
amplitude that the created photons, emerging with opposite momenta, move 
along the z-axis as shown. Photon polarization correlations are measured with 
polarization vectors making angles v1, x2 with the z-axis. 


e — yy, where the spins 


To obtain the above probabilities, it is convenient to define right-handed 
(R-H) and left-handed (L-H) polarization vectors associated with a photon 
with momentum /ik, say, moving along the z-axis in the positive direction: 


— ie + ie”), e_ = —~(e! — ie?) (8.10.30) 


e. = 
+ V2 
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where e!, e? are (linear) polarization vector components along the x- and 


y-axis, respectively (see also §8.5). The action of the parity operation on R-H 
and L-H polarizations is spelled out in Figure 8.8. 


PARITY: \ 


R-H 


PARITY: 


L-H R-H 


Fig. 8.8. The role of the parity operation on the momentum and right- 
handed (R-H) and left-handed (L-H) polarization vectors. The photon has 
also a so-called intrinsic parity of minus one. 


To find the two-photon state, we note the following: (1) the photons satisfy 
the Bose-Einstein statistics, and hence such a state must be even under the 
interchange of the two-photons. (2) The positron and electron have opposite 
intrinsic parities*? and hence the two-photon state must be odd under a 
parity transformation. (3) The total angular momentum along the z-axis is 
Zero. 

The R-H and L-H polarizations may be obtained from (8.10.30), (1.6.2), 
(1.6.3), to be represented as follows: 


1 


1 
— |i], as R-H (€41) for k, > 0, as L-H (€-2) fork, <0 (8.10.31) 
v2 \o 


45 Intrinsic parities of some particles will be investigated in Chapter 16. The vector 
potential A, for example, being coupled to the current J density via Maxwell’s 
equations imply that the intrinsic parity of a photon is odd. Since we are con- 
sidering here two photons, the intrinsic parity of a photon does not play an 
important role in this analysis. 
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1 
— | -i], as R-H (€+2) for k, <0, as L-H (€-) fork, > 0. (8.10.32) 
v2 \o 
Accordingly, the two-photon polarization correlation state may be written 
as 


|v) = qlerie+2 7035) 
F 1 1 1 
= Fae ed ee (8.10.33) 
2/2 
v2 0 1 0 2 0 1 0 2: 


which is consistent with the Bose character of the photons of opposite momenta. 
From Figure 8.8, this state has also an odd parity: e +1e42 ~ e_1e_2. To 
show that the total angular momentum, associated with |), along the 
z-axis, is zero, we note that with the convention of the representation of e1 
and e? by the column vectors in (1.6.2), (1.6.3), we may write 


0-10 0-10 
S3=ih|10 0] +ih{1 0 0 (8.10.34) 
000), 000), 


for the total angular momentum (spin) for the two photons along the z-axis, 
and note that 


S3 |v) =0. (8.10.35) 
Finally, since 
1 1 0 
+i} =]0] +tifl (8.10.36) 
0 0 0 


the state in (8.10.33) may be rewritten as 


LTP (6 o\ /1 
vy = —]{o} {1} —f1] fo (8.10.37) 
v2 0 1 0 2 0 1 0 2 


which is an entangled state similar to the one in (8.10.20), leading to the 
amplitude 


1 
(cos x1 sin x1 0), (cos x2 sin x2 0), |v) = ea: sin(yi — x2) (8.10.38) 
(see (1.6.4)), and the conditional probability, 


Ln 5 
P(x1,X2) = 3 sin?(y1 — X2). (8.10.39) 
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The probability P(y1,—) is obtained from the square of the norm of the 
state 


0 1 
1 
(cos x1 sin x1 0), |¥) = —= |cosx1 {1} —sin x: | 0 (8.10.40) 
v2 0 0 
2 2 
giving simply 
P(x1,-) =1/2 (8.10.41) 
and similarly 
P(-,x2) = 1/2. (8.10.42) 
For four angles 
x1 = 0°, x2 = 28°, yx = 45°, x5 = 67°, (8.10.43) 


for example, S, defined in (8.10.28) is, according to the quantum mechanical 
computation, equal to —1.207 violating the LHV theories inequality (8.10.9) 
from below. For 


Vi 0. hoo. Ae 1S a ee (8.10.44) 


we obtain S = 0.207, violating the LHV theories inequality (8.10.9) from 
above according to the quantum mechanical computations of the underlying 
probabilities. Such violations are in conformity with experiments and are 
consistent with the quantum theory predictions. 


Remarks 3 

1. The probability in (8.10.89) is a conditional probability given that the 
process depicted in Figure 8.7 has occurred and may be also obtained 
directly from quantum electrodynamics, to the leading order, and is valid 
for all speeds 0 < 8 <1 of et, e~. For the line of momenta of the pair 
of photons in different directions than the one in Figure 8.7, P(x1, x2) 
turns out to have a complicated dependence*® on 3, in general. 

2. In the formal limit 8 — 0 (the non-relativistic regime) of e+, e7, 
P(x1,X2), a8 given in (8.10.39), holds true for all directions of the line 
of momenta of the two photons. 

3. The normalization condition for the photon (massless spin 1 particle) 
reads 


Tv Tv Tv Tv 
PGs va) Pas 5 7X2) + P(x1, x24 bP Og 4 51X24 a) = 1 


(8.10.45) 


instead of (8.10.16). 

4. From (8.10.39), (8.10.41), (8.10.42), we note that, in general, 
P(x1, x2) 4 P(x1, -)P(-, x2) showing the dependence of the two events 
corresponding to the polarizations measurements of the two photons. 


46 Manoukian and Yongram (2004), Yongram and Manoukian (2003). 
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Many other processes have been discussed in the literature.” It is in- 
teresting to note that for specific configurations, specific collision processes 
lead to entangled states and speed independent expressions for the probabil- 
ities. More generally, however, such probabilities depend on the speeds of the 
parent particles*® and makes a Bell-like test for testing LHV theories more 
challenging. 

The Bell-like tests have not only put quantum theory on a pedestal, and 
emphasized how reliable the theory is, but also created enormous interest 
in the foundation of its underlying theory. Its inherit “non-locality”, as some 
practitioners put it, remains to be a problem, while for others as a mere prob- 
lem of interpretation. Operationally, however, quantum theory is in pretty 
good shape. It is consistent with experiments — the final verdict of a theory. 


Appendix to §8.10. Entangled States; The C-H 
Inequality 


Entangled States 
Consider two sets of independent vectors |a;), |3;), 
(aj|o3) = 04, (8:17) = oy (A-8.10.1) 


then for any vector 


Iw) = Do ci lew) [B:) (A-8.10.2) 


such that at least two of the coefficients c; are non-zero, cannot be rewritten 
as a product 


lb) = |e) |e2) (A-8.10.3) 

where 
Ir) = > alan) (A-8.10.4) 
Ibo) = yy b; |G) - (A-8.10.5) 


To show this, suppose, without any loss of generality that c; 4 0 and 
co # 0. Upon multiplying (A-8.10.2), in turn, by (a1| (G1|, (a2 (Go| and using 
(A-8.10.3) we obtain 

gq = a,b, Q= Agbo. (A-8.10.6) 


On the other hand by multiplying (A-8.10.2) in turn by (a@;| (G9|, (a2| (G1| 
and using (A-8.10.3) we obtain 


47 Cf. Clauser and Shimony (1978). 
48 Manoukian and Yongram (2004), Yongram and Manoukian (2003). 
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0= abo, 0= ab (A-8.10.7) 


which upon comparison with (A-8.10.6) leads to the contradiction that at 
least one of c1, C2 is zero. 

A state as defined in (A-8.10.2), with at least two of the coefficients c; 
non-zero is called an entangled state. 


The Clauser-Horne (C-H) Inequality 


Consider four numbers 0 < 71,22, 74,275 < 1, and set 
) Placa ly nea 4 ’ 


/ / Lape fs / 
U = 41%2 — 4105 4+ 41,02 +2105 — 2X, — Xo. A-8.10.8 
2 1 1% 1 


We first derive the upper bound U < 0. 
For 21 > 2, we may rewrite 


U = (a1 — 1)a2o + 2) (a2 — 1) + h(a -— £1) 
<0 (A-8.10.9) 


since every term is non-positive. 
For 21 < 2}, we may rewrite 


U = 21 (2 — £5) + (2, — lee — x (1 — 25) 


= 2129+ (x, —1)z2e- 21 


= 21(%2 —1)+ (a, -1)r2 < 0. (A-8.10.10) 


We now derive the lower bound —1 < U. 
For x, > 71, 


U+1=(1—2})(1—22) +2122 + x5 (x, — 71) 
>0 (A-8.10.11) 


since every term is non-negative. 
For 21 > 24, 


U+1=(1—24)(1— 22) — (v1 — 24) (4) — 22) + 2122 


> (a1 — 21)(1 — 22) — (a1 — 24) (25 — 22) + x} x2 
= (#4, — x})(1—24)+ 222 > 0. (A-8.10.12) 


All told, we have 
-1<U <0. (A-8.10.13) 
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8.11 Quantum Teleportation and Quantum 
Cryptography 


Quantum teleportation, in its simplest form, is a process of transferring 
the quantum state of a particle onto another particle. On the other hand, 
quantum cryptography is a process of sending coded messages between two 
parties with the aim of minimizing, or even abolishing, the risk that the mes- 
sage be intercepted by an unwanted third party. These methods rely on such 
fundamental and mysterious aspects of quantum theory such as entanglement 
and on the general basic fact that a quantum system can be in a superposition 
of different states. Research in such problems, under the heading of quantum 
information, has quite flourished in recent years and uses, in the process, the 
very basics of quantum theory in important potential applications. Quantum 
teleportation and quantum cryptography are discussed next. 


8.11.1 Quantum Teleportation 


Suppose a person — traditionally called Alice — has a spin-1/2 particle 
(or any two-level system), call it particle 1, in a state 


a (8.11.1) 


|a|? + |G]? = 1, and she wants another person — traditionally called Bob — 
at a distant location, to have a particle, of spin-1/2, call it particle 3, in this 
state. Quantum theory provides an answer to the transfer of the state of one 
particle to another one as follows. 

Consider two particles, each of spin 1/2, call them particles 2 and 3, where 
3 denotes the particle in question above. Suppose particles 2 and 3 are in the 


entangled state 
)=510),0,-0,0] om 


i.e., one has no information on the states of particles 2 and 3 except that they 
are in opposite spin states. One of these particles, referred to as particle 2, 
is sent to Alice, and the other, referred to as particle 3, is sent to Bob. Alice 
wants to transfer the state in (8.11.1) of particle 1 with her to Bob’s particle 
3. The entangled state | W23) in (8.11.2) between particles 2 and 3 plays a 
key role in such a transfer. Because of this, particles 2 and 3 are referred to 
as entangled ancillary pair of particles. 

If Alice succeeds, by a specific measurement, of putting particles 1 and 
2, with her, in the entangled state |Yi3), then particle 3, with Bob — at 
a distant location from Alice — will be projected into the initial state of 
particle 1 in (8.11.1) as shown below. 
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To the above end, we introduce the four orthogonal entangled states: 


«-410.020,0) os 
0-419,0,20),0) «9 


These four states are referred to as Bell states. Also a measurement which 
puts two particles in one of these states is referred to as a Bell state mea- 
surement. 

Now it is straightforward to show by some algebra (see Problem 8.25) 
that we may write 


(3) 9s) —-2 (9), lar+(}), 


2 ah JO) + ea) jen} (8.11.5) 


where the left-hand side represents the initial state of the three particles 1, 
23: 

Clearly, all Alice has to do is to carry out a Bell state measurement such 
that to put particles 1 and 2 in the entangled state |Wi5). Then from (8.11.5), 
we see that particle 3, with Bob, will be necessarily projected onto the state 


er (8.11.6) 


as was initially for particle 1 (see also (8.11.1)). 

If it is unknown onto which of the four states |W7,), |Vy), |®y2), |®r2) 
Alice’s Bell state measurement of particles 1 and 2 are projected, then ac- 
cording to (8.11.5) there are equal probabilities of 25% that they could be 
found in any of the four states. 

What happens if Alice’s Bell state measurement yields the entangled state 
|W.) or |®{) or |®F,) instead of |Wz,). In such cases, Alice would inform 
Bob, by classical means, such as by telephone, to apply the unitary operator 
(up to phase factors) 


03 OF O01 OF O02 (8.11.7) 


respectively, on the state of particles 3, with him, since 


(A), Gy OG GO), 


to put his particle 3 in the initial state of particle 1. 
The application of any such a unitary operator is a dynamical process. 
For example, if w is the magnetic dipole magnetic moment of the particle, 
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such as a neutron, one may formally apply a magnetic fields in the directions 
“3, £1, V2, respectively, in reference to (8.11.5), for specific periods of times, 
to achieve such transformations (see Problem 8.26). 

In regard to the teleportation carried out above, up to a unitary trans- 
formation to be carried out by Bob, it should be noted that the initial state 
(8.11.1) of particle 1 need not be known neither to Alice nor to Bob. Also 
after particles 1 and 2 are entangled in a Bell state, particle 1 would not be 
in its original state, and hence particle 3 is not a clone. 

The identity in (8.11.5) leads also to the following interesting result that 
if particle 1 is initially entangled with another particle, call it particle 4, a 
Bell state measurement by Alice which puts particles 1 and 2 in an entangled 
state, automatically puts particles 4 and 3, with the latter particle with Bob, 
in an entangled state as well. To see this, suppose that particles 1 and 4 are 
initially in the entangled state |Wra)s then by setting a, 8 = 0 or 1 in (8.11.5) 
and multiplying the resulting equations by 


G). or a (8.11.9) 


as the case be, one obtains (see Problem 8.25) 


es et 1 eae 
[¥ ra) ax) = 3 sa) Naa) + faa) IW) 


+ |©3,) |®5) — |®F,) |et) \ (8.11.10) 


Again a Bell state measurement which puts particles 1 and 2, with Alice, 
in the state |W7,), then projects particles 3 and 4 in the entangled state |W3,) 
initially shared by particles 1 and 4. Such a quantum teleportation has been 
referred to as an entangled swapping.*9 

For earlier investigations, see the work of Bennett et. al.°° Several experi- 
ments! have been carried out on teleportation, with most of them involving 
photons, confirming this fascinating predictions of quantum theory. For the 
rather rapid progress in the field, one may consult the research journals. 


8.11.2 Quantum Cryptography 


Before going into the role of quantum theory in modern cryptography 
and its future, we first discuss a classic cryptographic system the so-called 
‘“Vernam cipher” or the “one-time pad scheme” introduced by Gilbert Vernam 
in the thirties which provides perfect secrecy of communication. 


49 Zukowski et al. (1993). 

°° Bennett et al. (1993). 

°1 Cf. Bouwmeester et al. (1997); Nielsen et al. (1998); Miranowicz and Tamaki 
(2002). 
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Suppose Alice wants to send a message to Bob at a distant location. 
Such a message may be sent in code consisting of 1’s and 0’s called bits. 
Letters, numerals, a blank space, the comma and so on are represented by 
ordered sequences of 1’s and 0’s making up a message which may be then 
readily read. In order to avoid an unwanted outsider — traditionally called 
Eve — from reading the message, however, Alice also produces another string 
of bits, randomly chosen, called the key as long as the message. She then 
encodes each bit of her message using the key generated thus introducing a 
scrambled text by using the following simple rule. 

Let n denote the number of bits in her message. She adds the k‘” digit of 
the key to the k‘” digit of the message, with k = 1,2,...,n, thus generating 
a scrambled message, using the rule that 

0+0, is replaced by 0 
0+1, is replaced by 1 


8.11.11 
1+ 0, is replaced by 1 ( ) 


1+1, is replaced by 0 
For example, 


key: 0110100 
Message: 1011110 >. (8.11.12) 
Scrambled Message: 110101 0 


Now she transmits the scrambled message publicly (i.e., by telephone, 
radio,...) to Bob. She must also provide the key to Bob in secrecy in order 
that he may read the actual message by using the simple rule given above 
involving the key and the scrambled message. 

The scrambled message is of no use to Eve if she does not have the key to 
decode the message. [Eve may guess and make up factorials of keys leading, 
in general, to different messages. | 

The practical difficulty of the above (classical) procedure is that to trans- 
mit a message in secrecy one has to transmit a key in secrecy. Also what 
happens if Alice does not know the location of Bob and the two have not 
even met before to share a secret key to communicate in secrecy? On the 
other hand, should she trust a messenger who, at an earlier time to the avail- 
ability of the message, was asked to deliver her key to Bob? For example, 
the messenger may make a copy of the key without even “disturbing” it, i-e., 
without even anybody else knowing it. This is unlike the situation in quantum 
physics where a measurement may, in general, disturb the system that has 
been tampered with. Finally it is not advisable to re-use a key in subsequent 
transmissions of messages if secrecy is of concern and hence the nomenclature 
“one-time pad scheme”. 

Now we will see how quantum theory may be used to generate a random 
key that Alice and Bob may share and how they may test the presence of an 
intruder. 
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For definiteness consider the scattering process in Figure 8.5 of the pair 
of spin 1/2 particles, with the pair put in the singlet state (8.10.20), such 
that, after scattering, one particle from each pair goes to Alice while the 
other goes to Bob. After scattering the particles from each pair are supposed 
to travel (in opposite directions) along the x-axis as depicted in the figure. 
When a particle reaches Alice she randomly chooses the y- or z-axis, by 
tossing a coin for example, to measure the spin direction of the particle in 
question. Bob, likewise, carries out a measurement of the spin orientation 
of the corresponding particle by randomly choosing the y- or z-axis. They 
both record their results by assigning 0 if the spin of a particle is opposite in 
direction to the axis chosen and 1 if it in the same direction. 

After all the measurements (assumed to be sufficiently large in number®”) 
have been made, Bob then selects randomly a subset of his ordered measure- 
ments, again assumed to be large in number, and communicates publicly 
with Alice which measurements he has selected (e.g., 24, 5”, 8",...) and 
the corresponding axes (y or z) he had chosen and the results recorded for this 
subset of ordered set of measurements. Alice then checks the results she has 
recorded for the same axes, say N in number (assumed to be large enough), 
common with Bob’s in the corresponding subset of the ordered set of mea- 
surements. If her results are exactly opposite to those of Bob’s, as imposed 
by the entanglement of their pairs of particles, for these common axes, then 
with some confidence, which will be quantified and estimated below, she will 
announce publicly to Bob that no intruder has spied upon them. 

Alice and Bob then communicate publicly the axes (y or z) they had 
chosen earlier in their remaining ordered set of measurements but not of their 
corresponding results obtained. Finally they select together randomly entries, 
equal in number to the number of bits, say n, making up Alice’s message, 
from the ordered set of this remaining measurements having common axes to 
both of them. For example, for the purpose of an illustration only, suppose 
that n = 3, and the axes chosen in the remaining set of ordered measurements 
are as follows 


Order of measurements 
1346791011 15 
Alice’s chosen axes yzzyzy zz Y 
Bob’s chosen axes yzyzzzy Zz y 
A possible selection y z z 
A possible selection — z Zz y 


then two possible selections of entries with axes common to Alice and Bob 
are given above. 

Assuming that they have not been spied on, and a selection of entries have 
been made, as just described, Alice knows the results (0 or 1) obtained by 


52 This should be much larger than the number of bits of the message Alice wants 
to send to Bob. 
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Bob, corresponding to the entries, by merely examining her own (opposite) 
results. The string of bits of n entries appearing in this selection in Bob’s 
measurements may be then taken to be the key. A scrambled message may 
be sent by Alice to Bob, using the key obtained above, by the “Vernam 
cipher” method, thus completing her task. Therefore it remains to quantify 
and estimate her degree of confidence of not being spied upon. 

To the above end, let z, y denote unit vectors along the z, y axes, respec- 
tively. We use the notation Paple1Z, €22], with ¢1,€2 = +1, for the probability 
that a measurement of spin along z for the respective particles, in a given 
pair, by Alice and Bob are found along ¢12, €2% respectively. The probability 
Paple1y, €2y] is similarly defined. Given that Alice has obtained the spin of 
her particle, in a pair, to be along €,z for a spin measurement along z, we 
also introduce the conditional probability Pg/,|é2%/é12] that Bob obtains for 
his particle, in the pair, to be along €2z for a spin measurement along Z. 

In the absence of an intruder, we have according to (8.10.24), (8.10.26), 


Pap [e1Z, py A| 


Bes ee (8.11.13) 


Ppale2%/e1Z] = 


Paple1y, €29] 


Palesy] = d(€1, —€2). (8.11.14) 


Pgyale2y/e1y] = 

If po denotes the probability that Alice picks the z-axis for a spin mea- 

surement, and hence a probability of (1 —p4) that she picks the y-axis, then 

in reference to a spin of observations in the N pairs of observations with com- 

mon axes for Alice and Bob, the probability that Bob gets an observation for 
spin in opposite direction, for his particle, to that of Alice’s is given by 


Pg|spin opposite to that of Alice’s] 


- > (Panle2, exélp! + Paslery,eo¥] (1 - pf) ) =1. (8.11.15) 


€1,€g=+1 


In the presence of an intruder, (8.11.15) is not necessarily true. To inves- 
tigate this pertinent situation, suppose that Eve, the intruder, is aware that 
Alice and Bob have planned to measure the spins of their respective particles 
along z or y. Accordingly, before each particle reaches Bob, Eve measures its 
spin along the z or y-axis as well, with the latter axes chosen, for example, by 
tossing a coin. By doing so, she puts the particle, in question in some “initial” 
state before it reaches Bob. 

With the above tampering by Eve, if for the N measurements recorded 
by Alice, mentioned earlier, corresponding to the same axes common with 
Bob’s for the ordered set of pairs, in anticipation of preparing a key, Alice 
finds at least one result not to be opposite to the corresponding one of Bob’s, 
then she will be certain of the presence of the intruder, and the process is 
discontinued. On the other hand, if for every one of the N measurements her 
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results are opposite to the corresponding ones of Bob, she will fail to detect 
Eve’s presence. 

Accordingly, if p denotes the probability that for any one of the N mea- 
surements, Bob and Alice obtain opposite results for spins, corresponding 
to either of the common axes y or z, then the probability that Alice fails 
to detect Eve’s intrusion, after examining the N measurements, is p’. The 
probability of Alice’s detection of the intruder is then 


Prob|detection of intruder] = 1 — (p)%. (8.11.16) 


The latter is referred to as the power of the test, with the test being the 
detection of the intruder. Such a probability close to one, which will be the 
case for p < 1 and N sufficiently large, quantifies Alice’s confidence in her 
statement that “no intruder has spied” on Bob and her. 

To determine p, let 


n(0) = (0, sin J, cos ¥) (8.11.17) 
then, with the letter E corresponding to Eve, the following are readily ob- 


tained, by using, in particular, (8.10.24), (8.10.27), (8.1.16), if Eve carries her 
measurements along n(v): 


z: Common Axis of Alice and Bob 
State “Prepared” 
by Eve 


e'7/4 cos 3 Pp/E[-2/+A(9)]=sin? 
etin/4 sin g Paplt+2/+A(9)|=cos? 


eit/4 sin 3 PB/p 
etit/4 cos g Pg/E5 


Paplt2,+a(9)|=3 sin? $ 
oO 
2 


Pap[-2,+8(0)|=$ cos? 


VIS NIS 
ae 


—2/—A(0)|=cos? 


Pap[t+2,—A(0)|=4 cos? 3 
Pap[—2,—A(8)]=$ sin? 3 


NIS no 
a 


+2/—n(0)]=sin? 


y: Common Axis of Alice and Bob 
State “Prepared” 
by Eve 


Papl[t+y,t+a(0)|=1(1-sin 9) e~it/4 cos 8 Pg/nl-9/+A()]=4 (1-sin 8) 
Pap[—-y,t+a(9)|=4(14sin 9) etin/4 sin 8 Paalty/+a(9)]=% (1+sin 9) 


Papl+y,—f(0)|=4 (14sin 9) —e7i"/4 sin B Pg/nl-¥/—-A(9)]=$ (1tsin 8) 
Pan[-¥,—A(0)]=¥ (1-sin 9) etin/4 cos 9 Pg/p[+¥/—A(9)]=4 (1-sin 8) 
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Hence for 0) = 0 or 0 = 7/2, if py denotes the corresponding probability 
that Eve picks the direction specified by n(0) or n(a/2) for a spin measure- 
ment of a particle before it goes off to Bob, then 


p= +L (Pavlerd.com(v)]Psyxl—212/eon(0)p8 


0=0,7/2 €1,€2=5 


+Paple1y,e2n(V)] Pg yp[-e1¥ /ean(v)] (1 - pe) ) Do 


(8.11.18) 


lI 
ao’ 
ee 
| 
1 NOR el 
3 
xP 
NS 
| 
—, 
1 NOR el 
| 
3 
> 
7 
3 
Xb 


where p® = Dy 


Clearly if pS ~ 0 and p® ~ 0 or pS ~ 1, p® ~ 1, then p ~ 1 which will 
not be advantageous to Alice. On the other hand for p4 = 1/2, which, for 
example, corresponds to the case of tossing a balanced coin for a large sample, 
p = 3/4 irrespective of what the value of p® is corresponding to Eve’s choice 
of the z- or y-axis. For this rather natural ’choice’ of ps = 1/2 by Alice, we 
obtain from (8.11.16), (8.11.18) 


Prob|detection of intruder] = 1 — (3/4)% (8.11.19) 


which is already ~ 0.99982 for N = 30 (see also Problem 8.27). 

There are other schemes, improvements, and other additional details that 
have been and are being developed in this rapidly growing field, and the reader 
may consult the literature?’ on such developments including the research 
journals. 


8.12 Rotation of a Spinor 


This section addresses the important question of the observability of the 
overall minus sign (§2.8, (2.8.68), (2.8.70), §8.1) acquired by a spinor for 
spin 1/2 under the operation of rotation through 27 radians. An interesting 
way of investigating the nature of this phase change is by the application 
of a Ramsey (§8.8) like method’ with oscillatory fields causing transitions 
between hyperfine energy levels of a molecule. 

In §8.1, we have seen how the nature of a spinor arises for a two-level 
system, as part of a multi-level system. For a given time interval of length 


°3 Cf. Bennett et al. (1992); Tittel et al. (1998). 
°4 Klempt (1976). 
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T, we consider an interaction which may cause transitions between only two 
such states, which we denote by |0), |1). 

Here we find it more convenient to represent the states |0), |1) by (1 O); 
(0 i, respectively, rather than the other way around. The interaction 
Hamiltonian, during the time interval in question, will be taken to be of 
the form 


Ay(0,1) = Ee ") (8.12.1) 


If Eo, E, denote the energies associated with the states |0), |1) then the free 
Hamiltonian, restricted to these two states may be written as 
(Eo + £1) 


Bye 
H(0,1) = 5 s iG 


3 (8.12.2) 


where o3 is a Pauli matrix. 
We consider the system to be initially in the state |0), and subject it to an 
interaction with an alternating field described by an interaction Hamiltonian 


iwt 
H7 (0,2) = eee ee ) ; 0 <t<T (8.12.3) 


acting during the time interval specified, which may cause transitions between 
|0) and some other state |2) only. Here b; is a complex number, and we denote 
the energy of the state |2) by Ey The interaction in (8.12.3) defines the first 
Ramsey zone. 

For a time T, following the interaction in (8.12.3), we subject the system 
to an interaction with Hamiltonian which may cause transitions between |0) 
and state |1) only 


iwot 
Ay(0,1) = (ies a8 am ) ; T<t<7T4+T (8.12.4) 


with bo is taken to be real, and wo to satisfy the resonance condition wo = 
(E, — Eo)/h. 

Finally, we subject the system to a second Ramsey zone with interaction 
Hamiltonian which may cause transitions between the states |0) and |2) only 
for a time interval of length 7 


0  fibget 
Hr (0,2) = Ge ° ) , T+T <t<2r+T. (8.12.5) 
We take 
by = be'®, bo = bei? (8.12.6) 


thus introducing a phase difference between the fields in the two Ramsey 
zones.°° The interaction Hamiltonian in (8.12.4) is crucial for the subsequent 


°° Klempt (1976); Ramsey and Silsbee (1951); Ramsey (1990). 
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considerations, and introduces, during a time interval of length T’, an inter- 
action occurring within this time interval between the two Ramsey zones, 
specified by (8.12.3), (8.12.5), as opposed to a free one of the type treated in 
88.8. 

To describe the dynamics of the system, we are led to find the expression 
for the unitary operator U(t,t1; 7) giving rise to the time development from 
some t, to t for a generic system specified by the integer j (7 = 1, 2): 


: Qo = Eo hbjelwt ao 
ih 2) = (eee By a (8.12.7) 


The same analysis as carried out in 88.8, leads to the following explicit 
expression for U(t,t1; 7): 


_ — (Uoolt, t13 9) Uor(t, t13 9) 
U(,t34) = Coe Usa(t,tr; 4) oe) 


where 


A os 
Volts t1:4) = eos at t,) —i— sina(t n) eae 2) (19.9) 


2a 
b ies 
Upa(t, t133) Scale sin a(t — tei ei (t+) +45) (8.12.10) 
a 
b G j( we | ‘ 
Uoo(t, tr; 3) = —i Ol sim a(t — ty)eniatt—t ei H+) +2) (8.12.11) 
a 
CEN 3: sation) eG) 
Uoo(t, t1; 7) = |cos a(t — ti) 4 i sina(t — t1)]}e Veen 
(8.12.12) 
FE, —E 
A=w-— - (8.12.13) 
1/2 
A 2 
a= (3) + |d|? (8.12.14) 
b; = |blel® (8.12.15) 
Eig + Eo 
pyed palais 8.12.16 
a ( ) 
Hence the initial state 
|(0)) = |O) (8.12.17) 
develops in time T to 
|2o(7)) = ao(r) |O) + a2(r) |2) (8.12.18) 


by going through the first Ramsey zone, where 
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ao(7) = Upo(r, 0; 1) (8.12.19) 
a2(T) = U2o(7, 0; 1). (8.12.20) 


It is interesting to note that the unitary operator U(t,t1;7) in (8.12.8)— 
(8.12.16), corresponding to the Ramsey zones, may be rewritten in the com- 
pact form (see Problem 8.28) 


U(t, ti; 7) = exp —i(t — t1)Ao(0, 2) exp Ex - to] 


x exp |[—ia(t — t))n; - o] (8.12.21) 


where Ho(0,2) is defined as in (8.12.2) with E, replaced by Eo, 


nj = (Geos [oj tw(t—t1)], iF (8.12.22) 


According to the interaction (8.12.4), acting in the time interval [7, T+T), 


the state |7)(7)) in (8.12.18) develops in time before going through the second 
Ramsey zone, via the unitary operator 


i 0 
U(r + T,T) = exp |- 5.7 Hol, | exp [-i5n . o| (8.12.23) 


as following from (8.12.9)—(8.12.16), (8.12.21), (8.12.22), at resonance wo = 
(E, — Eo)/h, and hence with A — 0, a — |bo|, where 


= |bo|T (8.12.24) 
n = (coswoT, — sin wo, 0) (8.12.25) 


and we recall that (see (2.8.7), (2.8.4)), 


cos — —ie! 
og 2 
exp [ign o| = (8.12.26) 


That is, during the time interval 7 < t < 7+ 77, a spinor, in the presence 
of the interaction in (8.12.4), gets rotated by the angle V, and the state |w(7)) 
in (8.12.18) develops in time, before the system enters the second Ramsey 
zone, to 


0 0 
lb(r + T)) =e 1207! cos 5 wo (F/O) ie 7? eer sin 5 |1) 


+ e iB2T/h ao (r) |2) (8.12.27) 
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as follows directly from (8.12.23)—(8.12.26), and the fact that the state |2) 
develops freely in the time interval in question of length T. 

By going through the second Ramsey zone, the state |w(7 + T)) develops 
in time, according to the unitary operator (8.12.8)—(8.12.12), with 7 = 2, to 


\b(27 + T)) = ap |0) + a |1) + ag |2) (8.12.28) 
with 


0 
ag = Ugo (27 + T,7 + T; 2) Qrihar hg. (7) cos 3 
+ Upe(27 + T, 7 + T; 2) ce 12F/" a, (7) (8.12.29) 


ay = —ie 11 (T +7) /P Q—iwor sin (8.12.30) 


‘ v0 
ag = Un9(27 + T, 7 + T; 2) e107 /Pay(7) cos 5 


+ Uo9(27 + T, 7 + T; 2) e 12F/" a, (7) (8.12.31) 


and ao(T), a2(T) are, respectively, defined in (8.12.19), (8.12.20). 

In particular, az denotes the amplitude of a transition from the state |0) 
to the state |2), just after going through the second Ramsey zone, if the 
spinor representing the two-level system, corresponding to the states |0), |1) 
is rotated by an angle V. 

The above transition probability is worked out from the expression of a2 
in (8.12.31) to be 


Py([Transition |0) — |2)] = |a2|? 


|b? 2 2 An. 
— sin* aT | Fy cos* at — — Fo sin 2ar 
a? 2a 


DON a) es 
af (=) F_ sin ar| (8.12.32) 


where 


v0 
Fy (cos 7 2 — 6x) = 1+ cos? . + 2cos 3 08 (TA +¢2—¢1) (8.12.33) 


v0 0 
Fo (cos 3? ob2 — 61) = cos 5 sin (TA + bg - o1). (8.12.34) 

We may compare this probability, for two cases of interest corresponding 
to no spinor rotation 0 = 0 and to the case corresponding to a full rotation 
for ) = 27. These are respectively given by 
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lel? 2 eT 
Ps = 4—, sin® ar |cos at cosQ — — sinarsinQ (8.12.35) 
0=0 a 2a 
|b? . 2 es ‘ 
Pp» = 4—; sin“ ar |cosarsinQ + — sin at cosQ (8.12.36) 
o=20 a 2a 


and are obviously different, in general, where 


OS 5(TA + $2 — $1). (8.12.37) 


In particular, one may consider the difference in the transition probability 
Ps between two cases: one with a phase difference ¢2 — ¢, = 7/2, and one 
with the reversal of the phase difference ¢2—¢, = —7/2. Denoting the change 
in the transition probability on such reversal of the phase difference by 


AP» = P. a 8.12.38 
é i g2—o1=1/2 ‘ g2—-¢1=—7/2 ( ) 
we have from (8.12.32) the explicit expression 
|b)? + 2 A ‘i 232 2 . 
AP» = 4—> sin* ar — ) sin“ at — cos‘ aTt| sinTA 
a 2a 

A 0 
— — sin 2ar cos TA > cos = (8.12.39) 

2a 2 


leading to a complete reversal in sign from 7? = 0 to J = 27, of the change in 
the transition probability under a reversal of the phase difference ¢2 — $1 = 
+7/2 between the fields in the two Ramsey zones. This sort of experiment 
has been performed”® and is consistent with the reversal of sign of the change 
in the transition probability. A 4a radian rotation restores everything back 
to the 0 = 0 case. 


8.13 Geometric Phases 


8.13.1 The Berry Phase and the Adiabatic Regime 


Consider a time-dependent Hamiltonian H(t) which varies in time t over 
some interval (0, T], such that for each such t, H(t) has a discrete non- 
degenerate spectrum, with eigenvalue equations 


56 Klempt (1976). For some other experimental studies with different procedures, 
see Rauch et al. (1975); Werner et al. (1975); Byrne (1978); Klein and Opat 
(1976); Stoll et al. (1977). 
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A(t) |tm(t)) = En(t) |m(t)), (tn (t) [Mm (t)) = Smn- (8.13.1) 


A process is said to be an adiabatic one, if the change of the Hamiltonian 
in time is so slow, measured by a large value of the length T of the time 
interval (i.e., T — oo), such that if the system is initially the state |7,,(0)) 
of H(0), then at any later time t, in the above interval, it is found, up to a 
phase factor, in the state |7,(t)) of H(t).°” That is, the state of the system 
evolves together with the Hamiltonian. In particular, if H(T) coincides with 
H(0), then the system will be found, approximately, in the state |n,,(0)) for 
T large enough, up to a possible phase factor. 

Now suppose that the time-dependence of H(t) is implicit and comes from 
the dependence of the latter on an external time-dependent k-vector R(t), 
with k > 2, ie, H(t) = H(R(Q)), |m(t)) = ln (R(t))). In particular, 


£ lnn(t)) = Vr | (R(t))) - R(t). (313.2) 


Also suppose that R(t) traces a closed curve C in the parameter space in 
which it varies such that R(T) = R(0). That is, the Hamiltonian parame- 
trized by the vector R(t), returns to its initial form H(T) = H(0), as R(t) 
follows a cyclic motion in the parameter space. 

The time T is taken to be much larger than any typical quantum mechan- 
ical oscillation period in the problem. 

In a remarkable paper, Berry®® made the observation, in the adiabatic 
regime, that if the system is initially in the state |~(0)) = |7,(R(0))), then 
\~(T)) develops in addition to the familiar phase factor 


exp (-i i wex() (8.13.3) 


a phase factor which depends on the geometry determined by the path tra- 
versed by R(t) in the parameter space. Specifically, 


i T 
(%(0)|o(T)) = exp (if wex()) exp (iYn) (Mm(0)|tm(L)) (8.13.4) 


with e 
Vn = if dt’ (nn (t")|7mn (t’)) « (8.13.5) 


To derive (8.13.4), note that according to the adiabatic hypothesis, we 
may write 


57 The adiabatic regime, as an approximation method, will be treated in Chapter 12. 
The relevant technical details of this approximation, however, are not necessary 
here to follow the present study. 

°8 Berry (1984). 
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jo() <exp (7 fave) exp (rat) ml) +O(F) (618.6) 


and, as indicated, up to corrections which vanish for the parameter T’ — 00, 
Yn(t) is to be determined. Upon substitution of (8.13.6) in the Schrédinger 
equation 


inS WW) = H(t) WO) (8.13.7) 


and taking the inner product of the resulting equation with (n,(t)|, one ob- 
tains 
in = i (mn (6) tin (4) (8.18.8) 


which formally leads to (8.13.4), (8.13.5). 
To investigate further the nature of (8.13.4), we consider phase transfor- 
mations of the eigenstates 


[rin(t)) —> ln (t)) elon, (8.13.9) 


Since the eigenvalues E,,(t), in (8.13.1), obviously do not change, under the 
transformations in (8.13.9), and 


an — In — (n(T) — 0 (0)) (8.13.10) 


(1m (0)|tm(L)) — (1n(0)|tin(T)) exp i(n(T) — an (0)) (8.13.11) 


we note that the expression on the right-hand side of (8.13.4) remains in- 
variant under the transformations in (8.13.9). Accordingly, by recalling that 
H(T) = H(0), and that |7,(T)) denotes the same state as |7,(0)), we may 
choose, by construction, the eigenstates |n,(t)), by appropriate choice of 
phases, such that |7,,(Z')) = |7,(0)). We may then rewrite (8.13.4) as 


eS AOR 
(w(0)|2b(T)) = exp (-i : WE) Jo (i¥n(C)) (8.13.12) 


with 
Yn(C) = ip (n(R)|Vrin(R)) -dR (8.13.13) 


where we have finally used (8.13.2). 

The phase ¥,,(C) is referred to as the Berry phase, in the adiabatic regime, 
while the familiar factor in (8.13.3) as the dynamical phase factor. We note 
that the integral in (8.13.13) depends on the segments of the curve C and 
not on the durations of time taken to travel over these segments and hence 
the term geometric. 

That exp(i7yn(C)) is a phase factor follows by noting that the normaliza- 
tion condition (7,(R(t))|mn(R(t))) = 1, ie., Vr (an(R)| mn (R)) = 0, implies 
that 
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(7in(R)|VrIm(R)) = — (Wrtn(R)|m(R)) 
= — (m(R)|Vrm(R)) (8.13.14) 


which means that i (7(R)|Wrin(R)) in (8.13.5) is real. [Note that for k = 1, 
the expression on the right-hand side of (8.13.13) vanishes, which explains 
the condition k > 2 stated earlier for the k-vector R(t).] 
n(C) in (8.13.13) is independent of phase transformations of the eigen- 
states 
[tm (R)) — |tm(R)) el), (8.13.15) 


To the above end, we set 

i (tm(R)|Vrtmn(R)) = A(R) (8.13.16) 
and observe that the latter then transforms according to the rule 

A(R) — A(R) —- VRG(R) (8.13.17) 


reminiscent of gauge transformations of a vector potential. The invariance 
of Y,(C) under the phase transformations (8.13.15), then follows upon the 
substitution of (8.13.17) in (8.13.13). 

By the application of Stokes’s theorem, 7,(C’) may be also rewritten as a 
surface integral 


YalC) = [ve x A(R))-dS (8.13.18) 


Since in the transformations (8.13.15), the phases are just c-numbers and 
hence they commute, such transformations are referred to as abelian transfor- 
mations as opposed to the cases when degeneracy is involved in the eigenvalue 
problem in (8.13.1) as will be discussed later. 

As an illustration consider the Hamiltonian of spin 1/2 in an external 
magnetic field B = BR(t), 


H(R(t)) = —pBR(t)-o (8.13.19) 
where R(t) is the unit vector, 
R(t) = (sin @ cos wt, sin@sinwt, cos 6) (8.13.20) 


and (, @ are time-independent. We set —wB = hw /2. 
For t = T = 2n/w, R(t) = R(0), and for any fixed t¢ in the interval [0,T] 
of consideration, an eigenstate of the Hamiltonian H(R(t)) is 


8 
COS 5 


Ins (t)) = (8.13.21) 


A OSE 
sin Ze 


satisfying the eigenvalues equation 
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hug 
H(R(O) Ins () = “$2 Ine() (8.13.22) 
and, |74(T)) = |n+(0)), 
i (ns (thi) = -s(1 ~cos6). (8.13.23) 


The adiabatic regime is defined by the very slow change of the Hamil- 
tonian, that is, for a very large T much larger than the characteristic quantum 
oscillation period Tp ~ 1/|wol, i-e., for |wo| >> w. 

From (8.13.4), we then have 


(1(0)|o(Z)) = exp (—iwoT/2) exp (in(C)) (8.13.24) 


with 
Yn(C) = —7(1 — cos 8) (8.13.25) 


where 27(1 — cos@) denotes the solid angle, subtended at the origin of the 
sphere of unit radius, swept out by the unit vector R(t), along the magnetic 
field B(t), which traces the closed curve C' on the surface of the sphere, with 
the latter surface defining the parameter space (see Figure 8.9). 


Rit) 


Fig. 8.9. —2y,(C) in (8.13.25) of the Berry phase, denotes the solid angle, 
subtended at the origin of a sphere of unit radius, swept up by R(t) while 
tracing the closed curve C' in the parameter space. 


It is interesting to compare the expression in (8.13.24) with the exact 
solution given in (8.1.37), (8.1.38), (8.1.34), to see the adiabatic hypothesis 
at work. 

To the above end, 2 in (8.1.39) is given by 
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Q & |wo| — Iwol cos 0 (8.13.26) 
wo 


for |wo| > w, which from (8.1.34), (8.1.37), (8.1.38), 


6 
COs 5 ‘s 
|ab(t)) By e wot /2,—iwt(1—cos 6) /2 +40 (=) ; (8:13.27) 
sin 2eit [wo] 
2 
With the initial state |(0)) given in (8.1.32), this leads, in the adiabatic 
regime, to the result in (8.13.24). 
Several experiments®? have measured the Berry phase for particles with 
different spins including that for spin 1/2. 
The analogue of the geometrical phase for classical systems has been also 
found.®° The classic example of this is the change in the direction of swing 
of the so-called Foucault pendulum after a full rotation of the earth.®! 


8.13.2 Degeneracy 


We briefly consider the generalization of (8.13.4) to the cases when de- 
generacy®? is involved. 
To the above end, consider the eigenvalues problem 


H(t) |n(t,a)) = E(t)|n(t,a)), a =1,...,N (8.13.28) 


for an N-fold degenerate eigenvalue E(t) for all t in [0,7], where we have 
suppressed a generic quantum number n for simplicity of the notation. As 
before if R(t) in H(t) = H(R(t)) is varied slowly over the long time interval 
of length T, i.e., in the adiabatic regime, and assuming that the given space 
of degenerate levels does not cross other levels, we may write for T’ — oo, 


a(t)) = |n(t, b)) cav(t) exp (-; few a(t) (8.13.29) 


as a linear combination of the degenerate states |7(t,6)) with a summation 
over 6 understood, where 


|a(0)) = In(0, @)) (8.13.30) 


for a given a. 
Upon substituting (8.13.29) in the Schrédinger equation gives 


°° Cf. Bitter and Dubbers (1987); Tycko (1987). 
6° Hannay (1985). 
6! For a demonstration of this among other things, by formulating the classical 


dynamical problem in complex form, cf. Manoukian and Yongram (2002). 
°° Wilczek and Zee (1984). 
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Cab (t) =iAp, (t) Cak (t) (8.13.31) 
where 
Ase(t) = i(n(t,®) [n(6,8)) (8.13.32) 


Equation (8.13.31) may be readily integrated as a time-ordered product as 
in (A-2.5.11), (A-2.5.2) obtaining for the N x N matrix c(t) the expression: 


[e(T)] 3 = (on i “at! A(t!) | (8.13.33) 
+4 ba 


which, in the adiabatic regime, allows us to write 


z i = / / 
(+ if dt’ A(t | ) |e (-if dt’ E,(t ) : 


(8.13.34) 

The state |w,(T)) evolves into a linear combination of the |7(T,6)) for 
any given a with initial condition given in (8.13.30). 

We consider general transformations, which generalize those in (8.13.9), 


\a(T)) = |n(T, b)) 


|n(t, )) —> Que(t) In(t,¢)) (8.13.35) 


(n(t, b)| — (n(t, c)| 25 (t) (8.13.36) 


with Q“ as N x N matrices whose inverses are assumed to exist. Since such 

matrices do not necessarily commute, in general, these transformations are 

referred to as non-abelian ones as opposed to the abelian ones in (8.13.9). 
Under the transformations (8.13.35), (8.13.36), the Ap,(¢) in (8.13.32) 


then transforms as®? 
Apg(t) —> Aca(t) 25) (t) Qea(t) + iQe(t) NG (t). (8.13.37) 


Additional details on the problem of degeneracy are worked out in Prob- 
lem 8.31. 

In the study of the Berry phase above in this section, the Hamiltonian 
was parametrized by a k-vector with cyclic motion in parameter space, and 
the assumption of an adiabatic regime was a key one in its development. The 
initial state of a system in question was also assumed to be an eigenstates of 
the instantaneous Hamiltonian at t = 0 i.e., of H(0). Aharonov and Anan- 
dan®* generalized Berry’s analysis by giving up the adiabaticity assumption. 
They also regard the cyclic parameters as labelling the states, rather than 
the Hamiltonian. This is discussed next. Later on, even the cyclicity of the 
parametric motion will be given up in studying the generation of geometric 
phases. 


°3 Such transformations are well known in non-abelian gauge theories of fundamen- 
tal interactions. 
64 Aharonov and Anandan (1987). 
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8.13.3 Aharonov-Anandan (AA) Phase 


The AA phase arising by even giving up the adiabaticity assumption is 
perhaps best introduced by considering first a couple of examples. 
Consider the time-independent (!) Hamiltonian 


h 
H= > o3 (8.13.38) 
and the initial state j 
cos 5 
\(0)) = 4 (8.13.39) 
sin 5 


which is not an eigenstates of H for general 6. 

The solution of the corresponding Schrédinger equation is elementary and 
is given by 

COs ; e wot /2 
|b(t)) = : (8.13.40) 
sin a eiwot/2 
2 

The following expectation value is readily worked out 


(b(t) |H|p(t)) = 2 95 0. (8.13.41) 


We may then introduce the dynamical phase factor by 


se 7 i at! wee) ive) | a [-Fe cos] (8.13.42) 


and rewrite the solution in (8.13.40) explicitly as 


0 
COs 5 
|ab(t)) aa e7 wot cos 0) /2 e7iwot(1—cos 0)/2 ; (8.13.43) 
eito sin — 
Also for 2/f times the averaged spin, we have 
(w(t) |o|v(t)) = (sin @ cos wot, sin 6 sin wot, cos @) = n(t). (8.13.44) 


Accordingly, for® t = T = 27/|wo|, as the parameter n(t) traces a closed 
curve in the parameter space, of the type shown in Figure 8.9, 


°5 We use the notation 7 rather than T in order not to confuse it with the time 
parameter T used in the adiabatic regime for which the latter is taken to be 
large. 
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Ieb(r)) = 7 (wor cos 0) /2 e em (1—cos 8) |2(0)) (8.13.45) 


where € = signwo. Again in addition to the dynamical phase factor in 
(8.13.42), |w(t)) acquires a geometric phase factor, with 2m(1 — cos@) de- 
noting the solid angle, subtended at the origin of a unit sphere, swept out 
by (2/h) times the averaged spin. Here we note that there is no question of 
adiabaticity assumption, and the Hamiltonian in (8.13.38) is not only not 
parametrized by n(t) but is also time-independent. 

As another example consider the time-dependent Hamiltonian 


30 sind et 
hw be cos 
H(t) = —N(t)-o = — (8.13.46) 
2 2 G iwt 
sin fe —cos 6 
where 
N(t) = (sin 0 cos wt, sin 0 sin wt, cos 6). (8.13.47) 


As in (8.1.41), (8.1.42), we define the angle a, and consider the initial 
state 


a 
cos — 
2 


|w(0)) = 7 (8.13.48) 
sin 2 
which is not an eigenstate of H(0) in (8.13.46). The solution of the corre- 


sponding Schrédinger equation has been given in (8.1.43) to be 
cos = e7iwt/2 
2 itQ/2 
|b(t)) = gs (8.13.49) 


sin = elwt/2 
2 


where Q = (w? — 2wwy cos # + we) \/? defined in (8.1.39). 
To obtain the expression for the dynamical phase factor, we compute the 
expectation value 


(w(t) |H(t)|b(t)) = mo cos(0 — a). (8.13.50) 
Also we note that 
(w(t) |o|v(t)) = (sin acos wt, sin asin wt, cosa) = n(t). (8.13.51) 


Hence we may rewrite (8.13.49) as 


\b(t)) = exp — unt cos(6 — a) exp [—Ser( — cos a) 
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where we have used the fact that 
w — wo cos(9 — a) +2 = w(1 — cos a) (8.13.53) 


(see (8.1.41), (8.1.42). 
For t = T = 27/w, as the vector parameter n(t) traces a closed curve in 
the parameter space, we may write from (8.13.52) 


(w(0)|W(7)) = exp (-5H07 cos(6 — a)) exp (—imw(1—cosa)) (8.13.54) 


recognizing the geometric phase factor as the second factor on the right-hand 
side of (8.13.54). 

More generally, suppose that a normalized state |7)(t)) evolving according 
to the Schrédinger equation 


nS W(t) = H(t) WO) (8.13.55) 


is such that 
[w(7)) = elo |4h(0)) (8.13.56) 


for some T, where exp(ia(7T)) is a phase factor. We denote the closed curve 
II {|(t)) :0 <t <r} by C, where II is the projection map which maps each 
(non-zero) vector to the ray on which it lies. 

Needless to say, for t 4 7, in the neighborhood of 7, |y(£)) is not neces- 
sarily related to |~(0)) by just a phase factor. Accordingly, before using the 
Schrédinger equation to determine a(r)) in (8.13.56), we carry out a phase 
transformation 


[w(t)) — eV [bt)) = x(t) (8.13.57) 
such that 
(x(0)[x(7)) = 1 (8.13.58) 
[x(0)) =e BO), [x(7)) =e F™ [w(r)) (8.13.59) 
(8.13.56) gives (up to additional integer multiples of 277) 
a(r) = f(r) — f(0). (8.13.60) 


Upon substitution of |x(t)), given in (8.13.57), in the Schrédinger equation 
(8.13.55) provides the simple expression 


f)= =; (POA) YO) + 1x) |x) (8.13.61) 


which upon integration, and using (8.13.60), gives 
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WOW) = exw (— Ff” aecweizr(Olw)) ew (if ae co iec)). 


(8.13.62) 
To investigate the nature of the last phase factor, in addition < the dy- 
namical one 


exp Gan dt (a (t)|wW(t ») (8.13.63) 
we define the state vector 
jo(o) =e (+ fae wero) Wwe) (8.13.64) 


thus removing the dynamical phase factor, leading to 


(®(0)|®(7 y= exp (if ae( dt (x(t)lix(t ») (8.13.65) 


we note that due to the normalizability condition (x(t)|x(t)) = 1, (x(t)lix(4) 
is real, hence the expression of the right-hand side of (8.13.65) is a phase 
factor. 

Upon substitution of the expression (8.13.64) in the Schrédinger equation 


(8.13.55), to obtain the resulting equation for 


d)), and taking the inner 
product of the latter with |®(¢)), we obtain the simple result that 


(a) |b) =0. (8.13.66) 


This expression is of some significance.®° It states, in particular, that for 
two neighboring points t, t + At, At ~ 0 


(®(é)|®(¢ + At)) 


= (BHO) {1 — EO" TM) — KOM) KOKO) }} 


(8.13.67) 


as is readily obtained from (8.13.57), (8.13.64), (8.13.61), where we have also 
used the normalizability of |®(¢)). We note that the second term in the curly 
brackets in (8.13.67) is not only of second order in At, due to (8.13.66), but 
is also real, and for At ~ 0, the entire expression in (8.13.67) is real and 
positive. 

Equation (8.13.67) means, in particular, that although, for infinitesimal 
At ~ 0, (®(t)|®(t + At)) does not develop a phase factor, to first (not even 
to second) order in At, as we move from ¢ to t + At, a net phase change 
arises, in general, according to (8.13.65), at the end of the journey for finite 


86 Simon (1983). 
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T as we move from t = 0 tot = T. Such a system, with a net phase change, is 
said to be non-holonomic, and the phase change is known as an anholonomy. 

To gain further insight into the nature of the phase acquired in (8.13.65), 
we use an idea®’ of Pancharatnam®® to study the phase relationship of two 
(non-orthogonal) vectors |®1), |®2) by considering their interference effect by 
the norm squared of their superposition 


\[|©1) + [@2) ||? = (®1|G1) + (G2| Bs) + 2 |(®1|G2)| cos dyy (8.13.68) 


where 


(®1|®2) = |(@1|©2)| ce. (8.13.69) 


The vectors |®;), |®2) are then said to be “in phase” when 612 = 0 giving a 
maximum for the norm squared in (8.13.68) for which is real and positive. 

In the light of the above definition, we note that for two neighboring 
points t, t+ At, At ~ 0, we have from (8.13.67), 


|||®(t + At)) + |®(t)) ||? =2 + 2|(@(t + At)|(t))| (8.13.70) 
implying that |®(t + At)), |®(¢)) are “in phase”, or that the vector |®(¢)) is 
“parallel transported” to itself within the infinitesimal intervals [t,t -+ At] for 


At ~ 0. In spite of this “parallel transport” in infinitesimal steps, for the full 
journey from t = 0 to t = 7, however, we have from (8.13.65), 


II®(7)) +| (0)? = 2-4 2e05 ( fav (xe) 


= 2+42|(®(7)|6(0) etice ‘\lix(t ))) 


(8.13.71) 


implying a net phase change. As a matter of fact it is easy to show that if 
vectors |®;), |®2) are “in phase” and so are |®2), |®3), then |®,), |®3) are 
not necessarily “in phase” (see Problem 8.33). 

Now suppose that the state |(t)) may be parameterized by a vector 
X(t) = (X1(0),..., X%(t)) : |x(4) = |x(X(t))). Then by using (8.13.67), we 
have for the distance squared between the states |®(t + dt)) and |®(t)) 


BCE + ae)) — |@()I? = ( (Wax Vox) = (Waxlx) (xl Vox)) AX*AX? 
(8.13.72) 
with sums over a,b = 1,...,N, and Va = 0/0X*%. Hence we may define a 
metric, up to an overall scale factor, in parameter space by® 


87 Samuel and Bhandari (1988). 

68 Pancharatnam (1956). This author applied this idea in optics, and was success- 
fully extended to quantum mechanics by the Samuel and Bhandari. 

°° See also Provost and Vallee (1980). 
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Jab = (Vax | Vox) — (Vax 1x) (x1 Vox) (8.13.73) 


signalling, in general, the curved nature of the parameter space. 
The AA phase factor, appearing on the right-hand side of (8.13.65), de- 
noted by exp(iyaa(C)), may be rewritten as 


exp(iyaa(C)) = exp (if A,dX*) (8.13.74) 
Cc 
where 
Aa = (x|iVax) - (8.13.75) 


We note that yaa(C) is independent of how fast the different parts of the 
path defined by C is traversed. 
Under a phase transformation 


Ix(X)) — el? |x (X)) (8.13.76) 


the A, transform as 
Aq — Aa — Vaa(X) (8.13.77) 


and we note that’ the metric in (8.13.73) may be rewritten as 
Jab = ((Va + iAa)x|(Vb + iAa)x) (8.13.78) 


and is obviously invariant under the transformations in (8.13.76). 
As an illustration of the various concepts introduced above, consider the 
Schrodinger equation 


d 


ae | 


VD) =HOWO), HW) =FK()-6 (8.13.79) 


where K(t) is an external time-dependent vector, n(t) is a unit vector satis- 
fying the equation 
n(t) = K(t) x n(t) (8.13.80) 


with an initially prepared state as an eigenstate of n(0) - o: 
n(0) + o|(0)) = |Y(0)). (8.13.81) 


Then the solution of (8.13.79) satisfies the eigenvalue equation (§8.1, (8.1.44)— 
(8.1.51)), 
n(t) - o|v(t)) = |¥(d)). (8.13.82) 


The unit vector may be parametrized as 


n(t) = (sin 6(t) cos #(t), sin 6(t) sin A(t), cos @(t)). (8.13.83) 


70 See also Samuel and Bhandari (1988). 
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Suppose that n(t) traces some closed curve on the unit sphere (see Fig- 
ure 8.10) as it moves from t = 0 to t =T with 


6(r) = 0(0), (7) = (0) +2r. (8.13.84) 


With the cyclic evolution just described, the state vector |y(t)) in 
(8.13.57), may be represented by 


Ix(t)) = (8.13.85) 


sin (2) eid(t) 


which deviously satisfies (8.13.82) and (8.13.58). 


aan": 


Fig. 8.10. A closed curve traced by the unit vector n in (8.13.83) over the unit 
sphere. The latter is referred to as the Poincaré sphere. [The corresponding 
sphere in optics involving polarization vectors is referred to as Stokes’s sphere.| 


From (8.13.83), (8.13.57) it is easily verified that 
(Y(t) |o|b(t)) = n(t) (8.13.86) 


and hence the dynamical phase factor in (8.13.63) for the system may be 
written as 


exp (-5 is dt K(t) - n() (8.13.87) 


On the other hand, 


(x(t)lix(t)) = 50 — cos O(t)) O(#) (8.13.88) 
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giving the AA phase factor 


(8.13.89) 
where X! = 6, X* = ¢. This phase factor is equivalently rewritten as 


expi f A-dé (8.13.90) 
Cc 


where A in the latter may be taken to be 


ae sin 0 A 
2(1 + cos 6) 
ne Ny 
= 0 8.13.91 
Ce a a) ee 


and @ is a unit vector in the direction of increasing ¢, 


di = 6d0 + sin dé, (8.13.92) 


8 is a unit vector in the direction of increasing 6. 
In reference to (8.13.72)/(8.13.73) we observe that 


Il|@(t + dt) — |®(¢)) ||? = G + sin? 04") (dt)? 


((d0)? + sin” 0(d¢)”) (8.13.93) 


Ale RTA 


and here we recognize the familiar metric defined on the unit sphere:7! 


71 A reader who has, for example, studied relativity has undoubtedly came across 
the concept of the parallel transport of a vector along a curve (in curved space) 
which states that the covariant derivative of its components are zero. In curved 
space, the ordinary derivative of the component of a vector does not transform 
as the component of a vector, and the covariant derivative is so defined to en- 
sure that the resulting expression does transform as a vector component. In the 
present case, with motion restricted to the surface of the unit sphere, with basis 
vectors 8, o sin @, spanning the tangent plane, at a point (0,¢) on the sphere, 
the covariant derivative of the components of a vector, relative to such a basis, 
read 


dei: ey ae d | 2 2A i eae 
an — sin@cosé ¢V~ =0, ai’ + cot @(V 0+ V 3) =0 
(cf. Misner et al. (1973), p. 340). These equations may be combined as 


< (Vv? —isind Vv’) =icosé $(V" —isind Vv?) 
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1 0 
00) = (9 int) Se a (8.13.94) 


up to the scale factor of 1/4 in (8.13.93). [We note that we could have instead 
considered the curve traced by the vector n(t)/2 on the surface of a sphere 
of radius 1/2.] 

The curl of A in (8.13.91) gives 


1 
VxA=-3= =B (8.13.95) 


1 
2 |r|? 
which may be interpreted as the magnetic field on the surface of the unit 
sphere at rf due to a magnetic monopole of strength 1/2 located at the ori- 
+ 72 
gin. 
The vector A in (8.13.91) may be rewritten as 
1—cos@) - 
i EO eee (8.13.96) 
2sin 6 

which develops a line of singularities for 6 — 7 corresponding to the negative 
z-axis. [Such a line of singularities is referred to as a Dirac string.] We could 
have equally chosen the vector 


1A tes 0h 
Rim ANE CORO) ee pied (8.13.97) 
2sin 0 


in (8.13.90) which has a well defined limit for 9 — 7, but develops a line 
of singularities for 6 — 0, corresponding to the positive z-axis. The vectors 


giving the solution 


(v* —isin@ Vv’) (rT) = (v" —isind Vv’) (0) exp | [cose ja 
0 


= (V' —isin6@ V’) (0) exp lif (1 — cos@) aol 


Here we recognize (the square of) the geometric phase in (8.13.89), arising as a 
consequence of the curvature of the surface of the sphere, associated with the 
vector. The extra 1/2 factor in the phase in (8.13.89) arises due to the spinor 
nature of |®). 

For 0(t) = 2/2, the phase in the phase factor in (8.13.89) simply becomes —7. 
One may argue that by equating this phase with the one formally generated 
in the wavefunction of a particle of charge q satisfying Schrédinger’s equation 
arising from the minimal coupling p — p—qA/c, due to a magnetic monopole of 
strength g: —q ¢ A-di/hc = —2nqg/hc, gives q = hcN/2g, where N = 1 for the 
initial condition chosen for the wavefunction. One may then formally argue that 
by generalizing this to arbitrary spins, that this equality becomes g = hcN/2g, 
where N = 0,+1,+2,..., implying, in particular, the quantization of the electric 
charge (Dirac (1931)) consistent with the experimental observation. See also 
Aitchison (1987). The study of the geometrical phase for arbitrary spins is not 
straightforward, see Lin (2002). 


7 


N 
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A, A’ are well defined in the region ¢ < 0 < 7 —e, for ¢ > 6 and small, 
and are related by a gauge transformation (see Problem 8.35). Clearly, that 
a singularity must exist for a vector A in any gauge is expected since one 
would otherwise obtain V x A = 0 in contradiction with the presence of a 
monopole at the origin of the sphere in the formalism. 

The AA geometric phase, as a generalization of the Berry phase, has been 
observed experimentally”? for various cyclic cases. 

We close this section by studying the generation of a geometric by re- 
laxing the condition of cyclicity’ in addition to relaxing the condition of 
adiabaticity already discussed. 


8.13.4 Samuel-Bhandari (SB) Phase 


Consider the general Hamiltonian for spin 1/2 in (8.13.79). Now suppose 
that n(¢) in (8.13.83) traces an open curve Co, instead of a closed curve as 
in the AA phase, starting from t = 0 and ending at t = 70 (see Figure 8.11), 
where we let 6(0) = 01, o(0) = pi, O(70) = A2, 0(70) = 2. 

Let U(7,0) denote the unitary operator which takes the initial state 
|®(0)), as an eigenstate of n(0)-o, with the dynamical phase removed, to 
the state |®(79)). As before, we are interested in the expression 


(®(0)|®(70)) = (®(0)|U (70, 7) | ®(0)) (8.13.98) 


but now involving non-cyclic evolution as just described, for the non-trivial 
case where the amplitude in (8.13.98) is non-zero. 

The SB phase is obtained as follows. The basic idea is to rewrite’ 
(8.13.98) as 


(®(0)|®(70)) = (®(0)|UL (7, 70)Ua(7, 70)U(70,0)6(0)) (8.13.99) 


where Ug(r, 79) is the unitary operator which is responsible for joining the 
end point (62, 62) to the initial point (61,¢1) by the shortest path (i.e., the 
geodesic’ on the surface of the sphere. The corresponding curve is denoted 
by G in Figure 8.11. 


"3 Suter et al. (1988). 
4 This remarkable result was observed by Samuel and Bhandari (1988) as an exten- 
sion of earlier work of Pancharatnam (1956), in optics, to quantum mechanics. 
© Zhu et al. (2000). 
76 The geodesic equation passing through the points (02,¢2) and (01, ¢1) is given 
by 
cot? = Acos¢+ Bsing 


where 


= (sin $1 cot 62 — sin d2 cot 61) /sin(¢1 — ¢2) 


A 
B = (cos $2 cot 61; — cos ¢1 cot 2)/ sin(¢1 — ¢2). 
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Fig. 8.11. The vector n(t) traces the open curve Co on the surface of the unit 
sphere. The shortest path (the geodesic) joining the point (62, ¢2) to (01, ¢1) 
is denoted by G. The unit vector k is perpendicular to n(0). 


From the earlier AA phase analysis, we have 
Ua(T, To)U (79, 0) |®(0)) = exp (ivan (CyoUG)) |®(0)) (8.13.100) 


where yaa(CoUG) is now given by (see (8.13.90), (8.13.91)) 


yaa(CoUG) = Avde+ | A-dé. (8.13.101) 
Co G 


That is, 
((0)|®(70)) = exp (iyaa(CoVG)) (POUL (r, 70)B(0))._ (8.13.102) 


Now all one has to do in order to establish that a geometric phase arises even 
for the non-cyclic evolution considered is to show that 


(@O)Uh(7, »)|®(0)) (8.13.103) 


is real and positive. Once this is established, one may then infer that the SB 
phase is obtained by closing the open curve Co by the geodesic joining the 
end points of the curve Co and calculate it from the AA phase for the newly 
constructed closed curve. 
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By definition of the geodesic, Ug(r, 7) is given by the now familiar unitary 
operator of rotation by some angle 3 as shown in Figure 8.11. (see §2.8)"7 
given by 

. en eee 
G(T, 7) = cos 3 +ik-osin 3 (8.13.104) 


where k is a constant unit vector perpendicular to n(0). But 
(®(0)|o|®(0)) = n(0) (8.13.105) 


hence the second term on the right-hand side of (8.13.104) does not contribute 
in computing the matrix element in (8.13.103) since k- n(0) = 0. Thus we 
may rewrite (8.13.102) as 


(&(0)|®(79)) = cos ° exp (iyaa(CyUG)) (8.13.106) 


for the non-cyclic evolution. For the non-trivial case considered, where 
(®(0)|®(7)) is non-zero, and for the shortest path joining the end points 
of Co, |8| < a which completes the demonstration of the reality and positiv- 
ity of the matrix element in (8.13.103).”8 
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8.14.1 The Quantum Dynamics 


In the present section, we provide an analytical dynamical treatment of 
the Stern-Gerlach (S-G) effect which is: 


(1) quantum mechanical, as it should be, and takes into account, 

(2) the field equation V-B = 0, where B is the magnetic field in the problem, 

(3) the quantum counterpart of the Lorentz force, 

(4) the two, rather than one, dimensional aspect of the beam hitting the 
observation screen, 

(5) the rather non-trivial correlations that occur between dynamical vari- 
ables, as will be seen to exist, describing the intensity distribution on the 
screen. 


A theoretical analysis of the effect will be given below which takes into 
account all of the above five points just listed.”? We will see that an analytical 
dynamical treatment to the leading order |e|/Whc = \/a in for the electron, 
where a is the fine-structure constant, and for spin 1/2 charged particles (e.g., 


7 See also Berry (1987). 

"8 For additional details on the subject of this section, one may refer to: Shapere 
and Wilczek (1989). Although this reprint volume was assembled way over a 
decade ago, it remains a useful reference source. 

"© This analysis is based on, Manoukian and Rotjanakusol (2003). 
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the proton), in general, leads to a unitary, i.e., positive definite, expression 
for the probability intensity distribution on the observation screen, where the 
magnetic field has a controllable uniform component along the initial average 
direction of propagation of the particle, in addition to a non-uniform, almost 
longitudinal, magnetic field lying in the plane defined by the quantization 
axis, in question of the spin, and the initial average direction of propagation. 

With an initially prepared Gaussian wavepacket, the analysis leads to a 
sum of so-called bivariate normal distributions for the probability intensity 
distribution with non-zero correlations.8° The uniform longitudinal control- 
lable magnetic field, as will be seen, has a dual role. Although longitudinal, 
it reduces effectively the quantum Lorentz force contribution by reducing, in 
turn, the correlation between the dynamical variables describing the proba- 
bility density of observation, and also provides a positive definite expression 
for the latter. We will also see that the analysis applies to neutral particles 
as well. 

The importance of the consideration of the S-G effect for the electron 
itself is evident. To this end, it is worth recalling the statement made by 
Albert Einstein: “We know, it would be sufficient to really understand the 
electron”.®' The difficulty in carrying out a S-G experiment for the electron, 
as such a basic experiment of quantum physics, with a conventional purely 
(non-uniform) transverse magnetic field has been well documented in the 
literature.82:83 

An obstacle in carrying out such an experiment with a standard trans- 
verse magnetic field of Stern and Gerlach, is that the Lorentz force, in the 
classic apparatus, causes an obvious deviation of the particle from its initial 
path thus leading to a blurring of the expected splitting of the beam. Be- 
cause of this, the feasibility of performing the experiment with a longitudinal 
non-uniform magnetic field was suggested many years ago*™ and also more re- 
cently.°° Another aspect of a transversal magnetic field is that a non-uniform 
magnetic field perpendicular to the non-uniform component along the quanti- 
zation axis of the spin, as demanded by the field equation V -B = 0, tends to 
cause, in general, a further splitting of the beam in a direction perpendicular 
to the quantization axis as well. 

We consider the Pauli-Hamiltonian in (8.2.4), written the form 


—u-B (8.14.1) 


with 

8° Cf. Manoukian (1986c), pp. 127, 129. 

8! As quoted in: Rabi (1988). 

82 Pauli (1964). In Wheeler and Zurek (1983), p. 701. Dehmelt (1990). 

83 Batelaan et al. (1997); Gallup et al. (2001). See also Conte et al. (1995). 

84 Brillouin (1928). 

85 Batelaan et al. (1997). See also these papers for the historical development in 
studying the S-G effect and for their analyses of the problem. 
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and g is taken to be arbitrary, where g ~ 2 for the electron, and, e.g., g = 5.59 
for the proton. We choose the vector potential A in the Coulomb gauge, i.e., 


B=VxA, w=yn0, pL g (8.14.2) 


V-A=0. (8.14.3) 
In terms of the dimensionless parameter 
dy = |aP2/he, (q) = sign(a) (8.14.4) 
the interaction Hamiltonian in (8.14.1) may be written as 


A A 1 he 
Ey = €(q),/Qq [Bape ctavaastiea” 4M “oa 2), 
(8.14.5) 


For the electron ¢ = —|e|, a is the fine-structure constant. 
For the initial wavepacket at time t = 0, in the x-description, we take the 
Gaussian type 


1 i x? 
Vo(x) = (On) 8/45 878 exp (5. . x) exp (-) (8.14.6) 


where we use the notation ? for the variance in order not to confuse it with 
the Pauli matrices, and 


Po = (0, po, 0). (8.14.7) 


Here the x2-axis denotes the initial average direction of propagation of the 
particle (see Figure 8.12). 
In the absence of a magnetic field Vo(x) in (8.14.6) develops in time to 


ipo:x/h,—ip2t/2MhA _ Po+)? 
e!Po /Re Pot/ (x rot) 


Wo(x,t) = 373 &XP ; 2 
(277)3/443/2 (1 4 stitz) Ay (1 ae sits) 


(8.14.8) 


and 
2 i (x — ht)" 
\Wo(x)| CERI exp ( Alt 5 (8.14.9) 
242 \ 1/2 
y(t) = 7 (1 + an) (8.14.10) 


We have chosen a common 7-width in all directions to simplify the grouping 
together of the various terms in the analysis. This is not a serious restriction. 
For the magnetic field we choose the simple form 


B= (0,b — Bx, Bars), A= (baz, Bx 1x3, 8x1 22) (8.14.11) 
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Baal 


Fig. 8.12. The x2-axis denotes the initial average direction of propagation 
of the particle with the observation screen being parallel to the x1-x3 plane. 
No magnetic field component is chosen along the 21-axis. 


satisfying (8.14.3), (8.14.2), where b, 6 are some constants. We note that 
the longitudinal component (b — Gx) is along the initial average direction of 
propagation specified by py, with Ox3 denoting the traditional quantization 
axis of the spin. 

We will see that the uniform part (0,b,0) of the magnetic field, although 
longitudinal, may be appropriately set up to effectively reduce the quantum 
mechanical counterpart of the Lorentz-force contribution by reducing, in turn, 
the correlation that occurs between x;, and x3 variables on the screen, and 
also provide a positive definite expression for the probability distribution in 
question. Concerning the non-uniform part (0, Gx2, 3x3), we note that since 
|v2|, a macroscopic distance, is much larger than |.3| (providing a measure of 
the splitting of the beam), this non-uniform magnetic field is almost longitu- 
dinal along the direction of propagation, at the screen. In the above set up, 
as a working hypothesis, we treat the particles as if they are throughout in 
the magnetic field. Otherwise an analytical treatment is not so manageable. 

The dynamics is most elegantly described in terms of the density operator, 
which at t = 0, is given by 


p= ws (4) Mo) (Bol (10) +w-($) Mo) (Wol(01) (8.14.12) 


where 


wy +w_=1. (8.14.13) 


For 
we =w_ =1/2 (8.14.14) 


one would be dealing with an unpolarized beam. For t > 0, the density 
operator is given by 
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ptt) = we (9) [¥e) (Wo (10) 


Agee @ |Wo) (Wo| (0 1) et #/", (8.14.15) 


The probability density is then (for t > 0) 
(x |p(t) |x) (8.14.16) 


and for the probability density, in question, on the screen one may then, most 
conveniently, write it as 


Flerasst) = fave belo(Obs) 


x7" (8) Nl 


=w, f dx 
Glen til (?) |) 


foe) 2 
+u- f dx2 
—oo 


= w+ f4(r1, 03;t) + w_f_(21, @331t). (8.14.17) 


8.14.2 The Intensity Distribution 


With exp(—itH/h) as the time-evolution operator, the following expecta- 
tion values of the Heisenberg operators in the state (8.14.6), relevant to the 
observation screen, to the leading order in ,/@j, are readily obtained: 


(x1(t)) =0 (8.14.18) 
(x3(t)) = sf -o3pt? (8.14.19) 


and the important non-trivial correlation occurring between the dynamical 
variables x(t), x3(t): 


2 22 4-2 
((a(t) = (ax(0))) (2alt) = (va())) = E+ HR 4 
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Upon using the notation 


(alt = facraes glr,as) floret (8.14.22) 


(9)o = (9) (8.14.23) 


where f+(a1,22;t) are introduced in (8.14.17), then a straightforward but 
tedious evaluation of f+(#1,273;t), as given in the appendix to the section, 
consistent with the following constraints, as dictated by the expectation val- 
ues in (8.14.18)—(8.14.21), normalizability and positivity: 


C.1. (a1); =0-+ higher orders 


2 
C.2. (a3); = =— (a3) + higher orders 


C.3. 4/ (x2); = y(t) + higher orders 
J ee 
C.4. ((x3)e - ((xs)i*) ) = 4(t) + higher orders 


C.5. (21 - (rr); ) (xs — (a3), Ee = Aj3 + higher orders 


C.6. fan da3 f(x1,23;t) =1 
C.7. f(x1,23;t) is real and positive 


where Aj3 is defined in (8.14.20), and higher orders stand relative to the 
parameter ,/a@qg. These lead to the following expression for the probability 
density in question: 


Vv det C 
20 


f (#1, 23;8) = 


ts exp (-5 — ai9)C™ (a; - vn) 


+ Ww exp (-3(: + ri9)CY (zx; + v7) (8.14.24) 


where C = (C¥], 4,7 =1,3, C1 =C® =1/77(0), 


C3 = C31 = 1 [qbty? — qBpot?y? _ gBpot*h? (8.14.25) 
y(t) | Me 2M?c 24M 442¢ —_ 
MP 15 
io = in ey 653 8.14.26 
v0 IM ( ) 
and w; = w— = 1/2 for an unpolarized beam. It remains to check to posi- 


tivity constraint C.7 (see also the appendix to this section). 
The probability density in (8.14.24) is a sum of bivariate normal distrib- 


utions and . 
[E] "| = le (8.14.27) 
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is the so-called covariance matrix describing the correlation between 7; and 
x3 on the screen for i # j. )> is a measure of dispersion in all directions 
in the (1,23)-plane. The multiplicative factor Vdet C/27 is the standard 
normalization factor. 

Finally, the constraint C.7 implies that det C > 0, i.e., it leads to a posi- 
tivity requirement. This in turn implies that we should have 


lglt b Bpot  Bpot?h? 
Mc| 2M 24M344 


het 
4M?-4 : 


(8.14.28) 


In reference to this inequality consider first the case with b = 0, i.e., the 
constraint 


ht? 
Ceres IM=A (8.14.29) 
with \alBpot? 42y2 
q|Ppot t 
C= 14 8.14.30 
2M?c ( aims) ( ) 
By setting, 
2 
ec ae (8.14.31) 
t 
va ae (8.14.32) 


with the latter denoting the macroscopic distance from the particle’s initial 
center of the wavepacket to the observation screen, we may rewrite C' as 


4. M Az h2t2 
on l9| (=) t (1 out): (8.14.33) 


For the electron with Az ~ 10-3 m, t~ 107s, Le 1m, y< 1073 m 


(8.14.34) 


—21 
C~ 1.73 x 10” (1+ aa 


v4 


which is a very large number and the positivity constraint (8.14.29) cannot 
be satisfied. On the other hand, the uniform magnetic field (0,b,0) may a 
priori be set at 

B 


b=5L (8.14.35) 


defined simply in terms of the non-uniform magnetic field gradient 
OB, /Oxr2 = —B = —OB3/0x3 (see (8.14.11)), and the distance to the obser- 
vation screen L, independently of any of the details of the spin 1/2 charged 
particle considered and of the (initial) spread +. [The uniform magnetic field 
component b may be, of course, chosen so that C'’ = 0, but this would mean 
to choose a different uniform magnetic field for every different charged par- 
ticle, and a different spread 7, and would not be physically as interesting.| 
The matrix elements in (8.14.25) then simply become 
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O8 =O = aig (52) (5) sa re 


and the positivity constraint 


1 AzLl ht aa ht? 
3lg] y¥ y¥ My? » AM244 


(8.14.37) 


is readily satisfied. For example, for the electron with Az = 107? m, L = 
0.7 m, y = 0.55 x 107° m, t = 4 x 10~® s, corresponding to an initial 
average speed of 1.75 x 10° m/s, a magnetic field gradient 6 = 12.280 T/m, 
and a uniform longitudinal magnetic field b = 4.298 T, the left-hand side 
of (8.14.37) is ~ 0.59. In Figure 8.13, t = 4 x 10~® s, and the probability 
density f(21,273;t) for B = 0, is plotted for y = 0.55 x 10~° m and the 
corresponding density for B 4 0, for the above just given parameters, is 
plotted in Figure 8.14, for an initially unpolarized beam, showing a clear 
splitting of the beam along the quantization axis. [The magnetic field b may 
be chosen to be even smaller. For example, for slower electrons t = 5.93 x 
10-° s, L=0.5 m, b= 1.4 T consistent with (8.14.37).] The asymmetry with 
elongations in the second and the fourth quadrants in Figure 8.14 are easy to 
understand. For the electron e(qg) = —1, C!? = C*! > 0 and the probability 
density gets, respectively, positive amplifying contributions for 73 > 239, 
xz < Oand x3 < —%30, 41 > 0. This graph corresponds to a negative charged 
particle. The formal physical argument for this asymmetry is that it arises 
as a consequence of the direction of the Lorentz force, as determined by the 
so-called right-hand rule, on a charged particle as applied to the transverse 
part of the non-uniform magnetic field. For a positive charge, the elongations, 
as arising in opposite directions, occur in the corresponding first and third 
quadrants. 

We note that the correlation in (8.14.20) and Cl? = C*! in (8.14.25) 
vanish for neutral particles. The analysis carried above (with C!3 = C*" set 
equal to zero), is equally valid for neutral spin 1/2 particles with magnetic 
moment 1 = po, as carried to the leading order in M||/|g|(f3c)!/?, and 
finally leads to the expression 


Fanasit)= gaa {ero [5 | 
ee | oe oe =r) (8.14.38) 


where xo = 3t?/2M. For an unpolarized beam, this is plotted in Figure 8.15 
fort = 4x10~© s, |ag| = 1x1073 m, y = 0.55107? m, showing the difference 
of the densities for the charged and uncharged cases in the presence of an 
appropriately chosen longitudinal uniform magnetic field. 
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-0.003 -0.002 -0.001 0 0.001 0.002 0.003 


Fig. 8.13. Plot of the density f(x1,23;t) for B = 0 y = 0.55 x 107? m, 
t=4x10°%s. 


-0.003 


-0.003 -0.002 -0.001 0 0.001 0.002 0.003 


Fig. 8.14. Plot of the density f(21, x3; ¢) for the electron, based on (8.14.24), 
(8.14.35), (8.14.36) with Az = 107? m, y = 0.55 x 10°? m, t = 4x 107% s, 
L = 0.7 m, corresponding to an initial average speed of 1.75 x 10° m/s, a 
magnetic field gradient @ = 12.280 T/m, and a uniform longitudinal magnetic 
field b = 4.298 T, for an initially unpolarized beam. 
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-0.003 


-0.003 -0.002 -0.001 0 0.001 0.002 0.003, 


Fig. 8.15. Plot of the density f(21,23;t) for uncharged particles, based on 
(8.14.38) for |vo| = 1 x 107? m, y = 0.55 x 1073 m, t = 4x 107® s, for an 
initially unpolarized beam. 


Appendix to §8.14: Time Evolution and Intensity 
Distribution 


To study the probability intensity distribution, we note the following com- 
mutation relations: 


2 . . 
p ihgG ihq 
z j = Fae U3PiP2 + Tze (B22 + b)pip3 
rf 2ihqG ihuG i ihe 
Tie tiP2P3 — qq O2P2 + a Obs 
(A-8.14.1) 
2 2 2 

p p 4h“ qB 
ee ae a — ee A-8.14.2 
ze ar M3c P1P2P3 (A-8 ) 


with all the other commutators with p?/2M vanish or are of higher order. We 
use a variation of the Baker-Campbell-Hausdorff formula (see Appendix I): 
if 


[B,[A, B]] =0 (A-8.14.3) 


[B, [A, (A, B]] | =0 (A-8.14.4) 
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[A. [A, (A, Bl] | =0 (A-8.14.5) 


for two operators A, B, then 


e4t+® — exp (5 [A, B] + - [A, [A, B] 1) ae: (A-8.14.6) 
We let 
itp? 
A oy (A-8.14.7) 
it 
Baa, i (A-8.14.8) 


and note that (A-8.14.3)—(A-8.14.5) hold true to the accuracy retained. Equa- 
tion (A-8.14.6) then gives 


x exp (-F i) exp (-; Ht) (A-8.14.9) 


where Hj = p?/2M is the free Hamiltonian and (exp(—itHo/h)w)(x) = 
wo(x, t) is explicitly given in (8.14.8). 

To carry out the time-evolution operation given in (A-8.14.9) on @% we 
use, in the process, the identity 


exp jia® f(x) = fle +a). (A-8.14.10) 


The operation defined on the right-hand side of (A-8.14.9) on w may be 
then carried out. The analysis is very tedious but straightforward. Up to a 
normalization factor, and the phase factor exp(itp):x/h), (exp(—iH/h)w) (x), 
is given by the expression: 


P P 
Fi (x1,22 = ar 23) t) Fy (x2 — art t) F3(x1, 23; t) (A-8.14.11) 
where we have conveniently isolated the terms dependent on the variable 
2, in the two factors F\, Fp as we have to integrate over it as indicated in 
(8.14.17). With 


Lo = Xo —- —t (A-8.14.12) 


F= —__ = F(t) (A-8.14.13) 


we have 
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4 
F, (21, 75,23) = exp (-2'3F) exp (- ie niwhesF ) 
Mc 
Sihg Gt? 16qGh7t? 
x exp ( Me 112503F" ) exp Bie T1tatak™ 
(A-8.14.14) 
and 
Po tuBor 
x MON Video| Ue re (A-8.14.15) 
2hM mM *? 
t 
F3 (21,23; t) = =o (5 P20) exp (Me ons) exp (Bt yraF ) 


_ 2tqb Sipot? g Bh 9 
Me Tan ) exp (- Sear 


is 2ihqt? 
x exp (45 oaraF ) exp ( a (ote + b) oyraF*) 


x exp 


M? M 


x exp (—(z] + 23)F) . (A-8.14.16) 


Now we have to apply the operator (Fi #23) in (A-8.14.11) to @ - 


(7) , and perform the operations defined in (8.14.15)—(8.14.17). To this end, 


we use the identities: 


~Q)-@e=Q)--Q) wasn 
(3) 4 ; 2) (A-8.14.18) 
Q)=-30)-5() asa 
20)-QeaC)-() usa 


and the orthogonality of (1 Dy (1 =e Also we note that 


F(t) + F*(t) = (A-8.14.21) 


277(t) 
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iF(t) + (F()* = raed (A-8.14.22) 
iF?(t) + (iF?(t))* = si ® eee (A-8.14.23) 


From (A-8.14.11)—(A-8.14.23) we obtain, up to a normalization factor, the 
following expression for the x-integrand in (8.14.17) 


exp (= sap [oP + alt}eraia] ) | exw (a5 Hoe) 


2 
+ exp ( aH He) | f (a1, #3; t) (A-8.14.24) 


where a(t), of order ,/aq, is a function of t only, and up to a multiplicative 
time-dependent constant, 


f (v1, @3;t) = wa fy (@1, £35 t) + w_ f_ (a1, 23; t) 
_ 1 o% 2 iad LAL; 
= W4 exp ( 2 ( E r £3 Vie 42 (t) 


1 2 2 t? Li AjjL; 
eee =o [2 se Lee ON 
(A-8.14.25) 


A summation over the repeated indices i,7 = 1,3 in (A-8.14.25) is under- 
stood, 


gbty” , aBpot?y? qBpot*h? 
Mc 2M2c | 24M4y2c’ 


Ais = Asi = (A-8.14.26) 


A, = Ag3 = 0. (A-8.14.27) 


The expression in (A-8.14.26) is identical to the correlation of the dynamical 
variables x1 (t), x3(t) in (8.14.20). 
In reference to the x2-integral in (8.14.17), we have, from (A-8.14.24) with 


o(t) == ie (A-8.14.28) 


for the shifted x5-integral, 


= V2r7(t) exp ( : [a(t)aya3 + KoF) (A-8.14.29) 
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where [a(t)x123 + b(t)]” is necessarily of a higher order correction in /Oq- 
Accordingly, for the probability density f(x1, 23; t), we obtain the prelim- 
inary expression given in (A-8.14.25). 
To satisfy, in the process, constraint C.2 in the text (see also (8.14.19)), we 
multiply the right-hand side of (A-8.14.25) by an overall normalizing factor 


exp (- (13t?/2M)* /2(0)) giving 


1 2\7? 2 Aja, 
meinen sight) 


1 2 pt? 2 Li Ajj Lj 
+ w_ exp (- PO E + (« + IM (f : 


(A-8.14.30) 


Consistency with the constrains C.1—C.6 in the text necessarily leads to the 
expression in (8.14.24) for the probability density in question below which 
the positivity constraint C.7 has been already analyzed in the text. 


Problems 


8.1. For nj = (sin 0), 0, cos 01), ny = (sin 6,0, cos 6), show that 


[+1/2,m) = |+1/2,m2) cos (45%) + |-1/2,nz) sin (2*) ; 


8.2. (i) Solve the simultaneous equations in (8.1.35), (8.1.36) to show that 
the solutions with the initial conditions (8.1.32), (8.1.40) are given 
respectively by (8.1.37)/(8.1.38) and (8.1.43). 

(ii) More generally, find the unitary operator U(t,0) for the time de- 
velopment via the time-dependent Hamiltonian in (8.1.30). 

[Hint: Find the latter by integrating (8.1.35), (8.1.36) rather than 
by considering the time-ordered product of the exponential of 
(<i fiat H@) /n) i 

8.3. Use (8.1.83) satisfied by the amplitude F(t), self-consistently, to inves- 
tigate the behavior of the survival probability in the truly asymptotic 
limits t — 0, t — co, and state the sufficiency conditions to be satisfied 
in your analysis for the validity of your results. 

8.4. (i) Show that for g = 2 in (8.2.17), the eigenvectors corresponding 
to the ground state energy 0, as follow from (8.2.12), are given in 
(8.2.18) with m =0 or —1, or.... 

(ii) Verify that the supersymmetric generators Q, Qt for g = 2, actu- 
ally annihilate Vo ,-1 (see (8.2.19)). 
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8.5. Verify explicitly the expressions for the polynomials in (8.3.27), 
(8.3.28) to finally establish (8.3.30)—(8.3.33). 

8.6. Repeat the operator analysis for the Landau levels in §8.3 in the gauge 
given by the vector potential in (8.3.35) rather than the one in (8.2.11). 
Comment on the connection between the two solutions obtained cor- 
responding to these two gauges. 

8.7. Solve the corresponding equation to the one in (8.3.38) without first 
putting p1, ps = 0 in (8.3.36). 

8.8. Derive the equations (8.4.3)—(8.4.6). 

8.9. Compute the very small contribution, by introducing in the process an 
ultraviolet cut-off Mc/h for the k-integral, to the magnetic moment 
[ty in §8.4 by considering the real part of (8.4.18). (See §8.5 for useful 
details). 

8.10. Verify the angular integration in (8.5.59) starting from (8.5.47). 

8.11. Derive the expressions for the density operators p“ and the polariza- 
tion vectors P™ as in (8.6.10), (8.6.11), for an arbitrary initial state. 

8.12. For an arbitrary initial state YO) = (cy aN using the general ex- 
pressions for p and P\ obtained in Problem 8.11, and the expression 
for the matrix M in (8.6.18), to find the general forms for the final 
density operator p and the final polarization vector P. 

8.13. Refer to (8.6.27), to find the expression F2(v) + F2(—¥) for the final 
probability densities for an initially polarized beam along the z-axis. 

8.14. (i) Show that the constraints in (8.6.52), (8.6.53) lead to the general 

expressions for (1, 32, 33 as given in (8.6.59), (8.6.60). 

(ii) Complete the proof of part (ii) in the lemma below the constraints 
(8.6.52), (8.6.53). 

(iii) Finally show that (8.6.55), (8.6.56)—(8.6.60) give the structure in 
(8.6.63) for M. 

8.15. Find the expression for the probability density for the scattering of a 
beam of spin 1/2 off a spin 0 target with the resulting spin 1/2 beam 
scattering off a second spin 0 target. This is referred to as double 
scattering. 

8.16. Obtain the final state |®) in (8.7.28) from (8.7.25), (8.7.24) and 
(6.6.21). 

8.17. Show that the state in (8.7.40), with |) given in (8.7.14), develops in 
time T = 7/29 via the Hamiltonian Ahyp in (8.7.43) to the state in 
(8.7.41). 

8.18. Show that the system of equations in (8.8.4)—(8.8.7), lead to the time 
evolution operator U(t, t,) in (8.8.8) from a time ft; to a time t. 

8.19. Write down explicitly the matrix elements of the unitary matrix in 
(8.8.15), then obtain the probability of a spin flip as given in (8.8.17). 

8.20. Insert an apparatus described by a harmonic oscillator, as introduced 
in §8.7, (8.7.24), instead of one described by a spin 1/2 system, in 
reference to the Hamiltonian in (8.8.28), between two Ramsey zones, 
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8.21. 


8.22. 


8.23. 


8.24. 


8.25. 


8.26. 


8.27. 


8.28. 


8.29. 


8.30. 


8.31. 


8.32. 


8.33. 


8.34. 
8.35. 
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to study the nature of interference effect in a spin flip experiment as 
done for the latter experiment emphasizing the differences between 
the two apparatuses. 

Derive the expression for the probability given in (8.8.47), at resonance 
W = Wo, with no meters inserted between the Ramsey zones. 

Follow the elementary model set up to generate the Schrédinger cat 
entangled state in (8.9.3) leading to (8.9.10), to set up a similar model 
to generate the Schrédinger cat state in (8.9.2). 

In the spirit of the actual experiment, reformulate the model developed 
through (8.9.3)—(8.9.9) by considering two levels of an atom and re- 
place, in the process, the interactions given there in the first and third 
stages by appropriate interactions, as done in §8.12, within Ramsey 
zones. 

Show that the state |q) in (8.10.3) is invariant under rotations of the 
unit vector n about any axis. 

Derive the expressions for the expansions of the initial states on the 
left-hand sides of (8.11.5) and (8.11.10) in terms of the other states as 
stated, respectively. 

Develop simple dynamical models, as done in 88.7, to generate the 
unitary transformations specified in (8.11.8). 

Develop the analysis in quantum cryptography in §8.11 including the 
evaluation of the power of the test of detecting an intruder, if an 
intruder is present, for the process depicted in Figure 8.7 for the pro- 
duction of pairs of photons one going to Alice and one to Bob from 
each pair. 

Show that the unitary operators in (8.12.8)—(8.12.16) may be rewritten 
in the compact form as given in (8.12.21). 

Show that |a2|? is as given in (8.12.32) for the transition probability 
Ps[Transition |0) — |2)] as follows from (8.12.31). 

Show that in the adiabatic regime, for which |wo| >> w, the exact 
solution in (8.1.34) goes over to the one in (8.13.27). 

Find the transformation law for the matrix ¢c(T) in (8.13.33) under 
the transformations (8.13.35), (8.13.36). [Hint: It is easier to consider 
(8.13.31) directly rather than the defining equation in (8.13.33).] 
Reformulate the change of sign of a spinor of spin 1/2 under a rotation 
by 27 radians in the light of the AA phase. 

Show that in the Pancharatnam definition (see (8.13.68), (8.13.69)), if 
vectors |®,), |®2) are “in phase” and so are |®2), |®3) then |®,), |®3) 
are not necessarily “in phase”. 

Derive (8.13.67) for the inner product (®(t)|®(¢ + At)). 

Show that the vectors in A, A’ in (8.13.96), (8.13.97), which are well 
defined in the region ¢ < 9 < 7—€ for e > 0 and small, are related by 
a gauge transformation. 
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Green Functions 


Green functions provide information on different aspects of quantum phys- 
ical systems in a unified manner and their importance cannot be overempha- 
sized. To see how a Green function may arise, suppose that (xt|w) denotes 
the state of a system at time t¢, in the x-description, as it has evolved from an 
initial state (x0|7). Upon the insertion of the identity operator in (2.4.13) 
between (x¢| and |W), we may relate (xt|w) to the initial state as follows 


(xt w) = fax! (xt]x’0) (|). (9.1) 
This may be rewritten as 
u(x) = fax! Glt;x'0)Uo(’) (9.2) 
where 
G(xt;x’t!) = (xt|x't’) (9.3) 


defines a Green function describing the evolution of the system. Unlike 
(xt|w), G(xt;x’t’) is independent of the state w, ie., it is not tied up to 
any state thus emphasizing its general character. By definition of (xt|x’t’), 
we may write 


G(xt; x't’) = (x|U(t,t’) |x’) (9.4) 


where U(t,t’) is the time evolution operator from time t’ to time ¢. For a 
time independent Hamiltonian U(t, t’) = exp —i(t — t’)H/h and one has 
[oe) 
G(xt;x’t’) = / dX (x|6(A — H)|x’) exp —i(t — t/)A/h (9.5) 
—0o 
providing the intimate connection between a Green function and the spec- 
trum of the underlying Hamiltonian of the system. 
Also by definition of (xt|x’t’), we have the following completeness relation 
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G(xt;x't!) = [d°x” Gxt; x"t’)G(«"t";x't’) (9.6) 


for t' < t” < t. With G(xt; x’t’) interpreted as the amplitude that the system 
at x’ at time t’ to be found later at x at time t > t’, (9.6) shows that 
for any given t” (t’ < t” < t), the system may go through any point x” 
before ending up at x. By repeated applications of (9.6), writing G(xt;x’t’) 
as integrals of the product of an arbitrary number of amplitudes, the path 
integral formulation will be developed in the next chapter. 

In the present chapter, we carry out a detailed study of Green functions 
and their properties starting with those of a free particle followed by those 
for systems with interactions. The Green function method is also applied to 
study the law of reflection as well as to the celebrated quantum phenomenon 
known as the Aharonov-Bohm (AB) effect. Special attention will be given to 
the Green function for general systems, and for the Coulomb potential, in 
particular. 

Green functions will find important applications in other chapters as well, 
especially, in the path integral formalism of quantum physics, in scattering 
theory and in the theory of multi-electron atoms. 


9.1 The Free Green Functions 


In this section, we consider various aspects of free Green functions mainly 
in three dimensions. Other dimensional cases are treated in the next few 
sections as well as in the problems. 

Consider the general solution of the Schrédinger equation for a free par- 
ticle in three dimensions 


wb (x,t) = (en itHo/ny) (x) 


P< af 2 
= ellP—P*t/2m]/hy (py, t>0 9.1.1 
poe v) (9.1.1) 
where Hy = —h?V*/2m, with initial condition at t = 0 
d°p ix-p/h 
~ (x, 0) = o(x) = ff ——zel™P/y (p). (9.1.2) 
Upon using the inverse Fourier transform of (9.1.2) 


w(p) = f ax’ ®P/y (x) (9.1.3) 


the general solution (9.1.1) may be rewritten as 
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wp (x,t) = [ax G® (xt; x'0) w (x’) (9.1.4) 
(compare with (9.2)), where 
@? (xt;x0) = f aaa (9.1.5) 


satisfying the initial condition 
G° (xt; x’0) rer 6° (x —x’). (9.1.6) 


By using the identity 


[(x— x’) + p— p*t/2m] = — 5 (» mr) bs “| 


changing the integration variable p — p+ m(x — x’) /t in (9.1.5), and car- 
rying out the resulting Gaussian integral, we obtain 

m we im|x — x’|? 
Qmiht pe Dh 


G° (xt;x’0) = ( (9.1.7) 
with t > 0. 

Since we are considering the case t > 0 in (9.1.7), we may introduce the 
following Green function 


m ae im |x — x’|? 
,) p (9.1.8) 


9 (xt x/t) = ; ORE), 
Gy (xt;x't) = 0 (¢ ) (smece MP DAE = 8) 


where © (¢ — t’) is the step function, and will be referred to as the retarded 
Green function. It satisfies the boundary condition 

GUE) =0 for t—t' <0 (9.1.9) 
and the initial condition 


GS (xt; x’t’) ar 6° (x — x’). (9.1.10) 


Because of the presence of the step function © (t — t’) in the definition 
(9.1.8), it follows directly from (9.1.5), that G9. (xt;x’t’) satisfies the differ- 
ential equation 


ling - Ho| GS (xt;x’t’) = ih & (x — x’) d(t-t’) (9.1.11) 


where Hy = —h?V?/2m. 
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We may incorporate the boundary condition, in an energy momentum 
description, of the retarded Green function, by rewriting the solution (9.1.8) 
in the form 


d i(w—a')-p/h 
G°_ (xt; x’t’) =in | ‘ - S 5 ,e—> +0 (9.1.12) 
2m 
where 
(2 —2')-p = (x—x’)-p-—(t-?)p® (9.1.13) 
and 
(dp) = dp°d*p (9.1.14) 


with —oo < p® < c0,—00 < p’ < oo, i= 1,2,3. 

To show that the expression on the right-hand side of (9.1.12) leads to 
the solution in (9.1.8), we go to the complex p°-plane and note that the 
integrand in (9.1.12) has a pole at p? = p?/2m—ie, i.e., in the lower complex 
plane. Hence for t — t’ < 0, we may close the contour in the complex p?- 
plane from above thus avoiding the pole, and noting that the real expression 
—i(t—t’) i Im p®, in the exponential in (9.1.12), is negative for t — t’ < 0 
and for Im p® > 0 in the upper complex plane. Thus the infinite semi-circle 
contour in the upper complex p°-plane gives zero contribution and we obtain 
(9.1.9). 

On the other hand for t—?t’ > 0, we may close the contour of integration in 
the complex p°-plane from below thus enclosing the pole. Also for t—t’ > 0, 
the real quantity —i (t — t’) ilm p®, in the exponential in (9.1.12), is negative 
for Im p® < 0 in the lower complex p°-plane. Thus the infinite semi-circle 
contour in the lower complex p?-plane gives zero (see Figure 9.1). From the 
residue theorem, we then obtain for t — t’ > 0 

d3 


GS. (xt; x’t) = (—2mi) (ih) / (nny 


expi [(x —x’)-p—p*(t—t’) /2m] /h 


(9.1.15) 


which from (9.1.5) coincides with (9.1.8). 
Upon rewriting G°. (xt; x’t’) as 


d3 . , , 

G® (xt;x’t’) = O(t-t’) / Ele Oem, gig) 
(27h) 

the following completeness relation is readily obtained 


pax (ety) G4. Ot x?) = GL tx'?’) (9.1.17) 


where t” is arbitrary such that t >t” > 1. 
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Fig. 9.1. The contour of integration chosen in the complex p°-plane to eval- 
uate the p°-integral in (9.1.12) for t — t’ > 0. The contour encloses a pole at 
p? = p’/2m — ie. For t—t’ > 0, (t—t')Imp® < 0 for Imp® < 0, in the lower 
complex p°-plane, and the infinite semi-circle contour gives zero contribution. 
The p°-integral in (9.1.12) is then evaluated by an application of the residue 
theorem. 


The physical interpretation of (9.1.17) was given in the introduction to 
this chapter. 

As an application, and as an idealistic toy model for the double slit ex- 
periment, suppose that particles are constrained to pass through two given 
points,' as shown in Figure 9.2, before hitting an observation screen. That 
is, we need the constraint 


Px” — 6 (0) 5(0f) [5(f —d) + 8(xf +d] ax” (9.1.18) 


to evaluate the amplitude of finding a particle at a given point on the screen. 
Using the notation x = (21, %2,x3), the geometry in Figure 9.2 together 

with the constraints in (9.1.18) give 

|? =x? + a? F 2rd (9.1.19) 


|x ant x” 


for the upper and lower “slits”, respectively, and 


Ix! — x"? = 1? + 2. (9.1.20) 


! For more realistic computations, allowing a spread, i.e., an extension, of the slits 
see Problem 9.5 
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SCREEN 


Fig. 9.2. Idealistic toy model of the double-slit experiment describing con- 
strained motion. Here the vector x” is restricted as defined through the inte- 
gration measure in (9.1.18). We are interested in determining the amplitude 
that a particle hits the screen at x given that it has “originated” from the 
point described by the vector x’. 


Hence the amplitude of finding a particle on the screen at x = (41, 2,23) 
is given by 


( m Na m Ne ; x? + @? : I? +d 
Imi Omint’) XP \ opp] PUT | Ope 


. «2d . xed 
x lexp (ime) + exp (—im 52°) | (9.1.21) 


where T = t—t”, T’ = t” —t'. This gives a probability density « 
cos? (ma2d/hT) of finding a particle at x, with constructive interference at 
L_ = nthT/md and destructive interference at x2 = (2n +1) ahT/2md for 
n = 0,+1,+2,.... The non-normalizability of this probability density is due 
to the simplified toy model adopted. 

A useful expression for the Green function G9, (xt;x’t’) in (9.1.12) may 
be also obtained by explicitly carrying out the p-integral to obtain a one- 
dimensional representation. 

To the above end, we set 
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x—x’ =p, t-t=T (9.1.22) 
(x —x’)+p = ppcosé (9.1.23) 
d°p = 2ap*dpsin 6d0 (9.1.24) 


and carry out the angular integration in (9.1.12) to obtain 


m dp? _ i ryn | o d adden ieee 
e 
Q7?hp Jo. 2h go (p? — 2mp?® — ie) 


GS. (xt;x’t') = 


(9.1.25) 
or 
dp? S20 i eipp/h 
G® txt) = m —ip 7, d 
2 GRY) Q77hp Joo 2th ; oe, sf (p? — 2mp° — ie) 
(9.1.26) 
where T > 0. 


Regarding the denominator in (9.1.26), for « — +0, we may make the 
substitutions 


p? — Imp — ie [p (V2mp? + ie) [p+ ( 2mp? + ic) | (9.1.27) 


p? —2mp® — ie (p i/2m [p"1) (p+ iv/2m |p") (9.1.28) 


for p? <0. 
Since p > 0, for x # x’, we close the contour from above in the complex 
p-plane to obtain, 


— 
GS (xt;x’t’) = aA i oe e-iP l/h le (p°) eb 27% p/h 
TMP T 


+ © (=p?) eV imiptle/t| (9.1.29) 
which may be rewritten as 


ioe) (0) . Ps, 
GS (xt;x’t’) =i oe eee. Gea ck So (9.1.30) 


where 


eu (x, x’; p°) exp (ev 2mp® |x — x'|) : (9.1.31) 


~ Q7ih |x — x’| 


Equation (9.1.30) is equivalently given by 
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co 4.0 ae 
en (xt;x’t’) = m dp fee ei Rete arnt eb) aM aney area ; 


2rihp Jo Qh 
(9.1.32) 
One may also define the advanced Green function 
G° (xt; x't') = @ (t — £) G® (xt; x’t’) (9.1.33) 
satisfying the differential equation 
lin - Ho| G° (xt;x’t') = —ih d° (x — x’) d(t’ — 2). (9.1.34) 


Its integral representation corresponding to the one in (9.1.12) is given by 


(dp) ei(e—2')p/h 


q 2 ’ 
2mh) po 2 ae 
2m 


The integrand, as a function of p®, has a pole at p? = p?/2m + ie in the 
upper complex p? -plane. Hence for t — t/ < 0 (see (9.1.13)), we may close 
the p® -contour from above thus obtaining the expression in (9.1.33) by an 
application of the residue theorem. For t — t’ > 0, the residue theorem gives 
zero by closing the p°-contour from below. 

From the identity 


1 1 2 
: >—— = —2nid (0° 2 r) (9.1.36) 


2m 
2m 2m 


Go (xt;x't') = inf e— +0. (9.1.35) 


or directly from the definition of the step function we note the relation 
G9 (xt;x’t’) + G2 (xt;x’t’) = G° (xt;x’t’). (9.1.37) 


Almost identical derivations as the one in obtaining (9.1.7) in three di- 
mensions, shows that for arbitrary v dimensions (v = 1,2,...), e.g., for t > t/ 


v/2 7 112 
eal. sd et 9.1.38 
+ (xfixt) (ata) P OR(t— P+) ( ) 
satisfying the boundary condition 
GS (xt; x’t’) a 6” (x —x’). (9.1.39) 


From (9.1.11), (9.1.34) and (9.1.37), we note that unlike G4 (xt;x’t’), 
G°(xt; xt’) satisfies a homogeneous differential equation 


ins _ io| G°(xt;x’t’) = 0. (9.1.40) 


It will still be referred to as a Green function. We next treat Green functions 
in the presence of interactions. 
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9.2 Linear and Quadratic Potentials 


Consider the Hamiltonian 


pe 
H=——-—E Qe 
2m - C ) 


in one dimension. The following commutation relations are readily derived 


[z,p”| = 2ihp (9.2.2) 
[x, [2, p*]] = -287 (9.2.3) 
[p?, ip" |] = 0. (9.2.4) 
Upon setting 
A=iETx/h, B= -iTp? /2mh (9.2.5) 


and using the modified Baker-Campbell-Hausdorff formula (see Appendix I), 
as applied to the problem at hand 


1 1 
exp (A+ B) = exp € [A, B] + 6 [A, [A, BI}) exp Bexp A (9.2.6) 
we obtain 


iT 
(| exp ——-H |e’) 


7 i E°T3 iT Ex! ; 
= exp ree exp = x x 
(9.2.7) 


where we have used the fact that A is a multiplicative operator when applied 
to |z’). 
The last factor on the right-hand side of (9.2.7) may be rewritten as 


li iET? iT p? 
Sek | Saye ee 
3 2hm af h 2m 


ale iET? iT p? \2") 
Bees ohm? h 2m 


=. of ap , iET? -iT p? 
= [| Stelpyivie'vew (For) ew (FF) 28) 


where (|p) = exp (ixp/h) (see §2.4). 
Upon completing the squares in the exponentials in (9.2.8), and integrat- 
ing over p, as in (9.1.5), we obtain for (9.2.8), the expression 
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mx 1/2 
(sane) so 


giving from (9.2.7) 


Nyy 
Se os cam me oT 2 24m 
(9.2.10) 
with t — t’ = T, which obviously satisfies the boundary condition in (9.1.39). 
For the quadratic Hamiltonian, 


m a i peepee) keene FE? 


DP ced 
H = += mu?2? (9.2.11) 
2m 2 
the method just applied to the linear potential is not the best one for this 
case. The reason is that the commutators of the operators p? and x? with 
[z?,p”|, and their commutation relations with the resulting ones and so on, 
go on and on unlike the situation in (9.2.2)—(9.2.4) for the linear potential. In 
this case, it is easier to solve the differential equation satisfied by G (at; 2’t’) 
directly. 
To the above end, we consider the equation with t/ = 0,t > 0 
2 92 
i o + = _ ; mux| G (at; 20) = 0. (9.2.12) 


Quite generally, we may write 
G = exp; [Az? + Br + C] (9.2.13) 


where A,B,C may, in general, be functions of x’ and t, with the solution 
subjected to satisfy the boundary condition in (9.1.39). Obviously, the expo- 
nential in (9.2.13) cannot depend on 2° or on higher powers of 2. 

Upon substituting (9.2.13) in (9.2.12), we obtain the following differential 
equations: 


Ai ea (9.2.14) 
m 2 

. 2AB 
B+——=0 (9.2.15) 

m 

BA capa 
C+—-“=0 (9.2.16) 

2m m 


for the coefficients of x?,x and the constant term, respectively, and where 
A = 0A/0t. Equation (9.2.15) gives A = —mB/2B. 
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By introducing the variable u = 1/B, and replacing the expression A = 
mu/2u in (9.2.14), leads to the equation ti + wu = 0, giving the solution 


A= 


mu (ae (9.2.17) 


2 acoswt + Bsinwt 


To satisfy the boundary condition (9.1.39), with A — m/2t, for t — 0 (see 
(9.1.38), (9.2.13)), we set a = 0. That is, we have 


1 


mw 
A= —cotwt B= — 
2 sae GBsinwt 


(9.2.18) 
and 3, which may depend only z’, will be determined. 

Substituting the expressions for A and B, given in (9.2.18), in (9.2.16) 
and using the integrals 


dz 


sin? z 


= —cotz, ic cot z = In (sin z) (9.2.19) 


gives the general solution 


cot wt ih : 
C= mw? + e In (sinwt) + Co (9.2.20) 


where Cp is independent of t. 
For G, in (9.2.13), we then have 


1 1 | mw x cot wt i 
G= * cot wt Co. (9.2.21 
Vsin wt Par pres eo 7 Gsin wt | ea ( ) 
Upon comparison with (9.1.38), (9.1.39) for t — 0, we may infer that 
1 ) i mw \ 1/2 
Besson ae (=) 9.2.22 
B Mwx exp = Co es ( ) 
giving the final expression, 
1/2 imw 2x2! 
G t: 1! an (=) 2 12 tw — 
Cet) \ anne) a ee eae 
(9.2.23) 


where T=t—t’. 

We apply the expression in (9.2.23) to provide an independent derivation 
of the eigenvalues and eigenvectors of the harmonic oscillator. 

To the above end, we use the representation for the generating function 
of the product of two Hermite polynomials:? 


? See, for example, Morse and Feshbach (1953), p. 786. [The overall factor 
1/V¥1+2z? on the left-hand side of the equation in question here should 
read 1//1— 2? as is easily checked by the normalizability property of 
exp (—2/2) Hn (x) by first putting « = 2’ in (9.2.24). 
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= exp [— (2? +. 27)] 7 ( Hy, (x) Hy (2'). (9.2.24) 


We set z = exp (—iwT), giving 


z 1 1l+2? 1 
ia ee pega ge eee) 


Upon multiplying (9.2.24) by 


(= 


1/2 
ae ) exp (x? + a”) /2 


and making the replacements x — (mw/h)'/? g, vw = (mw/h)'/? x. 


obtain from (9.2.24), and (9.2.23), 


G (at; 2't’) = Yew (Fri hw (n+ 1/2)1) ay — Hy, (x) 


, we 


12 
Gaye eT mun /2h H, (a! 
mh V2" n! 


from which the eigenvalues and eigenvectors are directly read. 


9.3 The Dirac Delta Potential 


Consider a particle in a one dimensional Dirac delta potential Ad (x). To 
solve for the Green function Gy (at’2’t’) satisfying the differential equation 


0 he oF ry : / , 
tha + 5 ang — NO (2)| Gy (at a't’) =ihd (@—2z')d(t-¥) (9.3.1) 
Ot 2m Ox? 


we carry out a Fourier transform 


me 0 +0 , =~ 
Gy (at; 2't') = / ov SEG (aa sp°) (9.3.2) 


where Gy. (a, 2';p°) satisfies the equation 
2 92 


on aie aX) (| Ga (a,2';p°) = ind (a— 2’). (9.3.3) 


po+* 


The latter may be rewritten as the integral equation 
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Gc (ee Dp) =Gi laa ip’) 
a cee % 
— ;/ da” GS (x, 2”; p°) X 5 (x) G4 (a, 2';p°) (9.3.4) 
—oo 


where en (x, 2';p°) is the free counter part of on (a,2';p°) satisfying the 
equation 


o, % %_ |} AO FeO): ois ees 
E + - | @ (e2'se) ihd (a — 2’) (9.3.5) 


as co gq eiP (2-2 \/h 
C® (x,a";p°) = ih us : (9.3.6) 
oo 2th p . 
p> Sie 
2m 


whose solution (see Problem 9.1) is 
wi m : 
GY (ico! ep) = [350 exp (iV2mp" la — 2’ | /n) i (9.3.7) 
Upon the application of 
oy BP 
2m Ox? 


to (9.3.4) and using (9.3.5), it is readily verified that (9.3.3) is satisfied. 
The integral in (9.3.4) may be explicitly carried out to give 

s s eae x 

Gy (zen) = CL (a2) < G®. (x,0;p°) G4 (0, 2’; p°) (9.3.8) 


from which 


10 a) 
Gi, (0,2':p°) = —4 ose) (9.3.9) 
c + ip G0; 0:0) 
and hence from (9.3.8) again,we obtain 
a 7 a » 0) (10 a) 
Gy. (as 2fs0) = G9 (0, 2tsp?) <A TELS) OTP) (g.3.10) 


h vA x 
f + is G%_(0, 0:99) 


In detail, (9.3.7), (9.3.10) give 


é i/ / iy/2mp? (|2|-+]x'|)/n 
Gy (x, 2"; p°) - [ase BOBS Seattle i — Bt 
p Vp° | pik |) 


(9.3.11) 
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For A < 0, it is convenient to write 


—A® ia) _ l|Al@(-A) |A| © (—A) 
alo NA boas] are 
(9.3.12) 
and for A > 0, 
ABA) a AY) (9.3.13) 


aime SZ) aver] yore 
to obtain for all \ < 0, A > 0, 


: eels /h 
imlAl 4 d) e 


canes SER ar 


2h? 


Vf 2mp® (\e|+|x’ |) /f 
+ [any § bv ameile—a'|/n aye . (9.3.14) 
Pp arm a | [m 


where we have used the fact that 0 (A) + 0 (—A) = 1. 
For \ <0, G4 (x, 2’;p®°) develops a pole at 


9. —mA? 
P= aa (9.3.15) 
(compare with (4.2.33)),? coming from the denominator of the first term in 
(9.3.14). By adding +ie to p® in the latter denominator, in conformity with 
(9.1.12), and closing the p® contour in the complex p°-plane from below, and 
by an application of the residue theorem (see Problem 9.8), this pole gives 
rise to G, (at; 2’t’), in (9.3.2), the contribution 


2mi im |A| m|A| j im? ; 
a 6 (—A) exp - 72 (|2| + |a"|) | exp ofS (¢—t')| (9.3.16) 


leading to the bound state wavefunction 


v(x) = ¥ tea exp | _ | (9.3.17) 


h 


consistent with (4.2.27). 


3 Here we let \ carry its own sign. 
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The second expression in the curly brackets in (9.3.14) also contains useful 
information. Suppose x’ < 0, p® > 0. Then for x > 0, the entire second term 
in (9.3.14) becomes equal to 


, | 0 
mm iV 2mp? (\a|+|x |) /f = (9.3.18) 
2p A ings ae 


from which we may infer that the transmission coefficient is given by 


a ae 
ipo AL 
PTE V2 
For x < 0, we have for the reflection coefficient, directly from the second term 
within the curly brackets in (9.3.14), the expression 


(9.3.19) 


id 
2 h \ 2 
fo iIAL [me 
BORN D 


(9.3.20) 


9.4 Time-Dependent Forced Dynamics 


We consider the forced linear potential, with the Schrédinger equation 
given by 
fa) h2 62 
Moe * Im aa2 +P ©) Cas! (9.4.1) 


where F'(t) is a given c-function. 
Let 7 (t) provide the classical solution of the problem. That is, it satisfies 
the classical equation of motion 


mij (t) = F(t) (9.4.2) 
with boundary conditions taken to be 
n(t') =a',n(t)=2 (9.4.3) 


for some t > ¢’. 
We define the quantum deviation from the classical path 


Z="2-7 (9.4.4) 


and note that in terms of the new variable z, we have to make the replacement 
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a 0,020 0 .8@ 
at «Bt Ot Oz ~«6Ot Bz 


for the time derivative in (9.4.1). 
In terms of the new variable, the Schrédinger equation (9.4.1) reads 


(9.4.5) 


OP ae SO. i 30° 
ling ih Iz, + a 5,2 + (z+ 7 (t)) Fi) y=0. (9.4.6) 
Upon setting 
“) = exp (=) ® (9.4.7) 
the following differential equation for ® is obtained 
oO he 0? may? 


Here we recognize, in the last term in the square brackets, the classical La- 
grangian 


22 t 
L.(t) = mir ) en F(t). (9.4.9) 
This suggests to set 
- nt 
d= lew; | dr Le (7| x (9.4.10) 
0 
to obtain the free Schrédinger equation 
On ndl? Or 
ih— + — = >| xv=0. ALL 
f a+acpa|* : C ) 


Hence from (9.4.7), (9.4.10), the solution of (9.4.1) is given by 


wb (2,t) = (exw um (x — 7) i) (exp x i dr Le (7) y(e—n,t). (9.4.12) 


To obtain the Green function, which has the advantageous of not being 
tied up to any wavefunctions, we note that from 


w(a,t) = ie da’ G (at; 2't') w (a’;t’) (9.4.13) 


we may write for y in (9.4.12) 


vent) = (exp wna) (ex-F far Lol) 


x if dz’ G (at; 2't’) (exp — a) i') 
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x (os if dr L, «) x (a! — 1, t’) (9.4.14) 


where 1! =n (t'), f = (t’). 
But for the free Schrédinger equation in (9.4.11) we have 


x (z, t) = i dz’ @? Geet). (9.4.15) 


—oco 


Upon comparing (9.4.14) with (9.4.15), this gives 


G(xts2't) =exp ™ [le —m) 9 ~~) ef] (ow far Le(r)) 


x G° (2 —n, th 2! — 7,0’) (9.4.16) 
where 


G° («a —7,t.2' —1,t') 


(9.4.17) 
With the boundary conditions in (9.4.3), we obtain 
1/2 7 t 
Geert) = eae exp * fark (7) (9.4.18) 
; 2rih (t — t’) Ade ee ic 


On the other hand the explicit expression for 7 (7), in terms of F'(7), is 
from (9.4.2), (9.4.3) given by 


nieve far EOD OH fay KOO lt) ary 


m (t — t’) (t — t’) (t—t’) 
where ; 
K (rT) =' dr’ F (r’) (9.4.20) 
suppressing the t dependence for simplicity of the notation, and note that 
K(t) =0. 


Upon substitution of (9.4.19), (9.4.20) in (9.4.18), this leads to 


G (wt;2't') = ( = ) "exp 4 {ee 52) far en 


2mihT h| 2 T T 
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— ([ K(@) - x fw (K (r))? +vK} 
(9.4.21) 


where T=t—t’. 
As another application of a time-dependent forced dynamics, we consider 
the interaction described by the Schrédinger equation 


Ot. oe _. A gis 
ins + 5m daz 2 mux +a0F (| w(a,t) =0. (9.4.22) 


The corresponding classical solution 7 (t) satisfies the equation of motion 
m(ij+wn) =F (9.4.23) 

with boundary conditions taken to be 
VQ) 2 MO) =e (9.4.24) 


In terms of the quantum deviation z = x — 7, from the classical solution, 
and in terms of the function ® defined through 


i = exp (= i) D (9.4.25) 


(see (9.4.7)), (9.4.22) becomes 


) RPO ner mi mwn? 
ih—4 F)|@= 4.2 
At | Ima 2 ( 2 el )| B.-A) 
where ah a 
eS age 7, (9.4.27) 
2 2 
is the classical Lagrangian. 
Upon setting 
+ nt 
b= lew; | dr Le (7| XY (9.4.28) 
0 


the equation corresponding to the one in (9.4.11) becomes 


f O-, <a oe oN 


ihe, + Iman 2 ma? x = 0. (9.4.29) 


The same reasoning as given in deriving (9.4.12)—(9.4.17) shows that 


G (at; x't’) 


im . f i oft mw 1/2 
=D [(x — 1) 9 - (2’ — 1) H' (ow; | dr Le (7) ==) 
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imw AAD Gil thin nie 2 (a — 7) (2' = 77’) 
x EXP Se {[ ny) + (a — 9)" | cotwT mut 
(9.4.30) 


where we have used (9.2.23) for the harmonic oscillator Green function with 
variables z, z’, and we recall that 7’ = y(t’), 7’ =7(t), (t-) =T. 

With the boundary conditions in (9.4.24), the expression in (9.4.30) be- 
comes 


1/2 it 
gi a( = = | Le (r). AB31 
eee) Career aaa 7 eee) eer 


Upon solving the classical system in (9.4.23), (9.4.24), as done for the 
linear potential, and substituting in (9.4.31) gives after some labor 


21 a2 
G (at; 2't') = (= )"" exp ( — eae) cos wT 


OrihsinwT ‘hsinwT 2 
1 t 
— axa! + — | dr F(r)(xsinw(7 —t') +2’ sinw (t — 7)] 
Mw Jer 


1 
mw? 


i dr F(r)sinw (t— 7) [ dr’ sinw (7’ — t') F(r’) \) . (9.4.32) 


The factor exp (i fii dr Le(r) /n) in (9.4.18), (9.4.31), with the clas- 


sical action is a general property of quadratic interactions such as: 
[mw? (t) 22/2 — xF (t)] (see Problem 9.23). 


9.5 The Law of Reflection and Reconciliation with the 
Classical Law 


In this section, we use the method of Green functions to carry out a sim- 
plified, but illuminating, analytical treatment of the reflection? of a particle 
off a reflecting (infinite) plane surface which is taken to be the z = 0 plane. To 
this end, we first determine the Green function in half-space z > 0, denoted 
by GS, (xt;x’t’), with boundary conditions 

Ghee) Mtoe 0 GS. (xt;x’t’) ep 0 (9.5.1) 
where we have used the notation x = (x, y, z), and for z < 0, 2’ < 0, a particle 


is assumed not to be able to penetrate. 


* The law of reflection has received some special, rather non-technical, but never- 
theless fascinating treatment by Feynman (1985). Here we are considering mas- 
sive non-relativistic particles. 
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Since the physics of the problem occurs in the region —co < x < ow, 
-—co < y < w, 0 < z < ~w, we may develop Fourier transforms in the first 
two variables and a Fourier-sine transform in the last variable to satisfy the 
boundary conditions in (9.5.1). That is, we use a representation for the delta 
distribution given by® 


d?p ree ‘ . gz... gz! 
6° (x, x’) = io (anh? ip dqexp [ip + (r —r’) /h] sin =; sin (9.5.2) 


where x = (r,z), with r, r’ parallel to the (x,y)-plane, to obtain for G4. 
(see (9.1.12)) 


dp /f% 2 
Go, (xt;x’t! =in [ <2, / —d 
45 ) Re (27h)? P th q 


dp? eiP(r-r')/Pe-iw'T/A gz ga! 
ie onh] aes as sin = sin. (9.5.3) 
p—\|o=—+2>— — le 
2m 222m 


where T = t — t’ > 0. The latter may be integrated over p®, to yield 


2 ioe) 
GY, (xtixt) =f RS fF ag etree eri(vve")r/amn 
0 


2 (2nh)* mh 
/ 
x sin = sin — (9.5.4) 


leading finally to the closed form expression 


3/2 im|r — x’ |? 
Cxie eure aa =( ie ) sa 
+> (628) = (op nT 
im(z— 2’)? im(z+ 2’) 
9.5.5 
Ne ORT EP ORT Bo) 


The Green function Gos satisfies the completeness relation 
co 
i ar" | dG. Octee HG ar et = Co Octet) 19.5.6) 
R2 0 


where t >t” >’. 

As a simplified description, and as a working hypothesis, a particle which 
reaches within the interval 0 < z < 6, for some given small 6 > 0, providing 
loosely speaking a “skin depth” for the reflecting body, above the z = 0 plane, 
is considered to have reached the reflecting body. 


° We follow the treatment given in: Manoukian (1987c). 
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Suppose that a particle initially emitted at time t’ = 0 from a point Q at 
a height z’ >> 6, above the z = 0 plane, reaches the reflecting body (location 
within unknown), at some time, say, T’. Given that this has occurred with 
probability one, we determine partial contributions to the full conditional 
amplitude of finding the reflected particle at any given z at time t. 

To obtain the conditional amplitude of detecting a particle at some height 
z, given that the particle has reached the reflecting body, we consider, in 
the process, a Gaussian region, along the z-axis, about the point z = 6/2, 
with standard deviation o and integrate, for simplicity, symmetrically for 
the amplitude along the z-axis. We thus obtain for the latter amplitude the 
expression (T = t —T"), 


Gar ) i? Cae - Is (9.5.7) 


where, using the z-dependent part of the Green function in (9.5.5), 


— i‘ dZ im(z—Z)* im(z+Z)* 
OF” wae hee ORE xP ORT 
im(Z — 2)” im(Z + 2’) (Z=§/2)" 
x [es nT exp ORT? exp a> (9.5.8) 


Here we have extended the Z-integration beyond the region 0 < Z < 6, which 
is justified provided the integral 


1 0 (Fix) 2° i 9 (Gag / oy 
dZe = Z = 
Qo he a 20? TO / Sos 20? 


1 co 
ney: 5/20 


is small, where the first equality follows from symmetry. An upper bound to 
the integral in (9.5.9) is given by (see Problem 9.11) 


foe) 2 

=/ dZ exp —Z?/2 < 2 BSC /2  ehyoya: (O10) 

Var Ja 1 a 

For 6/20 = 10, for example, the upper bound in (9.5.10) is bounded above 

by 16 x 10774 in comparison to one for the normalized Gaussian distribution. 
The integral in (9.5.8) may be explicitly carried out by using the integral 

of three Gaussian functions 


i- exp — (z, — z)? /202 exp— (22 — z)* /202 exp—(z3 — z)” /203 


dZexp—Z?/2 (9.5.9) 


dz 
S68 V2T01 V 2102 V 2703 
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1 B 
= ex 9.5.11 
aor |-a| (9.5.11) 
where 
A= ot03 + 0303 + of07 (9.5.12) 
B = 03 (2 — 2)? +02 (2 — 23)" +0? (22 — 23)”. (9.5.13) 


Hence upon normalizing the amplitude in (9.5.7), we obtain for the con- 
ditional probability density of detecting the particle at z given that it had 
reached the reflecting body, and was emitted initially from a point of z- 
coordinate 2’, the expression (2z’, z > 6) 


4 , 
aT" o? oa? 2712 QTT' 07 zz! 2(T+T')o Be 
Va c exP ( roms cosh G cos KC] 


Ps o2z!2T2 (T+T’)20%2!2 
[exp ( C ) exp ( TRC [m2 )| 
(9.5.14) 
where 
4 "2 Qrpl2 h? 


The probability density is non-zero for almost all finite z. However for a 
large (classical) mass such that 


—o >h (9.5.16) 


with an order of magnitude reference set up by h, and a macroscopic limit 


y 


Tar min (z, 2’), (9.5.17) 


0o< 


P3 simplifies to 


(9.5.18) 


e 1 j pe Rh 
a pre THE P| Bary CP 


and is easily verified to be normalized over z. The density is highly peaked at 
the classical value for z, for small o? satisfying (9.5.17), but with m sufficiently 
large so that (9.5.16) is also satisfied. 

Now given that a particle had reached the reflecting body and is detected 
at a point (2, y,z), we determine partial contributions to the full conditional 
amplitude that the particle’s detection point has (a, y)-coordinates x and y, 
which was initially emitted from the point (2’, y’, 2’). 

Given that the above experiment has been realized, a partial contribution 
to the full conditional amplitude will be obtained as coming from an inte- 
gration over a Gaussian region in the (x,y) plane with standard deviations 
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01, 02, respectively, about an arbitrary point (%,%). In this case, using the 
x, y dependent part of (9.5.5), and the product formula for three Gaussian 
functions in (9.5.11), one easily obtains for the partial contribution to the 
conditional amplitude the expression 


'T T! 2 
emp | Tey (Ge -1) (9.5.19) 
where 
ih = TTR? 
Bo [o3 (T+) — - — | (9.5.20) 
4 2 Qs h? 


and a, in the phase factor exp (ia) in (9.5.19), is given by 


1 (a/'T+eT’ _\? 1 (y'T+yT'  _\? 
Ci ( TT! *) G7, ( PEL i) 
(9.5.22) 
The partial amplitude (9.5.19) is properly normalized. We obtain the full 
amplitude by multiplying (9.5.19) by 270102 and taking the limits 0; > oc, 
02 — ©, giving unity, as expected, since one is then covering the entire 
reflecting surface. We note that since the partial amplitudes in (9.5.19) do 
not vanish for different pairs (Z,%), the reflections may occur from almost 


anywhere on the surface. 
Again for a large (classical) mass m such that 


h 
a= —(T+T')TT’ 
2m 


T+T' 
oA oF >h (9.5.23) 


with an order of magnitude reference set up by A, the partial amplitudes 
(9.5.19) become arbitrary small for all Z, g, that is, as we move all around 
the reflecting surface, and for sufficiently small o;, unless 


wT +aT’ — yT+yT' 
pap 8 pp 


(9.5.24) 


are arbitrarily small. On the other hand, with the condition (9.5.16) satisfied 
with (9.5.17) not violated, the probability density P3 in (9.5.18) becomes 
small unless 


T 
z- 2 (9.5.25) 
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is arbitrary small. The conditions (9.5.16), (9.5.23) may be satisfied for a 
classical particle, for large m. For a sufficiently large m, which will be the 
case for a classical particle as just mentioned, the standard deviations 0, 01, 72 
may be taken to be quite small, giving a point-like impression for the reflection 
region about (Z%,Y), without violating the bounds in (9.5.16), (9.5.23), thus 
recovering the law of reflection, corresponding to the expressions in (9.5.24), 
(9.5.25) being arbitrarily small. 


9.6 Two-Dimensional Green Function in Polar 
Coordinates: Application to the Aharonov-Bohm Effect 


The two dimensional retarded free Green function is from (9.1.38) given 
by 
: 12 
m im |x — x’| 
Imiht ©? QKT 
where T = t —¢’ > 0. Working in polar coordinates, x = r (cos ¢, sind), we 
may rewrite the latter as 


GS (xt;x’t’) = (9.6.1) 


7 2 12 / / 
0 ete Tce im (r +r ) mrr’ cos (¢ — ¢’) 
Gs (x5 XE) = 5 ae oo ( DAT ie iar 
(9.6.2) 
We use the generating function of the modified Bessel function® Ij, (z) 


exp 5 (s+ :) = S- (s)* Ip (2) (9.6.3) 


k=—0o 


where 


Tx (2) =In (2) (9.6.4) 


for integer k values. 
The modified Bessel functions I, (z), with v not necessarily an integer, 
satisfy the differential equation 


ea + 2e = (274 ) L(g) =. (9.6.5) 


Upon choosing s = expi(¢— ¢’), we may rewrite the Green function in 
(9.6.2) as 


GU Gtx’) = ye ek(o-8') prt (rt; rt’) (9.6.6) 


k=—0oo 
where (T > 0) 


° The classic reference on Bessel functions is: Watson (1966). 
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m im (r?2 +r? . 
FL GET t= sanT OP ( ae ) L\x\ (—ip) (9.6.7) 
mrr’ 
paar (9.6.8) 


The expansion (9.6.6) is just a Fourier series expansion in complex form, 


where | 
oo Uik(¢—9" 
6(¢-¢)= > —— (9.6.9) 


k=—0o 


for the angular part of the Dirac delta distribution. 
We may use the Poisson sum formula (see Problem 9.13), 


Ss f(k) = s i dae SF 0h) (9.6.10) 


k=—0o k=—0co 


for a given function f (A), to rewrite (9.6.6) as 


GS (xt;x’t’) = Da HE (xt x'#’) (9.6.11) 
k=—0oo 
where 
Hp (xt;x’t') = / dX ene any Tr (ré;r't’) (9.6.12) 
and for T > 0, we have from (9.6.7) 
: 21 pl 
oe yy ™m im (r +r ) : 
Fo) (rt rt’) = amanr ex? ( AT Z| (—ip) (9.6.13) 


providing an alternative representation of the two dimensional Green function 
G° in polar coordinates. Note that exp (ik (¢ — ¢’)) Foy, and HP are not the 
same functions (see (9.6.10)). 

From (9.6.11), (9.6.12), we may infer that the amplitude for a particle to 
go from (r’, ¢’) to (r, 6) may be written as a sum of amplitudes, each specified 
by an integer k. For k 4 0, k # —1, an amplitude in question corresponds 
to one to go from (r’,¢’) to (r,) by winding around the origin exactly |k| 
times on the way to the point (r,¢). The sense of a rotation is specified by 
the sign of k with a c.c.w. one for k > 1 and ac.w., one for k < —2. Fork = 0, 
k = —1, we have a c.c.w., c.w. rotations, respectively, not making full circles 
around the origin. A given integer is appropriately referred to as a winding 
number. 

The differential equation satisfied by G% (xt;x’t’) including the factor 
0 (t — 1’), in polar coordinates is given by 
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0, (10 0,12 ak 
ling (+7, "Or | 7 a) | vere 
=ih oar) 5(@-¢')6(t-t), (9.6.14) 


where we have used the definition 


6&2 (x —x’) = eee) 5(¢-#), (9.6.15) 


r 


implies from (9.6.6) and (9.6.9), (9.6.13) that 


AiO (gee lees Sa 6 aa 4 ih O(r—r’) ; 
ins "2m €E Or’ ar =) | Hg, ee) 20 r oes) 
(9.6.16) 


We now apply the polar coordinate decompositions carried out above to 
study the celebrated Aharonov-Bohm effect.’ 

We consider an “infinitely” long, current carrying, solenoid of circular cross 
section of arbitrary small radius ro — 0, placed all along the z-axis, and, in 
polar coordinates, we take for the vector potential outside the solenoid (see 
Problem 9.14) the expression 


® . 
=,6 (9.6.17) 


where o is a unit vector in the direction of increasing of the angle ¢; ® is 
the flux 


= fs . ds (9.6.18) 


and B is the magnetic field inside the solenoid, with the surface integral 
carried over the cross sectional area of the solenoid which survives for rp — 0. 
The magnetic field outside the solenoid is zero, and one explicitly checks from 
(9.6.17) that V x A = 0 there. 

Although the magnetic field is zero outside the solenoid, the line integral 
over a closed path, through which passes the solenoid, is not zero, and is 
given by 


fa -dx = 6 (9.6.19) 


with dx = @rdd¢+#dr, or as obtained directly from (9.6.18) by the applica- 
tion of Stokes’s theorem. Because of this non-vanishing flux, the latter may 
have an observable effect, outside the solenoid, even though the magnetic 
field is zero there. This is the Aharonov-Bohm effect. 


” This was also studied in the path integral formalism in Gerry and Singh (1983); 
Schulman (1971). 
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To study this effect, we consider the Green function of a charged particle 
of charge q in the presence of the vector potential A. It satisfies the two 
dimensional differential equation 


qd 2 
9 (p- 4A) Boe Noe, lees 
ihn mn G (xt; x’t’) = ih 0° (x — x’) d(r—-1’) (9.6.20) 


where c is the speed of light. 
For the vector potential in (9.6.17), and in polar coordinates, (9.6.20) 


reads 
7a aie | GC Se Sa 
OE Im ror! Or) | 72 Od ee 


G, (xt; x’t’) 


= ih oer) 5(@-¢')6(t-t/) (9.6.21) 
where ‘ 
3s 
= (9.6.22) 


expressed in terms of the flux ©®. 
To solve for Gi, we carry out a Fourier series as in (9.6.6): 


Gy (xt;x’t’) = S- elk(¢-0') pt (rt; r't’) (9.6.23) 


k=—0o 


to obtain for F,* (rt;r’t’) the differential equation 


,0@ Wild a 1 ee 
(35; "2m Ee "Or 7? (& — Ao) |) (rt;r't’) 
ih 6(r—r’) 
on = Pe) (9.6.24) 


Upon making the ansatz 


: 24 ,l2 
Fe (rir) = sar exp aoe FY (k.2) (9.6.25) 


where z = —ip (see (9.6.8), (9.6.7)), we obtain from (9.6.24) the following 
two equations (see Problem 9.15): 


. m im(r? +r? ih o(r—r’ 
ih 6(T) ae oo es 7 Ft (k, z) = —6(T) ( : ) (9.6.26) 
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and for T > 0 we have 


G Dee [2 + (k 20)"]) Ft (k,z) =0. (9.6.27) 
dz? dz 

Equation (9.6.26) provides a boundary condition constraint as we will 
see below, where we recall that z depends on T, while (9.6.27) provides the 
solution F'* (k, z). 

The solutions which satisfy (9.6.27) are Ij,) (z), I_-\,\ (z), jv) (z), where 
v = (k— Ao). Kj, (2) diverges for z — 0, ie., for rr’/hT — 0 and hence at 
the origin. For vy = n, an integer, I_),; (z) = Ij, (z) (see (9.6.4)). For v not 
an integer I_},) (z) also diverges for z — 0. Hence we are left with Jj,) (2). 
Therefore it remains to verify that the asymptotic behavior of Jj, (z) for 
T — 0 is consistent with (9.6.26). 

With z = -ip = —imrr'/hT, r > 0, r’ > 0, and hence p is real and 
positive, the asymptotic behavior of Jj, (z) for T — 0, is given by® 


° i i 1 i(p—|v |r 
Iwi (~ip) A oa e os eles ] (9.6.28) 
: 2 —ilyv|ar 1 T 
I, (~ip) = le. eil¥I"/2 cos (o- (+5) 5): (9.6.29) 


Multiplying the expression on the right-hand side of (9.6.28) by 


m im tr + | 
z exp: | -——_ 
2nihT 2QhT 


for T — 0, we obtain for the resulting expression the behavior 


1 1 
an rr’ 


which is consistent with (9.6.26) since for r > 0, r’ > 0, the second term in 
(9.6.30) is zero. For p > ~, ie., p large, but T # 0, both terms in (9.6.28) 
contribute since the factor (m/2mihT) in (9.6.25) would not be relevant in 
this case. 

For T > 0, the Green function in (9.6.23) may be then rewritten as 


c (r—r’) + 5 (r+r’) ane (9.6.30) 


Figs 00 
m im (r* +97 ik(d—d! ; 
Gy (xt;x't’) = SanT oP (eo) 5 SRURe )Ne-ro| (—ip). 


k=—oco 


(9.6.31) 


* See Watson (1966), p. 203; Gradshteyn and Ryzhik (1965), p. 962. Ij.) (—ip) is 
related to J\,| (g) with the latter having the familiar cosine behavior, divided by 
/xp/2, for p — oo (op. cit. p. 203, p. 952, respectively) (see Problem 9.16). 
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From the definition of Xo in (9.6.22) we note that if 


go; 
Qnhe 


(9.6.32) 


is an integer, i.e., flux is quantized, then by making a change of the sum- 
mation variable k > k+ N, G, (xt;x’t’), up to the overall phase factor 
expiN (¢ — ¢’), coincides with the free propagator in (9.6.6). Interesting sit- 
uations arise when the flux is not quantized, that is when Ag is not an integer. 
In such cases, we may write 


do = N +60 (9.6.33) 


where N is some integer, 


0<d9<1 (9.6.34) 
and (9.6.31) takes the form 


lg hycs oe im (r? +r?) iN(¢-¢') 
GE REE 5 age oP ( QAT 7 


x yo e*(#-#') F454) (—ip) (9.6.35) 


k=—0o 


where now |k — do| > 0 for all k and I\,_5,| (—i) vanishes for p > 0). That is, 
each term in the summand in (9.6.35) has the built in vanishing property for 
r — O and/or r’ — 0, i-e., at the origin, thus making the solenoid a forbidden 
region, for the case in (9.6.33), (9.6.34), for each winding number. 

By using the equality in (9.6.10) and making a change of the continuous 
variable A > + Ao, (9.6.31) may be rewritten in the form 


im (r? +r’? ; , 
Gy (xt; x't’) => ue exp (“Ge eiro(o-¢ ) 


2nihT 2AT 


x Se (civ) i 7 diexpir(¢— ¢ + 27k) I) (-ip). (9.6.36) 
k=—0o seed 


This expression is to be compared with the free one in (9.6.11)—(9.6.13). 
The term exp irg (¢ — ¢’) is nothing but the exponential of ig/fc times the 
integral 


(7,¢) 

/ A-dx = ©(¢-— ¢’) /2n (9.6.37) 
(r',’) 

obtained by integrating along a direct line segment joining the end points. 

This term is also obtained by a naive examination of (9.6.21). The first factor 

in the summand in (9.6.36) is the k*” power of the exponential of (ig/fc) 
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times the flux ©. That is, the amplitude for the particle to arrive at (r, ¢) is, 
up to the phase factor exp iro (¢ — ¢’), the sum of amplitudes of arrival at 
(r,d) after an arbitrary number of rotations around the solenoid acquiring 
additional phase factors (exp ig®/ he)* with the sense of a rotation specified by 
the sign of k. For a quantized flux defined in (9.6.32), these phases disappear. 
On the other hand for a non-quantized flux, as given in (9.6.33), (9.6.34), the 
mere presence of such, in general, different phases may have an observable 
effect even though the magnetic field is zero outside the solenoid! This is the 
celebrated Aharonov-Bohm effect. The importance of the phase factor 


exp (Zpa ; ax) = exp (40) (9.6.38) 


cannot be overemphasized. Such a phase factor is usually referred to as a 
Wu-Yang? phase. 

Now we come back to our expression in (9.6.35). In practice, one is in- 
terested, in general, to the limit mrr’/AT — co involving large distances 
r — oo and/or r’ — oo, with T 4 0. In such cases we may use the asymp- 
totic behavior given in (9.6.28) to write formally for p > co, with T #0 in 
(9.6.35), 


2Tp  in(o—o! . 
Va OH) Tg tq) (He) 


co co 
S33 3 cik(o-o’) 4} ip ‘ eik($-9") @ilk—dolm 
1 


k=—0o k=—oo 


; We a Niok 
— 97-0 _ ah). = ele ik(¢—4") ,-i]k—So|m 
2re 3 (o—¢') + 7 e ) e e (9.6.39) 


k=—oco 


where p — co means p large. Here we note that since we are considering the 
case T # 0, the second term on the right-hand side of (9.6.39) does not give 
rise to a 6(r +r’) term to the asymptotic behavior of (9.6.35) and gives a 
non-zero contribution. 

As an application of (9.6.35), with the asymptotic expression in (9.6.39), 
we consider the case in (9.6.33), (9.6.34) with N = 0,0 < 69 < 1. [The 
situation with —1 < 69 < 0 may be handled similarly. | 

To the above end, suppose, for the purpose of a simple but concrete illus- 
tration, one has an initial state wo (x’) which is uniform in ¢/:-A < ¢’ <A 
for some small angle A, and is zero otherwise, with |x’| = R a fixed large 
radial distance from the solenoid with the latter located at the origin 0 (see 
Figure 9.3). This will allow us to sum the series on the extreme right-hand 
side of (9.6.39) explicitly. That is, equivalently one has an extended uniform 


° Wu and Yang (1975). 
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source, rather than a point-like one, which initially emits particles. The source 
is situated at a fixed large radial distance from the origin and is described by 
an arc subtended by a small angle 2A symmetrically about the angle zero, 
say, with 0< A « 7/4.1° 

To determine the probability of particle detection, for T 4 0, we must then 
integrate, in the process, over ¢’, on the extreme right-hand side of (9.6.39), 
from —A to A. That is, one is summing over all initial configurations and 
hence equivalently adding up all amplitudes from every point on the source 
and ending up at the observation point in question for some T > 0. 

Thus the amplitude for a particle to be found at an angle ¢, for |x| = 
|x’| — oo, is up to an unimportant multiplicative factor independent of do, 
with @ not in the range (—A, A), is given from (9.6.39) to be 


"| 


Co 


- ind SKA ie sole 
A= Je ae 0 (9.6.40) 


k=—0co 


We note that for ¢ not in (—A, A), the term in (9.6.39) involving 6 (¢ — ¢’) 


SOURCE 


Fig. 9.3. The solenoid is placed at the origin 0, and the extended source is 
described by an arc subtended by a very small angle 2A symmetrically about 
the angle zero. The detector and the source are at large radial distances 
|x| = |x| = R — o0 from the origin. 


does not contribute. Also note that sinkA/kA — 1 for k > 0. 
Since 0 < do < 1, (9.6.40) may be rewritten as 


Re sinkA ag sinkA 
A= —idom —ik(¢—7) sin idot ik(@—-7) 6.41 
e 28 a +e me hee (9.6.41) 
where note that e~!** = ei*7, 
To verify the Aharonov-Bohm effect, it is sufficient to look at the point 
@=7 in (9.6.41) (see Figure 9.3.). To this end we use the sum™ 


10 Tt is worth noting that a non-zero integer part N of Aj may contribute in specific 
situations. 

' For the summation of such trigonometric functions see: Gradshteyn and Ryzhik 
(1965), pp. 38-41. 
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TA ok (9.6.42) 
k=1 
to obtain from (9.6.41), at ¢= 7, 
2 
|A? = ($) cos? (5g) + sin? (5pm) (9.6.43) 


giving the relative intensity, in the presence of the solenoid with 0 < 69 < 1, 
the result 


g 
cos? (dom) + (=) sin? (dom) <1 (9.6.44) 


where A/7z is some small number. The mere fact that the intensity is altered 
(reduced) is a statement of the existence of this effect. 
It is remarkable that the expression for the amplitude A in (9.6.41) may 
be summed exactly. To this end it is sufficient to consider 7/2 < ¢ < 37/2. 
We use the identities 


m/A, m-A<@<7+A 


loc) . kA 
1420s cosk(¢—m) = 
k=1 0, m/2<¢<n7-A,t7+A<¢< 37/2 


(9.6.45) 
|. r/2<$<3n/2 


“sinkA . re 1+ cos(¢@+A) 
a A snk (O n) = zen | eee 


k=1 
(9.6.46) 
being, respectively, even and odd functions of (¢ — 7), to obtain from (9.6.41): 


aan eee _ Sin (d07) 1+cos(¢@—A) ‘i oie 
|A|" = (F cos (dom) 4 5A In E = a es)) + sin” (do7) 
(9.6.47) 


for 7 -A << a+A, which reduces to (9.6.43) for ¢ = 7, and 
1 1+cos(¢— A) : 

1a (mE) oa 
for 7/2 << 6 < wm-A,7+A < ¢ < 37/2. It is interesting to see that 
(9.6.48) gives a non-vanishing contribution, in the regions thus defined (see 
Figure 9.3), due to the presence of the solenoid with 0 < do < 1. 

The densities in (9.6.47), (9.6.48) are plotted in Figure 9.4 for 6) = +1/4, 
A = 7/100. With a uniform initial wavefunction with sharp cut-offs at ¢/ = 
+A adopted, these densities become arbitrarily large in the limits @ — 7 > 
+A, in their respective intervals, which is the price one pays with an idealistic 
initial wavefunction (see also Problem 9.18). In the region 7—A < 6 < 7+A, 


just around the point ¢ = 7 (see also (9.6.44)), the density in question is 
reduced from one over the solenoid-free case (equivalently for 69 — 0), while 


|A|? = sin? (6p77) 
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Fig. 9.4. Plots of the densities |A|? in (9.6.47), (9.6.48) for 69 = +1/4, 
A = 7/100 in the regions r-A < @< 7+A, 7/2 < 6 < T-A, 
m+A < ¢ < 37/2, respectively, in the graphs (a), (b), (c), with ¢ expressed 
in radians. 


outside this interval the intensity, as mentioned above, is non-zero but rapidly 
goes to zero to the right and left of +A, a—A, respectively. The Aharonov- 
Bohm effect (with d9 4 0) is clearly seen. A similar analysis may be carried 
out for -1 <6 <0. 

We note that the illustration of the Aharonov-Bohm effect given above is 
based on the scattering of the charged particle off the origin due to the non- 
zero flux only, i.e., is purely of electromagnetic origin in a quantum setting. In 
Problem 9.17, the reader is asked to extend this analysis, to the more difficult 
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case, when a particle may be scattered off the origin (by the solenoid), even 
for d9 — 0, by the application of appropriate boundary conditions at the 
origin, and study how this scattering is modified by the activation of the 
solenoid, i.e., for d9 > 0, as an illustration of the Aharonov-Bohm effect. 


9.7 General Properties of the Full Green Functions and 
Applications 


9.7.1 A Matrix Notation 
We use the following convenient matrix notations as 


(x|G(D) x’) = Ga(x T;x’0) (0.7.1) 


(x|1|x’) = 6”(x — x’) (9.7.2) 


with G(T) satisfying the differential equation 


lin - n| Gi(T) = +ih1 0(T). (9.7.3) 
OT 

As before, the Green functions G+ considered are those with built in bound- 

ary conditions as specified by the +ie prescription adopted and spelled out 

below. This equation may be integrated directly but it is more instructive to 

proceed as follows. We introduce the Fourier transform 


Sass ii dp? —ip°T/h ) 
G(T) = +ih if ake Gs (p") (9.7.4) 
where 
[p° — H] Ga(p°) =1. (9.7.5) 
Upon introducing the resolution of the identity (see (1.8.18), (1.8.14)) 
= if dd 6(\— H) (9.7.6) 
we have from (9.7.5) 
fo a 
Gi (p°) = ———— d\ d(A- H 9.7.7 
0= f GayewOIA-H. 77) 
po f ddd(A—- H) ig 
ee —ip’T/h 
Gs(T) = +ih ie anh | (hae ye (9.7.8) 
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It is easily shown that the denominator in (9.7.8) (see Problem 9.20) may 
be written as 


1 22 oo aeons 
(po =e) | da exp [+i(p" — A + ie)a] (9.7.9) 
from which 
Gi(T es n | dX 5( A Hf) ) fda exp | ia(A ic)] 
Anes 4 


f ap? eit at 
x i anh ox? [tip?(ha = T) /h| 


co eo 1 
2 / Mohan SOSH | das(a + z) _ (9.7.10) 
0 
Since a is positive, G(T) is zero for T < 0, and G_(T) is zero for T > 0. 
That is, upon integration over a, (9.7.10) is equivalent to 


Gs(T) = (47) dd 5(A — H) exp [-iT (A ¥ ie) /f] (9.7.11) 
G(T) = O(£T) exp [-iTH/hi] exp (—e|TI). (9.7.12) 


For ¢ — +0, it is easily verified that G(T) satisfy (9.7.3). 
We may rewrite (9.7.11) as 


G(T) = O(£T) G(T) (9.7.13) 

where 2 
G(T) = / dA 6(\ — H) exp(-iT/h) (9.7.14) 

for ¢ — +0. . 

From (9.7.1), we also have 
Gs(x T;x’ 0) = O(+T) G(x T;x’ 0) (9.7.15) 

and as in (9.5), 

G(x T;x’ 0) = / dA (x|6(A — H)|x’) exp(—iTA/h). (9.7.16) 


Complex conjugation gives the relation 


(G(x T;x’ 0))” = G(x’, -T; x, 0). (9.7.17) 
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9.7.2 Applications 


Let 
T/h=T (9.7.18) 


in (9.7.16) and consider the integral 


1 fd | 
Txsé) = f - G(x fir;x’ O)el®™, =e — +0 (9.7.19) 


271 T— Ile 
=O. 


for real €. 
From (9.7.16), the latter is given by 


I(x, x36) = / dd (x |5(A — Hx’) = / dT) (9.7.20) 
271i T— le 
For \ > €, we may close the contour of the integral in the complex 7-plane 
from below and obtain zero from the residue theorem since ¢ — +0. On the 
other hand, for \ < €, we may close the 7-contour from above to obtain, from 
the residue theorem, that the 7-integral multiplied by (1/277), gives one. The 
latter just provides the definition of the step function O(€ — A). That is, 


Co 


Cee / dd (x|6(A — Hx’) O(E — d) 


—Co 


é 
= J aaca - ax) 


—co 


= (x|Pu(O)|x) (9.7.21) 


where in writing the last equality we have used (1.8.14). 

Suppose that for A < €, we are dealing with the discrete spectrum of 
the Hamiltonian in question, and € is not in its spectrum, then from (4.5.3), 
(9.7.21) for x = x’, we have the useful relationship 


1 dr ier 2 
Ori ie G(x, hr; x, Oe 6 = oS [ervey (x)| O(€ a r) 
205 A,v(A) 
= n(x; €) (9.7.22) 


defining a density of states, where 


Pr,veay x) = x|A,4(A)) (9.7.23) 
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are the eigenstates in the x-description, and v(A), not to confuse it with the 
dimensionality of space v, specifies the degenerate states corresponding to 
the eigenvalue 4. 

Upon integrating (9.7.21) over x, for € not in the spectrum of H, we 
obtain 


1 , dr . 
i eae : ifr = 
Jc x5 f Gx, hrix, Oe 2 a 6) 


—oo 


N(H; 8). (9.7.24) 


That is, the operation defined on the left-hand side of (9.7.24) gives the 
number N(H,€) of eigenvalues of H, taking into account degeneracy, less 
than €. The importance of (9.7.24) cannot be overemphasized. 

To find the number of eigenvalues, taking into account degeneracy, falling 
between two points £1, £2, £2 > £1, with these two points not in the spectrum 
of H, may be obtained from (9.7.24) to be given by 


1 , dt : . 
Vv : i€2T _ ,i€iT 
fa x 55 / ome G(x, hr; x, 0) (e emit) (9.7.25) 


From (4.5.6), (9.7.19), (9.7.21), 


1 7, dr ier 
an 7 eo hr; x',O0)es7 = = Sc Wr,vy (x Wray x')O(E — A) 
268) A,v(A) 
= n(x, x’;€) (9.7.26) 


defines a non-local density of states. This expression will be used in Chap- 
ter 13 dealing with multi-electron atoms. The following property obtained by 
complex conjugation is to be noted 


(n(x, x’; £))* = n(x’, x; €) (9.7.27) 


as obtained, for example, from the expression on the left-hand side of (9.7.26) 
and (9.7.17). 
From (9.7.16) 


) Yeax ft 
and with 
Pp 29 


we have from (9.7.16) 
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Ee - v(x)| G(x, hr;x’,0) = / dd [A — V(x)] (x |5(A — H)|x’) e174 


—a 


Co 


= fo (x |(H — V(x))6(A — H)|x’) e7'™., 


(9.7.30) 


From (1.8.14), (4.5.14), (4.5.15), we may use (9.7.30) to infer that 


1 7 dr, 0 
Vv i€r |; a ltggl 
fa x 55 ‘ = ie is. v| G(x, hr; x’, 0) 


x/=x 


= S- (r v(A) Ea A, v(2)) @(€ — A). (9.7.31) 


A,v(A) 


This expression will be also useful in studying multi-electron atoms in §13.1 
(see also (4.5.15)). 
A similar expression to (9.7.31) as follows from (9.7.28) is 


[ex = | ae i F-Gte trix, 0) = S> rA@(€-A) (9.7.32) 


Ti Tle 
_ A,v(A) 


giving the sum of eigenvalues of H less than €. 

Before considering specific cases, we note that, if Ao is an eigenvalue of 
AT and 9,1, not in its spectrum, are such that €; < Ap < €2 and that there 
are no other eigenvalues between £; and & then (9.7.25) gives the degree of 
degeneracy of Ao. In detail, from (9.7.24), (9.7.25), this may be written as 


271 Tle 
=O. 


ir ed 
Jos = 7 G(x, hr; x, 0) (e827 — 7) = SO. (9.7.33) 
(Ao) 


An elementary, though rather formal application of (9.7.22), since the 
latter was written for a discrete spectrum, is to the free electron gas. In 3D 
we have 


: 0 p? / : 3 / 
Go,0'(X, At; x’, 0) = idg,0°5(7)6°(x — x’) (9.7.34) 


where o,0’ are spin indices. From (9.7.15), (9.7.34), we have 


based = d°p ip:(x—x’)/h .—ip?r/2m 
Go,o'(X, fit; x’, 0) = O0,67 Orne nye e ; (9.7.35) 
TT 
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Upon setting 
G(x, hr; x,0) = S$” Goo (x, hr; x, 0) (9.7.36) 


we have from (9.7.22), (9.7.35) for the density of states 


d?p 1 Peo de : 2 
nes i(€—p? /2m)r 
n(x; €) / (Qah)? Qni f. a 


d°p pe 
= 2 | (anh)? € F) (9.7.37) 
leading to the familiar expression 
1 (2me\3/? 


As an illustration of the formula (9.7.25), consider the Green function in 
(9.2.23) for the one-dimensional harmonic oscillator. Then from (9.2.23) 


G(x T;x 0) = (-_) eae ( ee tan (F)) (9.7.39) 


2rihsinwT 2 
and ocivT/2 
ie dz G(a T;x 0) = [oe (9.7.40) 
Hence 


i aes | an G(x hr; x 0) (ei27 — e'617) 


ee) 271 got =e 
1 °° hdr evihwr/2 , 
_ ifoT i€1T 
Ori I. T — ie (1 — e~ihwr/2) (en? ese) (9.7.41) 
which upon using the expansion 
ev ihwr /2 


<= 7 re => e thw (n+1/2)r (9.7.42) 
e- NWT 


and the integral representation of the step function 


@(€) = af. OT _ gtr (9.7.43) 


se HIE 


we obtain for (9.7.41), the explicit expression for the number of eigenvalues 
between £; and 9: 
S- de huw(n + 1/2)) — O(€1 — h(n + 1/2))] . (9.7.44) 


n=0 


Other properties of the Green functions will be considered in other chapters 
as well. 
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9.7.3 An Integral Expression for the (Homogeneous) Green 
Function 


From (9.7.16), (2.4.1), we may write 


G(x T;x'0) = | Pai (p|x’) 4 * i GelGOe= Dip ep ATA 


Qh) 
(9.7.45) 
in v dimensions, which satisfies the homogeneous equation (9.7.28). 
Using (2.4.8) and defining 


F(p,x,T) = e1P*/? eip*T/2mh i dd (x|6(\ — H)|p)e7'7*/" (9.7.46) 


with the latter being independent of x’, we rewrite (9.7.45) as 
. = / 
G(x hr; x'0) = | om mea 


= f(x —x’,x,hr). (9.7.47) 


= p’r/2m F(p,x, fir) 


| expi 


We introduce the variables 


G=x-x’ (9.7.48) 
n=x (9.7.49) 
noting that 
) fs) 3) 

= 9.7.50 
Vi = VE4 V5 +2Ve-Vn (9.7.51) 

and rewriting (T = hr) 
G(x hr;x'0) = | om elP-¢/h—P*7/2™] (yy, ir) (9.7.52) 

d (2rh)v b ? ati 


we have from (9.7.28), (9.7.29) 


i i d’p’ cilp’-¢/h—p’?r /2m] 


Oe ee all '.V eye V(n) >} F(p',n,hr). (9.7.53) 
ae Ub p 7 Qu n N p Ns T). shes 
Upon multiplying the latter by exp(—ip- C/A) and integrating over € (or 
multiplying by exp(ip - x’/h) and integrating over x’ ), we obtain 
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gee V 4 ey _ vin) F( hr) =0 (9.7.54) 
2ae ue nN Qu n N p,n, T) = VU. ae has! 


It is more convenient to have an exponential representation 
F(p,n, hr) = exp ( _ iU(p,n, 7, h)) (9.7.55) 


and rewrite (9.7.47), (9.7.52) as 


lg . 2 
Goxhrx'0) = | BR exp if? :-F| exp ( —iU(p,n,7,h)) 


(2rh)” h Qu 
(9.7.56) 
with the initial condition 
U(p,n,7,h)|,_, =0 (9.7.57) 
so that 
G(x hr;x’0) oe 6”(x — x’). (9.7.58) 


From (9.7.54), (9.7.55), the following differential equation for U(p,n, 7, A) 
is obtained 
) hp noe h? 
—U+—-V,U -—V,U- —V,U-V,U -V| =0 9.7.59 
Or + Lb y Qu nN Qu yn y ( ) 
with U satisfying the initial condition in (9.7.57). 

In (9.7.54)—(9.7.59), we have used the parameter 7 needed in the applica- 
tions in (9.7.22), (9.7.24)—(9.7.26), (9.7.31)—(9.7.33). Some of these applica- 
tions will be considered in the next section as well as, later on, in Chapter 13 
in the study of multi-electron atoms. 


9.8 The Thomas-Fermi Approximation and Deviations 
Thereof 


In view of applications of the formulae (9.7.22), (9.7.24)—(9.7.26), (9.7.31)— 
(9.7.33), written in terms of integrals in the integration variable 7, we wish 
to investigate the expressions in (9.7.56)—(9.7.59) for G (x fit; x’0) on its de- 
pendence on fi for a given 7.1? 

To the above end, to the leading order in h, we have from (9.7.59), 


a 
5, U-V x0 (9.8.1) 


or 


12 Note that in this section, G(xhr;x’'0) is expressed in terms of the parameter of 
interest 7 and not T. 
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(see (9.7.57)), leading from (9.7.56) to 


dv . a 2 ? 
stag p p G 
G (xfir;x'0) = / nh” yp exp 1 e- a e,| exp (—iVT) 


pv fule—x? 
= (<4) exp 1 (Sect -—V (x) TE (9.8.3) 


which will be referred to as the Thomas-Fermi semi-classical approximation. 
Upon substituting (9.8.3) for x = x’ in (9.7.22), for example, we obtain 


nose) = [ P, (¢-vx)-F) (9.8.4) 


which for v = 3, gives 


= (74 = es (9.8.5) 


ne) = he 
This semi-classical solution is usually referred to as the Thomas-Fermi ap- 
proximation. 

Several applications of the expression in (9.8.3) will be given in our study 
of multi-electron atoms in Chapter 13. 

To find the deviation of G(xfr;x’0), for example, from the Thomas- 
Fermi semi-classical approximation, in view of the applications mentioned 
above, we write for U satisfying (9.7.57), to order h? and a given 7, 


U =Up +h 6U, + h75U2 (9.8.6) 
thus obtaining from (9.7.59), 


ft GE RAO (9.8.7) 
OT m 


written in terms of the variable x (see (9.8.2)). That is, 
72 
dU, = —— p- VV (x). (9.8.8) 
2h 


For dU, we get 


a eee 


2 
Or Qn” 2p? 


2 
WV + om (VV)? + iB (p-V)?V (9.8.9) 


giving 
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. 3 3 
1 2 2 oi 2 T 2 
2 ae Oss 8.1 
Wig VV eel) ee Y): (9.8.10) 
We set 
: . ; . 3 : 
A a 6a 4 7 _ Pyivky (9.8.11) 
Qu 6p? 
at 9 Sa 
Bi = ar PVviv (9.8.12) 
‘p BE 
of =— (at) (9.8.13) 
and note that 
ee eon le ae it pw (9.8.14) 
~ lou Qu 6p? Pia 
and pe 
1 2u\" Ronee 
=( ee kal 9.8.15 
Jaa eae (os 


to order h?, for a given rT. 
From (9.8.6), (9.8.2), (9.8.8), (9.8.10)-(9.8.12), we may write for 
G (x fir; x0) in (9.7.56) 


. if? 55 Te 2 
exp | —i | V (x) Ba aa Vv aa 7 (VV) 
x | Pe exp — (p' A?*p* + Bip’) (9.8.16) 
(Q7h)” 


with a summation over repeated indices understood. 
Upon changing the integration variables in (9.8.16) as 


p > p' +03 (9.8.17) 
we have 
p) Al®p* + Bip) — p) Ai*®p® + (BIC! + C1 AI*C*) (9.8.18) 
where 


BIC) + CUAIRCE = > Bici 
2 


2 2 
w LPT gy (2H gik) BT gay 
4 2p iv 
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= (VV)? (9.8.19) 


to order h? for a given rT. 
Finally we use the integral 


v/2 
d’p exp (=p! Aitp*) = 9.8.20 
: p exp (—p’ A?*p") acGa ( ) 
to obtain from (9.8.16), (9.8.18), (9.8.19), (9.8.15) 
i al Pode AP vein ge ete ET 2 
G(x hr;x 0) ~ (54-) e lav a 
(9.8.21) 
The latter may be also rewritten as 
her? ih?r 2 
hr; ete = (ey 
G (x hit; x 0) | + 1D (V°V) Dy (vv) 
x | ee exp —i [p?/2u + V (x)] r (9.8.22) 
(27h)” 


giving the following deviation from the Thomas-Fermi semi-classical approx- 
imation 


5G (xhi7;x 0) > (v7) = 2 (VV)? 
x / Son exp —i [p?/2u + V (x)| 7 (9.8.23) 


for a given T. 
The above expression will find an important application to multi-electron 
atoms when, in the process, we carry out the r-integral in (9.7.32). 


9.9 The Coulomb Green Function: The Full Spectrum 


9.9.1 An Integral Equation 


The Green functions G4 
equation 


Oho, 
iia + 7, V" - VO0)| G3 


(xt;x’'t’) for a given potential V(x) satisfies the 


(xt; x’t’) = +ih 0°? (x —x’)d(t-t') (9.9.1) 


ot 


in 3D, and their free counterparts (89.1), with corresponding boundary con- 
ditions, satisfy 
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. a he 2 0 ate A Le 3 / / 
noe y | G2 oxtsx'e) = tin (x —x’)d(t-1¢). (9.9.2) 


The solution of (9.9.1) may be written as 


Gx (xt; x't’) = GY (xt; x’t’) 


¥ dx! dt” OD (xt xt") V(x") Ce (x x't) (9.9.3) 


as is easily verified. We introduce the Fourier transforms 


G4 (xt; x’t’) _ an fs eo pee 'G + Gs (Pp p’;p’) 
x oI - rh Qrh)°® x be] d 


x eip-x/h—ip’-x’ /h,—ip®(t-t’) /h (9.9.4) 
3 3y/ 63 rr 
Go(sexf) = anf $ ee ies ieee 5 P’) 
5g = Qrh Qnh)° p? ns Pp cae 
Qu 
X @iPX/hg ip! x! /Re—ip? (tt!) /h (9.9.5) 
For the potential in question 
» 
V(x) = re r = |x| (9.9.6) 
3 ip-x/h 
a ant? - a — (9.9.7) 
Ls 1 
we then obtain 
Giggs = B(p-p) ama | dp" Gs (p", psp!) 
= ) ’ ‘ p” ne h (2n)° (p _ p”) 4 p? 
ma oat P-5 +e 
LL 2h 
(9.9.8) 
It is convenient at this stage to set 
2 
pa eo, (9.9.9) 


2p 
This procedure, as we will see later, applies to the continuous spectrum 


as well.!? For simplicity of the notation we write G4 (p,p’;p°) simply as 
G (p, p’) up to (9.9.29). 


'3 We use the Schwinger representation of the Coulomb Green function: Schwinger 
(1964). 
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We introduce the unit vectors in 4D-Euclidean space 


2 2 
Po—-P* pop ) 
T= ; 9.9.10 

(3 + p?’ p2 + p? ( ) 


w=l (9.9.11) 
with the latter equality easily verified. 
Let : ; 
- 2 
cay = PP any = CPO (9.9.12) 
Pot+Pp Pot+Pp 


Upon differentiation of cos y, as a function of the variable |p| = p, we obtain 


Apps, 


“1 — dp (9.9.13) 
(p2 + p2)” 


siny dx = 


which upon multiplying by sin y sin 6 dé d@, leads to 


2 2\ 3 
dp = (PtP \ an (9.9.14) 
2D0 


where dQ is the solid angle element in 4D (see Appendix III, (II1.8)), 
dQ = sin 6 dé désin? x dy (9.9.15) 


whereO< y <7. 
From the property of the Dirac deltas 


[eee (p—p’)=1= fan (2-0) (9.9.16) 


we may infer from (9.9.14), that 


po \* 
(p—p)= ( ) 6(Q—-’). (9.9.17) 
P+ BG 
In detail, 
6(6-O)d(d-P)d(x-x’ 
sin 6 sin“ x 
If we set ; a ; 
if POE, SAPOR 
= : 9.9.19 
” (5 +p?’ p6 va) ; 
one readily obtains 
1 Ap? 1 
= af (9.9.20) 


(p—p’)? (Pp) +P?) (Ps +P?) (N—-1/)? 
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The properties in (9.9.14), (9.9.17), (9.9.20) simplify the form of the in- 
tegral equation in (9.9.8) and the solution will be then read off from the 
analysis provided on the Poisson equation in 4D in Appendix III. 

To the above end, set 


(p2 + p?)* G(p,p’) (vB +0)" 


A (0,0) = 
16 up) 


(9.9.21) 


then we see upon multiplying (9.9.8) by 


(v5 + 9°) (w+?) 
16pup5 


and using (9.9.14), (9.9.17), (9.9.20), the integral equation (9.9.8) becomes 
replaced by 


A 
A (9,0) = 6(Q—29') —_ i a“ neh (@"/). (9.9.22) 
m™hpo J (1-9 


The above may be also rewritten as 


1 // I Au 1 N Ph a, / 
feo [aie SD are se 0) =6(Q—). (9.9.23) 


On the other hand, from (III.3.11), (If.3.12) and (III.3.2) in Appendix III 
with 47 =1= "7: 


saa") = SED, nen") (9.9.24) 
n=0 
1 CO 
Sa 9.9.25 
ara = Uren (9.9.25) 


where the U, are Chebyshev’s polynomials of type II (see Appendix III), 
which formally lead for (9.9.23) the expression 


Sf n+1 Ap 
fers |S 5 dy sti] Um (nn) A(Q”,0') 
n=0 
“(n+l 
ae ( ee Un (n+). (9.9.26) 
n=0 uy 


Now we use the orthogonality /completeness relation over the solid angle dQ” 
in (II1.4.1) of Theorem IV.1 in Appendix III, to infer from (9.9.26) that 
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Bo Lace 
A (2,2) = 5 x) Un (1-1) (9.9.27) 

n=0 [n+ 1 + — 

poh 

which may be rewritten as 
ives n? j 
A(2,9') = = x Una (nn). (9.9.28) 
poh 


9.9.2 The Negative Spectrum p® < 0,A < 0 


From (9.9.21), we then have for the Green function 


16pp5 


G(p,p’) = (1-1). 

(D3 + p?)” (p§ +p’) me 5 ¢ + zy" 

poh 
(9.9.29) 
In terms of the variable p® in (9.9.9), with p® — p® + ie, (9.9.29) reads 
2 
1 (") 
BQO Ve 
Gi (p, psp ) —_ pen 2 2 12 2 
p—P tie poke 
2p 2 


nUp— ; 
x ae mW) ( ap + 2 H). (9.9.30) 
n=1 (o 0 a ee Rn? ; + ie) 


For p’ = p, n = 17, Un_-1 (1) = 1 (see (III.2.10)) in Appendix III, and for 


pr? 
fa (9.9.31) 


for a given fixed n, (9.9.30) leads to 


pr 8|AP we 
(0° +) Ge (p,p;p°) = n? Se (9.9.32) 
ne he r2 (v2 + “yu ) 
n2h? 


giving the degree of degeneracy n? for the eigenvalues in (9.9.31), and the 
normalized momentum probability density in the state n: 


Ds! 8|Al? pe 
In (p)? = _ 


4 
n> h°r2 (v° + a 


(9.9.33) 
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The normalizability of the latter follows from the value of the integral 


d3p 1 
if uae (9.9.34) 
(p?+a?)" 8 |a| 


where the integration measure here is d°p. [Relative to the measure 
d?p/(2rh)*, one must multiply (9.9.33) by (27h)* regarding normalizabil- 
ity.| 

The wavefunctions in the momentum description may be also obtained 
from (9.9.30) for p 4 p’. To this end, one may expand U,_; (n - n’) in terms 
of Gegenbauer polynomials! defined by 


Cita (2) = wi (2) Un—1 (2) (9.9.35) 
in the form 
Un—1 (n- 17) ea ) Fy n(x’) Pe (cosa) (9.9.36) 
where 
Fy n(x) = 2° tae VY a J/ae+ Isin’ y Ch*5_, (cosy), (9.9.37) 


(nt Ol 
with the P; denoting the Legendre polynomials, and 
cos a = cos 6 cos 6’ + sin #sin &! cos (¢ — ¢’). (9.9.38) 
Upon using the expansion (see (5.3.67)), 


Ar £ 


‘ $31 ee Se S * hour 
Pe (cos a) = (20 4 1) eat m (0, ¢) Y m (0 @ ) (9.9.39) 
and setting 
_ Alu 


it is not difficult to show from (9.9.30), at the pole in (9.9.31), that the 
wavefunction, up to a phase factor, in the momentum description is given by 


joe Peal au (ap) (n—€— cet 
jeue am Cane eG 


acs oe) Yom (9,4) 
(9.9.41) 


'™ See, for example, Gradshteyn and Ryzhik (1965), pp. 1029-1031. 
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where we have finally used the definitions in (9.9.12). The Wn ¢m(p), as 
following from (9.9.30), are automatically normalized with respect to the 
measure d?p. 

It is easily verified from (9.9.36), (9.9.37), (9.9.41) that 


n—-1 2 
32D [nem (P)? = bn (PIP (9.9.42) 
£=0 


m=—é 


thus coinciding with (9.9.33). 

One may formally take the Fourier transform of wp ¢ m (p) in (9.9.41) and 
make a transition to the configuration space description and this is left as an 
exercise to the reader (see Problem 9.21) 


9.9.3 The Positive Spectrum p® > 0 


To obtain an expression for the Green functions G4 (p, p’; Pp’), for p® > 0, 
we go back to (9.9.28). 
To the above end, we note that for any € 


(1—€)? +€(n—- 1)” =14- 2 2-0 (9.9.43) 
since n? = 1 = 9”. Hence from (III.2.1), in Appendix III, with |€| < 1, we 
may write 


Co 


1 
=) Uni (n°) (9.9.44) 
Ja-2? +e-n)| » 


or upon multiplying the above by €, and explicitly taking the derivative with 
respect to € on both sides we obtain 
as an 
{ aa Sane Up aCe a ). (9.9.45) 
Ja-eP+em—nyP] 


By multiplying this equation by €, and taking the derivative with respect 
to € gives the useful equation 


a aa z= Son? e(Un-i(n-n’). (9.9.46) 
la-e?+em—ny?] ot 


In reference to (9.9.28), we note from (9.9.9) that with p® — p® + ie, 
p® > 0, corresponding to Gi, respectively, 


pe = —2yup® Fie. (9.9.47) 


9.9 The Coulomb Green Function: The Full Spectrum 597 


Hence concerning the Green function Gi, we have 

po = (2up°) e~'* (9.9.48) 
for ¢« — +0, and for G_, 

po = (2up”) e'”. (9.9.49) 
Accordingly, we have the rule: 


1 Pe cds - 
aes etin/2 = (9.9.50) 


Po /2pp® / 2p? 


with p° > 0, for the Green functions Gz, in the interpretation of 1/po in the 
denominators in (9.9.28). 
Let 


=. p >O0 (9.9.51) 


and using the fact that in (9.9.28) we may write 


1 : 
(n—1+iy) 
| dé € 7 (9.9.52) 


nt iy - 


we have directly from (9.9.46) and (9.9.28) that 
d é (1 - e) 

2 
“la-9? +é(n-0")| 


AQ,2) =< | ‘eth ae 


~ 972 
20 0 


(9.9.53) 


Finally we use the equalities in (9.9.20), (9.9.50) and the definition in 
(9.9.21) to obtain explicitly 


', att i Le 1 ; +i d eae) 
Gt (p,p ey) 7 \/ 8p0 or g "de ae | 1 2 


s-—a-9)| 
(9.9.54) 
where 
pz =(p-p')” 5 apie 5 ,  p’>0 (9.9.55) 
( = ee cE ie] (0° — R <= ie] 
2p 2p 


and ¥ is defined in (9.9.51). 

The Green function in (9.9.54) for the positive spectrum p? > 0, and \ > 
0, A < 0, will find detailed applications to Coulomb scattering in Chapter 15, 
§15.5. 
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Problems 


9.1. 


9.2. 


9.3. 


9.4. 


9.5. 


9.6. 


9.7. 
9.8. 


9.9. 


Show that in one-dimension the analogous expression to the one in 
(9.1.30) is given by 


foe} d 0) 
GS (zt; 2't’) = / oh [sop oP (iv2mp® la — 2’ | /h) 


x exp (—ip? (t — t’) /h) . 


Show that an integral representation of G° (xt; x’t’) in two dimensions 
is given by 


O forge gly) — TE. " oa 2mz\ ict 
C° (atin) = ef tes (Is “iE 


where Jo is the zeroth order Bessel function. 

Derive the Green function for the linear potential in §9.2 by the 
method used in deriving the Green function for the quadratic potential 
in the same section. 

Carry out a study of the Green function in the one dimensional po- 


tential barrier 
Vo, 0<a<a 


Oe ‘e elsewhere 


where Vo > 0. It is advisable to rewrite V(a) as 
V(x) =[© (x) — 9 ( — a) Vo. 


Obtain an expression for the probability density distribution on an 
observation screen in a single "slit" experiment where the "slit" is a 
sharp circular hole. Find then the probability of observing a particle 
outside the classical shadow of the hole on the screen. Extend your 
analysis to a double slit experiment involving two sharp circular holes 
of equal radii. These analyses are conveniently carried out in terms 
of so-called Lommel functions (see also the appendix to §15.2). [Ref: 
Manoukian (1989).| 

Show that for a time independent F(t) = F in (9.4.1), the expression 
in (9.4.21) goes over to the one in (9.2.10). 

Carry out in detail the steps leading from (9.4.22) to (9.4.27). 

In reference to the first term in (9.3.14) with the denominator [p°? + 
mA? /2h? + ie], show that by closing the p®°-contour in (9.3.2), in the 
complex p°-plane, from below, the semi-circle of infinite radius gives 
zero contribution and hence an application of the residue theorem 
gives (9.3.16) for the contribution of the pole p® = —mA?/2h?. 

Study the Green function of an oscillator for which w in the harmonic 
oscillator problem is replaced by a general time-dependent one w (t). 


9.10. 


9.11. 
9.12. 
9.13. 
9.14. 


9.15. 
9.16. 


9.17. 


9.18. 


9.19. 


9.20. 
9.21. 


9.22. 


9.23. 
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Show that (9.5.2) leads to (9.5.3) and finally to (9.5.5) for the Green 
function G%., in half-space. 

By integrating by parts, derive the inequality in (9.5.10). 

Derive the expression for the probability density given in (9.5.14). 
Derive the Poisson sum formula in (9.6.10). [A classic reference on this 
is: Morse and Feshbach (1953), p. 467.| 

(A review problem in electromagnetics). Show that the vector po- 
tential A of an infinitely long solenoid of circular cross section, at a 
distance r outside the solenoid is given by the expression in (9.6.17). 
Using (9.6.24), (9.6.25) derive the two equations in (9.6.26), (9.6.27). 
By relating the modified Bessel function Ij,)(—ip) to Jj, (p) 
J, (ie) = eT iluln/2 7 (p) and using the known asymptotic behavior 
of Ji, (e) for p — oo, verify the corresponding one for [j,) (—ip) as 
given in (9.6.28), (9.6.29). 

By the application of appropriate boundary conditions at the origin, 
as discussed at the end of 89.6, to allow the scattering of a particle 
off by the solenoid at the origin, even for 69 — 0, investigate how this 
scattering is modified by the activation of the solenoid, i.e., for dg > 0, 
as an illustration of the Aharonov-Bohm effect. 

For the illustration of the Aharonov-Bohm effect at the end of 
89.6, consider rather formally initial states such as exp (—|¢'/4|), 
exp (- |o’ 3 /y*)) with y > 0 very small so that contributions from 
|b’| > y~ may be neglected. Investigate the nature of the observation 
intensities as functions of @. 

Extend the analysis provided in the example on the Aharonov-Bohm 
effect at the end of §9.6 to single arc-slit and to double arc-slits exper- 
iments. The reader may place the arcs in the most convenient way to 
simplify the analyses. 

Verify the identity in (9.7.9). 

Carry out a Fourier transform of ¢nem (p) in (9.9.41) to x-space. Here 
you will need properties of some special functions which will allow you 
to explicitly carry out the Fourier transform. 

Investigate the nature of the Green function for the 1/ |x| potential in 
two and one dimensions. 

Show that for a potential of the form V(z,t) = mw? (t)a?/2 — 
«F'(t), the Green function takes the form G (at;x't’) = N (t,t’) x 


exp (i do dr L, (r) /h), with L.(7) the classical Lagrangian, and set 


up an equation that would determine N (t,t’). [Note that L.(7) de- 
pends on 2, 2’, t, t’ as well.| 


10 


Path Integrals 


The path integral formalism of quantum physics, as an alternative to the 
operator approach, was introduced by Feynman! in 1948. In recent years, it 
has been applied in almost all areas of physics and has become a powerful 
and essential tool to do quantum physics. There is an intimate connection 
between this formalism and its classical counterpart, and the action, as the 
time integral of the Lagrangian, appears naturally in it. In its simplest form, 
it involves in developing an expression for the amplitude (xt|x’t’) for a par- 
ticle initially at x’ at time t’ to be found at x at a later time t >t’ as a sum 
over all paths beginning at x’ and ending up at x. Quantum physics being 
probabilistic, this expression involves, in general, in addition to the classical 
path joining x’ to x, an (uncountable) infinite number of possible paths join- 
ing these two points. The importance of a so-called Lagrangian formulation 
of quantum physics was emphasized by Dirac and fully exploited by Feynman 
in his classic work. 

The present chapter deals with a fairly detailed treatment of path inte- 
grals. We approach the problem as a logical extension of our study of Green 
functions in the previous chapter. A key result in the analysis is to rewrite the 
amplitude (xt|x’t’), mentioned above, in terms of a completeness relation as 


(see also (9.6)) 


(ties) = Jorx' (xt |x//t") (x!"t" |x't’) (10.1) 


with t” conveniently chosen so that t’ < t” < t. By repeated application of 
(10.1), one may rewrite (xt|x’t’) as integrals of the product of a large number 
of amplitudes each of which describes the time evolution in infinitesimally 
short times. This technique, usually referred to as the “time slicing method”, 
allows one to readily obtain the path integral representation of the amplitude 
in question. §10.1, §10.2 deal, respectively, with a free particle and a particle 
in a given potential. For interactions involving velocity dependent potentials, 


' Feynman (1948); Feynman and Hibbs (1965). 
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such as in the interaction of charged particles with external electromagnetic 
fields, the completeness relation in (10.1) is extended to sums over momenta 
as well. This leads to a phase space analysis and is the subject matter of §10.3 
with emphasis put on the interaction of a charged particle with an external 
electromagnetic field which is of central importance in physics. $10.4 deals 
with a systematic analysis of path integrals with constrained dynamics.” This 
subject matter has become quite important in recent years and deserves the 
proper attention given here. §10.5, $10.6 deal, respectively, with the problem 
of Bose and Fermi excitations with special considerations for their interactions 
with external sources. In §10.6 the necessarily tools for integrations over so- 
called Grassmann variables are also developed in great detail in order to 
handle Fermi excitations in the path integral context. 


10.1 The Free Particle 


In this section, we derive the path integral expression for the amplitude 
that a free particle in R”, which is initially at x’ at time 7’, is found at x at 
some later time t. 

Our starting point is the completeness relation (10.1), as applied (N — 1) 
number of times which reads: 


epee = farx 2. U’XN_1 [ (xt]xn—1 tn—1) 


x tpg tea [two adg ++ (x1 tr! Yo | (10.1.1) 


where (see (9.1.38)) 


v/2 F 2 
ORE Te oe | re een eee eer (SC rc meee ZR 
eas . Qrih (that = tr) 2h (thea = ty) 
(10.1.2) 
t>tny_-1>°:: >t, >, and we set x = xy, x’ = xo, t= ty, t! = to. 


By choosing t,41 — ty, = (t— t')/N = T/N, we may rewrite (10.1.1) as 


? Cf. Dirac (2001), Faddeev (1969); see also Senjanovic (1976). 
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Fig. 10.1. A path contributing to the expression for (xt|x’t’). In the contin- 
uum limit N — oo, one has an uncountable infinite number of paths. 


Upon taking the continuum limit N — oo, defining formally the measure 


m Nv/2 {[N-1 
9(x(-)) = lim (stn) I] ¢’x:; (10.1.4) 


N- oo 


and in this limit, converting the sum in the exponent in (10.1.3) to an integral, 
we have the so-called path integral expression for the amplitude (x t|x’ t’) 9: 


(xt|x’t’), = oe P(x (-)) exp F [a ma] (10.1.5) 


(8) =x’ 


written as a sum over all paths going through the volume elements d’x,, 

.., d’xy_1, respectively, about the points specified by x1(t1): (x1,t1), .-., 
Xn—1(tnw-1): (Xn-1,tw-1), as each one of the variables x1, ..., X\—1 moves 
all over R’, i.e., as we integrate over all of these variables, beginning at x(t’) = 
x’ and ending at x(t) = x, and take the limit N > oo. 

A particular case of (10.1.5) is the amplitude for a particle to begin at 
the origin x’ = O and end up at the origin x = 0. The expression for this 
amplitude may be directly read from (10.1.2), and we may write for (10.1.5) 
in this case: 


fees I(x ())) exp E [« an 7 (atom) mae 
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This equality will be useful, as a normalization factor, in some cases for a 
particle interacting with a given potential as we will see in the next section. 


10.2 Particle in a Given Potential 


We consider a Hamiltonian of the form 


2 
H(t) = ~ + V(x,t) = Ho + V(x,t) (10.2.1) 

m 
where, for greater generality, V(x,t) may depend explicitly on time but is 
independent of p. The explicit time dependence of V(x,t) is assumed to 
come only from a-priori given c-functions of t such as, for example, in the 
time-dependent forced dynamics studied in §9.4. As an operator, V(x, t) is a 


multiplicative one with respect to |x’), ie., 
V(x, t) |x’) = |x’) V(x’, t). (10.2.2) 
Since for a state |W(t)), the Schrédinger equation reads 
ine |W (t)) = A(t) |U(t)) (10.2.3) 


and (x’|(t)) = (x’t|W), one has 


ine (x't| = (x’t| H(t) (10.2.4) 
and similarly 
- ine |x’t) = H(t) |x’t). (10.2.5) 


Upon setting (to + t1)/2 = t, ti — to = €, we may write the amplitude 
(xity |xoto) as 


(xity |xoto) = (x1,¢ + 5 |xo,t = =) . (10.2.6) 


For ¢ — 0, then (10.2.4), (10.2.5) lead for (10.2.6)? 


€ € 
(x11 |xXoto) & (x: ja = i H(t)|xo ) ~ (x; lexp ee) xo) . (10.2.7) 
The exponential representation in the last equality is essential to obtain a 
unitary expression and satisfy the group property in time. The final verdict 
of the whole formalism is its consistency with the Schrédinger equation. 


3 We note that in (10.2.7), it is understood that we eventually take the limit 
€é — 0, otherwise one would necessarily have the time-ordered structure given in 
the Appendix to §2.5. 
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On the other hand for ¢ — 0, the Baker-Campbell-Hausdorff formula (see 
Appendix I) states that 


exp = Ho exp -FV(x, t) ~ exp a (cH(t) + O (€*)) (10.2.8) 
which from (10.2.7) gives 


(xit1|xXoto) & fox (x: ected 


oi 


m \v¥/2 ic [m (x1 — Xo : 
~ (—— t 10.2. 
Ga aE (; ( e ) acer ) 022) 


for ¢ — 0, where we have used (10.1.2) corresponding to the free Green 
function. 

We first check that (10.2.9) is consistent with the Schrédinger equation 
for a wavefunction (xt|W). 

To the above end, we have to verify, from (10.2.9) for « ~ 0, that 


m \¥/2fi, ie [m (x-x'\? : : 
(x,t+e|W) = (=) forstomi (3 ( 7 ) V(x’, t) | (x’t| WV) 
(10.2.10) 


is consistent. Upon making a change of variables x’ — x’ — x = z, we obtain 


m \v/2 fy im 5 
athe | ~ (sone) fa oe (sre ) 


. f ae (Talx\V (x0) (T(x) U(x, t)) (10.2.11) 


eT EME 9) 


where T,,(x) is the Taylor operator 


T, (x) = (: +2 V4 5 ~). (10.2.12) 


Using the integrals 


m \¥/2 pL, imz? 
(-) Jc zexp (3) = (10.2.13) 
m \v/2 ; imz? 
vy J — 
(—) fe ze exp ( a2 ) =% (10.2.14) 


m \v/2 3 imz? ., ihe 
Vv ey | ao SUD 
(——-) Jc ZLZZ exp (SE) =5 — (10.2.15) 
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and so on, and keeping only terms up to order ¢ only in (10.2.11), we get 
W(x,t+e) — U(x,t) TE WPU(x, t)- EV(x, t) U(x, t) (10.2.16) 

which leads to the Schrédinger equation in the limit ¢ — 0. 


The completeness property of the Green function functions provide from 
(10.2.9) the limit 


. N-1 P: 
x exp 5 SS € E Se - **) Vo) (10.2.17) 


where #, = (ty ttp4i1)/2=t +(k+1/2)e, =U +hke, k=0,1,...,N—1, 
e=(t-t)/N = trey — tr, to v’, tn t, Xo x’, XN =X. 
As in (10.1.5), we may rewrite (10.2.17) in the limit N — co 


(xt|x!t!) = a F(x (-)) exp x [ow (S20) —V(x(r),7)) 


(t))=x! 
(10.2.18) 
as a sum over all paths beginning at x(t’) = x’ and ending at x(t) = x. 
Here we recognize the Lagrangian 
M.9 
L(r) = ox (7) — V(x(r),7) (10.2.19) 
as a c-function, and f[, i dr L(t) = Aas the action. It is not always true, how- 
ever, that the Lagrangian simply appears in the integrand in the exponential 
in (10.2.18) in every case (see, for example, Problem 10.5). 
As an application, consider the potential V(a) = —Ez, in one dimension 
(see also §9.2) and the corresponding amplitude 


«(t2)=x2 i te oa 
(x9 tg|a1t1) = ‘| Q(x (-)) exp ;/ dr [Fx°(7) + Ex(r)|]. 
au(ti)=21 h ti 2 
(10.2.20) 
In addition to the classical path in (10.2.20), we have to consider all paths 
joining the point 2(¢,) = 71 to x(t2) = x. Accordingly, any given path may 
be described by the function 


a(T) = x-(T) + y(7) (10.2.21) 


where x,(T) is the classical solution, and y(r) denotes the deviation of x(r) 
from the classical one at time T. 
The boundary conditions are 
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t(ti)=21, x(t) = 22 (10.2.22) 
Xe(t1) = 71, X-(t2) = 12 (10.2.23) 
and hence 
y(t1) = 0 = y(ta). (10.2.24) 
The solution of # 
Le =— 10.2.2 
#_(7T) = — (10.2.25) 


satisfying the boundary conditions in (10.2.23) is given by 


£.(T) = =O ti) 4 (es £1) a a | = (10.2.26) 
t-(T) = eee + &e(t1)(t2 — t1) + a1. (10.2.27) 


Also note that 


eNO) Ex(r) = (“eo + Exe(7)) 


2 2 
+ m&,-(T)y(T) + Ey(7) 4 my) 
=L,(r) 4 my) mo (&e(7)y(r)) (10.2.28) 


where in writing the last equality we have identified L,(7) with the classi- 
cal Lagrangian corresponding to the classical motion, and made use of the 
equation for x-(7) in (10.2.25). 


Hence 
te te tz +2 
if drL(r) = / drL.(r) +f fg (10.2.29) 
ty ti ty 2 
where on account of the boundary conditions in (10.2.24) 
te d 
ae &(T)y(T)) = &e(te)y(te) — &e(ti)y(t1) = 0. (10.2.30) 
ti 


In detail, (10.2.26), (10.2.27) give 
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thus defining the action for the classical path. 

Finally with the change of variables x(7) to y(7), as given in (10.2.21) 
for all t; < 7 < te, and from the translational invariance of the measure in 
(10.1.4), we obtain 


7 y(t2)=0 . ta +2 
(x2 t2|x1 ti) = exp (;5:) / PD (y(-)) exp : / ry (7) 
h y h t 2 


(t1)=0 1 
(10.2.32) 
From (10.1.6), (10.2.31), this leads to the expression given in (9.2.10), where 
now T= te — tis 
The derivation of (#2 t2|x1t1) for the time-dependent potential —xF'(t), 
for a time-dependent force F(t) (see §9.4) using the path integral technique 
is left as an exercise to the reader (see Problem 10.1) and the expression for 
which is given in (9.4.21). 


10.3 Charged Particle in External Electromagnetic 
Fields: Velocity Dependent Potentials 


We consider the interaction of a spin 0 charged particle, say, of charge 
e, with a priori given external electromagnetic field described by the pair of 
fields ®(x,t), A(x,t). The Hamiltonian in question is given by 


(p- £A)" 


A(x,p,t) = ae 


+e® 


2 

e € e 

= 2 p:A A-p+ 
2mc 


2 
Smezt* + e®. (10.3.1) 

It is yet not clear in what sense the path integral derivation given in 
the last section applies to this important system. For one thing, here we are 
dealing with a velocity dependent interaction. 

To determine the amplitude (xt|x’t’), and, in the process, replace the 
operator p by a c-variable counterpart, we may use the resolution of the 
identity (see §2.4) 


1= | Pa p’t) (p’t| (10.3.2) 


for any conveniently chosen time t. 
To the above end, we note that we may write 


A-p+p:A=(p-A—A-p)+2A:pr 
= -ih(V-A)+2A-pp (10.3.3) 


where pp is the operator p operating to its right, and 
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A-p+p:A=(A-p—p-:A)+2p,-A 
=ih(V-A)+2p,-A (10.3.4) 


where py, operates on its left side. 
Accordingly, we may rewrite the Hamiltonian H, with the operator p in 
mind, as an operator operating to its right or left as 


2 . 
h 2 
Hi Gop See A Aap th ee 0038) 


2m 2mc mc mc? 
(p,)>  ieh e€ e? 
Ay, (x, p,t) = ae are a e A) = yee “A ie é . (10.3.6) 


The expressions for the Hamiltonian as spelled out in the above two ways 
may be now applied in the following manner. 

An amplitude (x; ¢1|xoto), with t = (to + t1)/2, ¢ = t, — to, may be 
written as in (10.2.6) 


(x1 ty |X to) => (x1,t+ : |xo,t = =) . (10.3.7) 


From the dynamical equation 


42 
ina, (x1,¢| = —_ (x14 H (x, Pp, t) 


= (xi t| Hp (x, p, t) 
= (xit| Hp (x1,P, t) (10.3.8) 


we have for e — 0 


€ 
(x1,t+ ;| ~ (xit| (1 me Hn (x1,P, ») 
-(—, oP. ze /h (1 — ae i (x1, p’ 0) (p’t| (10.3.9) 
(20h)° 2h ai 
where in the last equality we have used (10.3.2), (2.4.8). We note that 


AR (x1,p’,t) is a c-function. 
Similarly, from 


) 
ih, |xo,t) = H (x, Pp, t) |xot) 


= A, (x, p, t) |xot) 


= Ay, (Xo, P,t) |Xot) (10.3.10) 
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we obtain for ¢ ~ 0 


E le 
xo, t - =) > (1 _ 5p te (xo,Pst)) |xot) 


d?p' —ixo-p’ i€é 
= it (anh)? e 7 ixo-p /h |p’t) (1 = Teas oxo.) . (10.3.11) 


Equations (10.3.9), (10.3.11), when substituted in (10.3.7) lead to 


(x1 t1 | Xo to) 


hg me eee ie 
=[5 De ei(x o)p /h (1-55 (a (x1, p’,t) + Ay (x0.'1)) 
TT 


dsp’. Sep le 
~ f HP a teers (Ar (x1, p’,¢) Te Hy, (xo, P’,t) ) 


(2rh)° 
(10.3.12) 
fore ~ 0. 
In detail 
Hr (x1,p’, t) on A, (xo, Pp’, t) 
B. 

p’ ieh € 

hare Ones [(V-A),-—(V-A)o] - mae (Ai + Ao) 
e 2 2 
+55 ((A1)? + (Ao)”) +e +e (10.3.13) 


where the notation fo = f(xo,t), fi = f(x1,¢) has been used. 
The Gaussian p’-integral in (10.3.12) is elementary and may be carried 
out to yield, 


: 2 
ee ( m ye ice |m (x1 — Xo 
x x ~ | —_ xp | 

PETE \ Oni eae é 


= A A D D 
weed Xo) (Ao + A1) yi 0+ %1) Bila 
Cc E 2 2 


for ¢ ~ 0, where 


e2 (A,;—Ao)? _ ieh 
2mc? 4 4mc 


Aoi = 
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The latter being the sum of differences, and not being divided by ¢, is smaller 
than the first three terms in the square brackets in (10.3.14). That is for 
t; — tp =e ~ 0, we may take 


ee ek 
Bel EL ROOK = oars AP he |D é 


e (X1 — Xo) . (Ao + Ai) (Po + 21) 
Cc € 2 2 


+ 


(10.3.16) 


Now it is not difficult to verify that (10.3.16) is consistent with the 
Schrédinger equation to be satisfied by a wavefunction (xt|W). To this end, 
we have from (10.3.16) for « ~ 0 


bet+elyy = (sm) fats’ exp = | (A—* ; 
SPE ES \ pee ae ae coe 


+ ; e | rim. 


(10.3.17) 


By making a charge of integration variables x’ > x’—x = z, and carrying out 
Gaussian integrals as in (10.2.13)—(10.2.15), in the manner given in verifying 
(10.2.16), it is straightforward to show (see Problem 10.2), that (10.3.17) 
leads, upon taking the limit ¢ — 0, to 


) 
ih W(x, t) = H(x,p,t)U(x, 6). (10.3.18) 
The expression in (10.3.16) may be then systematically used to obtain 


3N/2 
(ctl) = Jim (sme) Oa ePan a 


. N-1 2 
i Xk41 — Xk 
x XP 5 E ( (=) 
FeO : 
_ © (Xk+1 — Xk) | (A(xx, te) + A(Kx41, th) 


E 2 


4 (®(x,, te) + O(xpai, tx)) 


2 


where tp41 = (ty ttysi)/2=U+(k+1/2)e, t. =t’'+ke, k=0,1,...,N—1, 
eE= (t—t)/N = tryt = "EE to t.; tn t, Xo x! XN = X. 
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In the limit N — ov, (10.3.19) may be written as 


x(t)=x 
(xt xt) =f Ax()) 


(=x 


x exp * | dr ee + =X(7) - A(x(T),7) — c(x(7).7)) 
(10.3.20) 


where we recognize the T-integrand as the Lagrangian of a classical charged 
particle in the external electromagnetic field described by (®, A). 

The procedure developed above may be formally applied to Hamiltonians 
with velocity dependent potentials in general. Using a standard notation for 
such Hamiltonians H(q,p), written in terms of q’s and p’s, we may move all 
the p’s to the right of q’s, using appropriate commutation relations, as done 
above, thus defining Hp(q, p), and similarly defining Hy,(q,p), we may infer 
from (10.3.12) that 


d’p i(a—aays —ie 
(qiti ldgto) ~ f eee (He (q),P1) + Hy (ao: P1) ) 


(2rh)” 
(10.3.21) 
where ty} —to =E 0. 
The expression (10.3.21) allows us to deduce formally by using the com- 
pleteness of the Green functions 


d’p, d’pn 
2th)” ~° (Qrh)” 


. N-1 
1 G41 7 Ak 
x exp (i ys EPK+1° (au=t)) 
k=0 


. (N-1 

i E 

x on ae ( S 3 (Ar (G415Px41) + a) 
k=0 


(at|q’t’) = Jim fd’qy---d’qy_1 


(10.3.22) 


where € = tp41—ty = (t-0/)/N, ty =t, tp =U’, Qn =, G9 = W’, a8 defined 
below (10.3.19). 
Upon taking limit N — oo, we have 


a(t)=q 
(at la’) = | Hal), P(-) 
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where we have defined a classical Hamiltonian H, by* 


1 


H-(a(r), P(7)) = 5 (Ain(a(r), (7) + H(a(7),P(7))) (10.3.24) 


The paths-measure is formally given by 


: d’py oe v d’p; 

Ag(), P(-)) = Jim (nh)? u (« won) (10.3.25) 
with the pre-limiting expression involving one additional v-dimensional mo- 
mentum integration, with the measure d”’py_/(27h)”, than the q-ones. In the 
limit N — oo, the number of integration variables goes to infinity and in this 
limit, one rather formally integrates over q(7) and p(r) in phase space for all 
t! <7 <t with the boundary conditions q(t’) = q’, q(t) = q. The expressions 
in (10.3.23)—(10.3.24), for velocity dependent potentials, should, however, be 
taken only rather formally for general cases and are not void of ambiguities 
and we will not go into consistency problems of this representation here with 
generality. For the particular case, however, where 


H(q,p) = = + V(q). (10.3.26) 


Equation (10.3.22) becomes 


d’p, see d’Pw 
(Qrh)” = (27h)” 


(atlq’t’) = Jim fd’q,---d’qy-1 


N-1 
x exp (-j 5 (V(du4i) + via.) 


We may then explicitly integrate over the p,’s to obtain 


(qt |q/t’) = li ( : ee d’ d’ 
= lim ee 
a Ia Noo \2Tihe a dv-1 

“Note that if Hr(q,p) = > Gnmfn(qi,.--,q) hm (p1,---,;pv), which may 


include a constant term, where fn, hm are real, then Hi(q,p) = 
So Gimhm (p1,---;Pv) fn (qi;---,qQv), and He in (10.3.24), expressed in terms 


of c-numbers, is real. See, for example, (10.3.5), (10.3.6), and Problem 10.5. 
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i ar (3 (t= = my V (digi) + van) 


E 2 
(10.3.28) 


which in the limit N — oo coincides with (10.2.18), where in the latter we 
have, in general, an explicit time-dependence coming from a given c-function. 


10.4 Constrained Dynamics 


The purpose at this sections is to develop the formal theory of path in- 
tegration when constraints are present in the theory. Let q = (q1,..-,q), 
Pp = (pi,---,Pv) denote canonical conjugate variables, in general. With the 
path integral expression in (10.3.23) in phase space in mind, suppose because 
of underlying constraints in a theory such variables are not all independent 
and one succeeds in isolating a maximal number of independent canonical 
conjugate variables (qj,...,@h,);(pi,--- Diaz), where M < v. Then according 
to (10.3.23) one may formally infer that 


qa(t)=q 
(at |a’t’) = / 9(a*(),P*() 


(t)=q! 


x exp hi i dr (p*(r) ‘ q* (7) — H* (q*(7), p*(r))) (10.4.1) 


for the corresponding amplitude, with Hamiltonian H*, expressed in terms of 
the independent variables, where 2(v — M) of the variables q, p are functions 
of q*, p*, and one in turn is working in a subspace of dimension 2M < 2v of 
the phase space. 

It is not, however, always easy to solve for such independent components 
and an alternative general expression is needed which deals with all the com- 
ponents (q,p) with the constraints imposed directly on the path integrals. 
This is the aim of the present section. We note that all the variables in the 
path integral (10.3.23) are c-variables and hence are more easily dealt with. 
This, as mentioned before, is the most attractive feature of path integration 
which solves the quantum mechanical problem in terms of c-variables and 
corresponding trajectories. With this in mind, we need some notions of con- 
strained classical dynamics which are dealt with next. Due to the technical 
nature of this section, it may be omitted at a first reading. 


10.4.1 Classical Notions 


We recall Hamilton’s equations 


OH OH 
n=, Sa 10.4.2 
dk = 3p, Dr Dax ( ) 
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where the dots denote time derivatives. The Hamiltonian H generates the 
time development in the sense that for t — t — 5t = ¢ for a change &t = 
t — t = 57 of the time variable by a parameter 7 (see (2.1.15), (2.1.16)), 


gk (t) > ax (t) — bax (t) = Me (t) = Oe (E+ 57) 
= gy (t) + 74x (t) (10.4.3) 
where we have used the fact that q, (¢) = qx (t) for pure time translations, 


Pr (t) — pr (t) — dpe (t) = Pr (t) 


= p(t) + 7px (t) (10.4.4) 
a oH oH 
dq, = —~—47, dp, = —— Or. (10.4.5) 
OPr Oqx 
An important property of such a transformation is that the Jacobian is 
one: a 
dk) Pk 
eee = 1 10.4.6 
I GnsBm) core 


and hence the measure of integration in phase space remains invariant: 
[[dadp. — [] aa ap, (10.4.7) 
k k 


and the transformation (10.4.3)—(10.4.5) is said to be a canonical one. 
Other canonical transformations may be defined similarly by 


5¢% = Bhi (10.4.8) 
OW 


where W is referred as the corresponding generator with respect to a para- 
meter €. Infinitesimal changes dt of the time variable have been defined in 
(2.1.14), (2.1.16) by 5t = 57 and we set the parameter 5€ = 67, W = —H. 
On the other hand for pure space translations 5q; = 5a;, according to the 
definitions in (2.1.13), (2.1.15), and we must set 5 — 8€; = ba;, W — pj. 
These are in conformity with the definition of the generators in (2.3.14) and 
consistent with (10.4.8). For a function F'(q,, px) of the gz’s and the p,’s, 
(10.4.8), (10.4.9) dictates that for F — F —5F = F, 


OF OF 
aes Gag - 5 8Pr) 
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OF OW OF OW 
= s d€. 10.4.10 
7 (5 Op, ODE 7) ‘ ( ) 


Thus introducing the Poisson bracket defined by 


OF OW OF OW 
ape 10.4.11 
t Ip. » (i Op, Opp a) ( ) 
we may rewrite (10.4.10) in the compact form® 
bF ={F, Why df (10.4.12) 


Again the Jacobian of the transformation (10.4.8), (10.4.9) is one. As a 
consequence of this, a Poisson bracket is invariant under canonical transfor- 
mations, i.e., in detail 


@ dB OA eB) aS a 0B OA =) 
Og: Op, ODE Ode. OGr OPk = ODk OGu ) 


(10.4.13) 
We recall some of the important properties of the Poisson bracket: 


{A, Bhp ps. a S> 


k k 


(Fi, Fo}p.n. = — (Fo, File, (10.414) 
{ai Fi + agF, F3}p p = a1 {Fi, F3}p p. + a2 {Fh, F3hp p. (10.4.15) 

for any constants a1, Q2, 
{Fife F3}p3 =F l{P, Ppp +{Fh, Bhp, (10.4.16) 


and satisfy the Jacobi identity 


{fi {fo Pte p tes. + {42 {43 Alps tes +143, 1h, Fate step, = 9- 

(10.4.17) 

We are, in general, interested in canonical transformations involving more 
than one generator defined through 


SF =S-{F,Wahp x bfa (10.4.18) 


for some parameters €,, where the generators W,, satisfy the algebraic rela- 
tions 


{Wa, Walp. = >, a gWy (10.4.19) 
Y 


° Tn particular for W5€ = —H5r1, F = F(q(t), p(t)), and by definition (see (10.4.3), 
(10.4.4), 5F = F-F = F (a(t), p(t) —F (a(t) +57a(t), p(t) +8rp(t)) = —F6r = 
{ F, —H br or F= { F, Ely where F’ = dF’/dt, as expected. 
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with the latter stating that the Poisson bracket of any two generators may 
be written as a (linear) combination of the generators in question, where the 
coefficients c” g are referred to as structure constants. 

Transformations q, p — 4,p, H (q,p) — H (q,p) are canonical, leading 
to Hamilton’s equations (10.4.2), if the difference between the Lagrangians 
(times dt) 


(p-q—H (q,p)) dt (p-G— A (@.p)) dt (10.4.20) 
of the corresponding actions A — A is a total differential. That is, 
(p-q— H(q,p)) dt—(p-q- H(q,p)) dt =dU (10.4.21) 


We leave it as an exercise to the reader to establish this property for 
infinitesimal canonical transformations given in (10.4.12). 

Now consider a Hamiltonian H(q,p) describing a system with 1 inde- 
pendent constraints, v1 < v: 


¢1 (q,p) = 0,...,¢,, (4, p) = 0. (10.4.22) 


These are referred to as primary constraints. 

For a function F(q,p) of q,p, we shall use the notation F'|, with a bar, 
when the constraints in (10.4.22) are imposed, in its definition. 

The constraints in (10.4.22) may be taken into account by introducing 
Lagrange multipliers Ag and by introducing the Hamiltonian 


Hr = H + Naba (10.4.23) 


with a summation over a from 1 to v; understood. The resulting Hamiltonian 
Hr then is parameterized by the Lagrange multipliers A,, and the pertinent 
physical question and the corresponding uniqueness problem arise as to what 
the explicit expressions of these multipliers are. 

The constraints in (10.4.22) must be satisfied at all times. That is, it is 
necessary to have 


a| = {¢6, Hr}p.,,| =0. (10.4.24) 
for all @. In detail 


ob = {o,, Eas + Na {d~, gatp p. + {,, Ais Poise Po (10.4.25) 


(see (10.4.16)). In particular, we note that the last term in (10.4.25) will, in 
general, vanish when the constraints in (10.4.22) are imposed. 

We consider the following two cases regarding the equations in (10.4.24), 
if they do not provide trivial identities: 


(1) Those equations that may be independent of the multipliers A., and hence 
they cannot be used to solve for multipliers, but impose new constraints 
on the q’s and the p’s. 
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(2) Those that may depend at least on some of the multipliers. 


In case (1), new constraints have been introduced and they are referred 
to as secondary constraints, defined by 


bel = $g| =0. (10.4.26) 


We may repeat the above process, hopefully a finite number of times, by 
adding the newly discovered (secondary) constraints to the original set in 
(10.4.22) and to the sequel ones thus generated. This in conjunction with 
type (2) equations, which would allow us to solve for as many of Lagrange 
multipliers as possible, and will lead to all primary and secondary constraints 
and no more equations of type (1) arise. We will denote all of these constraints 
(primary and secondary) by 91,...,QxK. 

The Hamiltonian with all these constraints may be now written as 


Hr = H + paQea (10.4.27) 


with a summation over a from 1 to K. Here not all the Lagrange multipliers 
P1,--5PK are necessarily determined. We will see which ones are determined 
and which are not by considering two classes of constraints to be defined 
shortly below. 

The constraints 


1 (4,p) = 0,...,Qx« (a, P) = 0 (10.4.28) 


hold in a kK dimensional subspace of phase space which we will denote by 
I. We will still use the notation F'| when restricted to the subspace T. We 
assume the irreducibility of these constraints with respect to the subspace I, 
ie., if 

F|=0 (10.4.29) 
then 


K 
PE. 6.0% (10.4.30) 
a=1 


where the coefficients c, may depend on q, p. 
A constraint is called first class, denoted by ®q, if 


(Pig Oat 4s | 0 (10.4.31) 


for all Qg. Otherwise a constraint is called of second class. We assume that 
linear combinations, with appropriate coefficients, depending, in general, on 
q, p, have been already carried out to bring as many of the constraints as 
possible to first class. 

We may then write (Q,) = (®,y, Ug) where (®,) constitutes of first class 
constraints and (W,,) constitutes those of second class such that no coefficients 
Co, not all zero, may be found to make cy WV, of first class. Clearly 
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det {Va,Vs}p_| #0 (10.4.32) 


since otherwise, i.e., det {Vq, Velpp| = 0, would imply that we may find a 
vector [cy --- cx], with not all of the cg equal to zero, satisfying the matrix 


equation 
[{Vo; Vatpp | [ea]| = 0. (10.4.33) 


From (10.4.16) we would then have 
{Va,ceVa}pp| =0 (10.4.34) 


for all a since V | = 0, which implies that cgW is of first class thus reaching 
a contradiction. 
With the first and second class constraints thus defined, we may rewrite 
(10.4.27) as 
Ar =HA+ Naka + pave (10.4.35) 


with the sum over a from 1 to, say, vy — M, and the sum over @ from 1 to, 
say, L. The positive integer L is necessarily even.® 

All the Lagrange multipliers pg, multiplying the second class constraint 
functions Vg are determined in I’. This follows at once by noting that 


0= {Hr, Ui }pn.| = {H, Us hp.| + pa {¥s, Us hpn.| (10.4.36) 


and the fact that det {Ug,Vy}p 5 | # 0, where we have also used (10.4.29) 
for the first class constraint functions ®,. 
On the other hand, 


0= {Hr, ®a}p pl = (0, ®a}p pl] +0 (10.4.37) 


and the Lagrange multipliers 7. multiplying the first class constraint func- 
tions remain neces For one thing we note that for a function G of 
a(t), p(d), G(t = = {G, Ar} implies that G (t b) = G(0 0) | + G(0 0) |t, in 
the eee of t = 0, for a given initial condition G(0)], and that 
G(t t) | is not unique since G(0 0) |, in general, would depend on the multipli- 
ers 1, (0) and thus would S ene for different values of the 7, (0) (see 
Problem 10.9). 

Before discussing the technical details concerning the undetermined mul- 
tipliers associated with the first class constraints, we provide a couple of 
examples. 

Consider the quadratic Hamiltonian in 3D space (or in 6D phase space) 


d 


pl 
H = = + —mw’*¢q? (10.4.38) 
2m 2 


®° This follows from the fact that the determinant of an anti-symmetric matrix (see 
(10.4.32)) of odd order is zero. 
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with the simple constraint function ¢1 = q — q, 


oil = (aH — a2)| = 0 (10.4.39) 


from which one obtains 


1 

{¢1,H}p 3, = a (pi — pa) - (10.4.40) 

leading to a secondary constraint 
v1| = (pi — p2)| = 0 (10.4.41) 

and 
{1, H}p p = —mw? (qi — q2) = —mw? gy. (10.4.42) 
Since 

2={¢1,¢1}pp #0 (10.4.43) 


we have two second class constraints leading to a 4D physical subspace T' of 
the 6D phase space. 

The following system deals with a situation involving a first class con- 
straint only. Consider the Hamiltonian 


H = =—+V(q’) (10.4.44) 
m 
in 3D with constraint 


| = (a1P2 — Piga)| = 0 (10.4.45) 


implying the vanishing of the third component Lz of the angular momentum. 
Due to the conservation of Ls: 


em (qip2 Pas P1192) tp p. =0 (10.4.46) 


there is no secondary constraint. 
Upon writing 


Hr = sae V(q?) +A (qip2 — p12) - (10.4.47) 


On the subspace [ of phase space of dimension 6 — 1 = 5, specified by 
the constraint in (10.4.45), A is undetermined. This ambiguity is resolved by 
introducing an additional constraint function ~ which amounts in working in 
a 4 dimensional subspace [* C I in phase space and the Lagrange multiplier 
is then uniquely determined. We continue to use the notation F'| for a function 
F of q, p when restricted to I*. 

We may choose 

X = p2 cosa — p; sina (10.4.48) 
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where a is an arbitrary angle such that 


{¢, X}p | = —(p2sina+ p; cosa) # 0. (10.4.49) 


The introduction of a new constraint y, such that {¢, x} 4 0 automati- 
cally turns the example above to one with second class constraints only and 
the two corresponding Lagrange multipliers are then determined. 

The arbitrariness in the choice of the additional constraint y function is 
often referred to as a gauge freedom — the interpretation of which will be 
spelled out below. 

The validity of x| = 0 for all t, leads from (10.4.47), (10.4.49) to 


—{H, x}p.pl =At¢x}pB.| (10.4.50) 


which allows the determination of |. 
The interpretation of the above construction becomes clear by introducing 
the canonical variables: 


Q=qcosa—q sina, P=y=p2cosa—p; sina 
gq =qncosat+gqsina, p; =p cosa+ posina (10.4.51) 


% = 93, D3 = Ps. 


We leave it as an exercise to the reader (see Problem 10.10) to show that 
the transformation q1, 92, 43, Pi, P2, Ps — GT. Q, 4g, py, P, p3 is actually 
canonical consistent with Hamilton’s equations. It is easily checked that the 
pair of constraints ¢| = 0, y| = 0 may be replaced by the pair Q| = 0, 
P|=0. Also note that 

do 


{¢.x}p.5 = 39 (10.4.52) 


By choosing the additional constraint | = 0 with y defined in (10.4.48), 
with a given fixed a, the dynamics, as will be seen below, may be then 
described in the qj-q3 plane making an angle a with the original q;-q3 plane 
as shown in Figure 10.2 by the appropriate choice of the canonical variables in 
(10.4.51). The choice of the additional constraint y| = 0, fixes once and for all 
the choice of the plane, specified by some angle a, to describe the dynamics 
of the particle in question. Such a choice of an additional constraint may be 
referred to as fixing a gauge and the parameter a, in this context, may be 
called as a gauge fixing parameter. 

With the canonical variables defined in (10.4.51), we may write 


P? *2 
Hr= a + V(Q?+q°7) + 1Q+ poP (10.4.53) 


where q* = (q{, 93), P* = (pt, p3), with 


622 10 Path Integrals 


q2 


Fig. 10.2. The introduction of the additional (subsidiary) constraint x fixes 
once and for all the plane in which the dynamics is described by choosing 
appropriately new canonical variables as defined in (10.4.51). The angle pa- 
rameter a, defining the angle between the gj{-q3 and qi-q3 planes may be 
referred to as a gauge fixing parameter. 


(P? + p*?) *2 


2m 


+V(Q? +q*)||= + V(q*?). (10.4.54) 


2m 


Needless to say that p1|, p2| may be determined as Q,P are second class 
constraints since {Q, Plog =1F0. 

The introduction of such additional constraints yg = 0, which may be also 
referred to as subsidiary constraints, turn first class constraints into second 
class ones, which are chosen such that 


det {®a,xeh}p p | #0 (10.4.55) 


generalizing (10.4.49) (see also (10.4.36)) for more than one first class con- 
straint function: ®,,...,®,—,7. Clearly, then v — M subsidiary constraint 
functions are needed yj,...,%,—1z- We will choose them such that 


{XaXétn a |=0 (10.4.56) 


for alla, 6 =1,..., y— M in addition to the restriction in (10.4.55). 
In the sequel, dealing with path integrals with constraints, we will con- 
sider only theories with first class constraints. Problems with second class 
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constraints are, however, given in the problems section and additional com- 
ments concerning them will be made later. A generalization of the Poisson 
bracket, due to Dirac, will be, however, introduced to deal with second class 
constraints at the end of this section. 

We will thus be considering v — M first class constraints only 


with v — M additional subsidiary ones 


with the latter satisfying (10.4.55), (10.4.56). The constraints (10.4.57), 
(10.4.58) lead to a 2M dimensional subspace I* for the physical subspace 
of phase space, with M < v. 

The constraints functions ®, are assumed to be irreducible in the sense 


that 
v—M 


{Go,Pp}p—n = >, Ah, (10.4.59) 


y=1 


and that for any function F(q,p) such that F = 0|, 


F(ap) = >> f74, (10.4.60) 
where the coefficients c? a f7? may depend on q, p. 


10.4.2 Constrained Path Integrals 


We may choose (xa,p*), (Qa,q*), w= 1,...,4 — M, as canonical vari- 
ables, with Q, as the canonical conjugate variable to y,. We may then rewrite 
the right-hand side of (10.4.1) as 


i, (11 dyalr ) sues if dr (xa@a +p" eq° = it) (10.4.61) 


where 


de I]oe — Qa (a*,P")) 8 (Xa) IQadXa | da Oak (10.4.62) 


and the Dirac deltas in (10.4.62) lead to the restriction of H in (10.4.61) to 


H (QasXar9"sP*)|,-0.9,-alarpr) = A” (a",p*)- (10.4.63) 


We may also make the substitution 
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[] 6 (Qa - Qa (a*, P*)) dQa > (Isa ».] 


wet (55%) 


(10.4.64) 
by making use of a property of the Dirac delta distribution in one or more di- 
mensions. Remembering our choice of (Qa, Xa) a8 pairs in the set of canonical 
conjugate variables, we also have 


a 
= ®& 
dQ, 7 


a 


= {9X hp-n: (10.4.65) 


(see also (10.4.52)). 
That is, we may rewrite (10.4.62) simply as 


d™ p* 
Mx 
dp = (Ise. 5 (Xa) d® str] |det {,,va}ppld”a "Onn 
(10.4.66) 
Now we invoke the formal property of the invariance of 
[[¢@adxad"q*a™ p* (10.4.67) 


under canonical transformations to infer that with the choice of canonical 
variables (q,p), we may rewrite (10.4.61) as 


fawn) exp x i dr (v ‘q-H+ =u) (10.4.68) 
ee exp = ified dr— = exp = + (u(t) —U(t')), (10.4.69) 


in general, is a phase factor depending, however, only on the end points 
evaluated at t and t’ (see (10.4.21)), and the expression for U is dictated by 
the generators of the canonical transformation itself. Also dy is, now given 
by 


du = (IIse.090 »] |det {®,,Xa}p, B. | a” q ae (10.4.70) 


For our application described by the system in (10.4.44), (10.4.45), x| =0 
n (10.4.48), the generator W of the canonical transformation 


(Q,x,4",P") > (4,P) (10.4.71) 


for infinitesimal transformation is given by the generator 


W = —L3$a = — (q@p2 — pig2) a (10.4.72) 
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(see (10.4.55)) in the sense that for a ~ da > 0, 


6Q = {Q,-Ls}p_ = —a5a (10.4.73) 
etc., and the conservation law 
d 
qo {L3,H}p.p =90 (10.4.74) 


implies that U = 0 in this case. Also 


PQ+p*-q'=p-q (10.4.75) 
Q@t+q’@=¢ (10.4.76) 
P? +p" =p? (10.4.77) 


as a consequence of rotational invariance, and note the constraints ¢| = 
L3| = 0, x| = 0, as imposed by 6(¢)d(x) in the measure in (10.4.70). Using 
the notation in (10.4.44), we have 

p”? 
2m 


H (q,p)| = H* (a*,p*) = 5— + V(a™). (10.4.78) 


In some instances, as in the above example involving invariance under the 
canonical transformation, no phase factor in (10.4.68) arises. In other cases, 
such that when t — co, t! — —oo such a phase, if non-zero, may be unim- 
portant in describing the dynamics of the problem. We shall not, however, go 
into such additional details here. Finite canonical transformations, not just 
infinitesimal ones, is the subject of Problem 10.8.” 

Finally we show that the constrained path integral is invariant under the 
variation of the subsidiary constraints, i.e., when we make the replacements 
Xa > Xa — Xa = xX), with the x/, as new constraints. 

Quite generally, we may write 


bxa = {Xa,AsPa}pp + bap (10.4.79) 


with a summation over @ understood, for arbitrary bg since ®g| = 0 in I*. 
The coefficients ag are uniquely determined in I* in terms of the 5yq since 


{xa Peho p a| = bX (10.4.80) 


as a consequence of the property in (10.4.55). 
With ag®, as the generator of the transformation, we also have 


bPa = {Pa,49Pa}p 5 


” For a fairly detailed treatment of canonical transformations, cf. Sudarshan and 
Mukunda (1974). 
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= {®,,ag}p p Os + {Oo, Oa}, p a8 (10.4.81) 
or upon using (10.4.59), 
dO, = Ma, Py (10.4.82) 
where 
Mag aihs, ee + cag. (10.4.83) 


The general expression of the matrix [M,,] in (10.4.83) allows an additional 
linear combination of the ®g to be present in (10.4.81). We note that the 
elements of this matrix are infinitesimal. 

From the following 


{®q — 6a, x3 — Sxehpp.| = (day — May) {87,8 - xa}pp.| (10.4.84) 


we may infer that 


det {®, — 58a, x6 — 5xehp.p | = det (1 — M) det {®,, x3 — 5xa}p p | : 


(10.4.85) 
Also we use the property of the Dirac delta 
[] 6 (. - 5®.) = 118 5a — Mag) ®g) 
&(® 10.4. 
= gaa wy LD ane) 


to see that for ya > Xa — dX¥a = xX, we simply have the transformation 


(Is) 5(x2)) lee (®,-r3h0 


ae: (IIs.00)) ldct {2,, x5 bpp |: (10.4.87) 


Finally, we note that ag®, as the generator of the transformation 


d 
= | =a, } Oa 
(ai) 


=i0); (10.4.88) 


d : 
—AgPo + da®a 


dt 


For our earlier application in (10.4.44), (10.4.45), (10.4.48), 


5x = — (pi cosa + p2sina) a 


ae eee (10.4.89) 


hence a = —da, implying the invariance of the path integral under variations 
of the angle of rotation a. 
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10.4.3 Second Class Constraints and the Dirac Bracket 


In (10.4.31), the restricted Poisson bracket {o,Q8}5 5 | of a first class 
constraint ®g with all constraint functions 1g is zero. Otherwise a constraint 
function is called of second class and has been denoted by Wg, 6 =1,...,L, 
where L is necessarily even.® It is straightforward to define, self consis- 
tently, a new bracket f+, ae obtained from the Poisson bracket, such that 
15 Q},| = 0 for all constraint functions. This new bracket, obtained be- 
low and referred to as the Dirac bracket, has also other desirable properties. 

To the above end, note that since det {Wa, Vehop # 0 (see (10.4.32)), 
we may find a matrix c = [c,g] such that 


{Was Wel cs tay = Van (10.4.90) 


On the other hand, (10.4.36) gives 


0= wv, 


= {Ya H}p 5 | + {Va Vahp pps (10.4.91) 
which from (10.4.90), this leads to 


pp| = —cay {VA} pp | (10.4.92) 


Accordingly, for an arbitrary function F' of q(t), p(t), 


F| = ({F ijn —{F Va}p py cas{¥s, H}p 5 )|+noet Fh Patpp | 
(10.4.93) 
This suggests to introduce the modified bracket defined by 


{Hp ={o hen, — {) Volpp coslUar tee, (10.4.94) 


and simply write 


F| = {F, H},| + nef F, et ea (Qn } D — 0 (10.4.95) 


expressed in terms of the underlying Hamiltonian H of the system (see also 
(10.4.100)), where 
(HO) oe | = Py ezt |. (10.4.96) 


From (10.4.95), we see that it is of particular interest if one has only second 
class constraints since in this case FP’ | = { F, A } f 
For two functions F’, G of q(t), p(t), (10.4.94) implies that 


{FG}, a LE Geos = {F, Vahppcas{ a, Gp p. (10.4.97) 


For any two second class constraints functions U,, U,, (10.4.94) then also 
leads to 


8 See footnote 6 below (10.4.36) for a demonstration of this. 
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{Uy }5| = {Uy }ps| i {Wy, Volo p cos{Ua, Vy tps | 
=0 (10.4.98) 


and note that 
{®q,Og},,| = 0, {®q,Ve},| =0 (10.4.99) 


Finally, for any function F of q(t), p(t), 
{Parl }.| =, (10.4.100) 
and for all constraint functions Qg,, 
{Oe Op F5)= 0. (10.4.101) 


We leave as an exercise to the reader to check the basic properties of a 
bracket in (10.4.14)—(10.4.17) for {-,-},, as well. 


10.5 Bose Excitations 


We represent the Bose operators ap, al, defined in §6.1-86.3, §6.5, as 
follows 


(10.5.1) 


aBa 


d 
dpe 
al, > p*. (10.5.2) 


This representation is obviously consistent with the corresponding com- 
mutation relation in (6.1.13). 
The n-particle state (see also §6.5), may be defined by 


(8* | n) = Vn (B") = (10.5.3) 


with the vacuum state simply given by 
(8° |0) = Yo (B") = 1. (10.5.4) 

The normalization condition (§6.1) 
(n |m) = dam (10.5.5) 


and the explicit value of the integral 


i oO exp (—8"B) (Yn (8*))" Ym (8*) = bam 86) 


271 
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as obtained directly from the definition in (10.5.3), by working, for example, 
in polar coordinates (see Problem 10.13), allow us to introduce the identity 
operator 


— [ ARAB 66 | gxy (ge 

a= f SF Be oP 1pr) (or (10.5.7) 
and write 

(n | m) = (n|1|m) = dam (10.5.8) 


consistent with (10.5.6). 
We will consider, in general, any operator B, which as a function of al, 
ap, may be written in the form 


B= S~ Bam (ai,)" (ag). (10.5.9) 


From (6.1.31), (6.5.10) one easily gets 


(BLM) = > Bm tarp BV n, M — m) (10.5.10) 


with the restriction n < N, such that N —n = M — m, with 6(-,-) denoting 
the Kronecker delta. 
In the representation (10.5.1)—(10.5.4), 


* Pe\ (ao) (8')™ 
(0° |B 8 a> ut NBEO a 


* QI\N—n 
= 2 LAW 0, Mm) Bamana FAD 
N,M n,m (N —n)! 
= exp(6*0’) 5° Bam(8")"(8')™ (10.5.1) 


where in the first equality we used the completeness of the states |N), and 
in the last equality we have made a change of variable N — n — N, in the 
process. 

Upon comparison of (10.5.11) with (10.5.9) we obtain the rule 


* PV * AI 
(8° |B| 8) = exp(B"2) Blot 4+ app" ° (10.5.12) 
In particular, for the identity operator, this gives 
(3* | B™) = exp (8*8") (10.5.13) 


which is also directly verified from (10.5.3). 
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From (10.5.7), (10.5.13), we obtain the property 


de*d er 
|p") = | Stee Ber BB (10.5.14) 
271 
With (0|, denoting (n = 0], this leads, upon multiplying (10.5.14) by (0], to 
de*d Bia be 
i. 1 6-8 Behe — 1, (10.5.15) 
271 


This result may be also obtained by expanding exp (6*p), on the left-hand 
side of (10.5.15) and then using (10.5.6). 
Other useful relations which readily follow from (10.5.14) are 


dB"d8 a6 8% g\n — (yn 
i, se Be (8)" = (p) (10.5.16) 
/ ABTA 8° BS" reba" _ oo" (10.5.17) 
271 


which in turn generalize (10.5.15). 
From (10.5.7), we have the following representation for an operator B 
obtained by writing B = 1B1, 


dp*dB dads" —(8"a+8"*6") |3*) (6*| B |e") (6"|. (10.5.18) 


271 2Ti 


B= 


For the product of two operators B,, Bz this gives 


aah / OOP AAP ("8488") 19) (6*| Bar |B) (B*| (105.19) 


271 271 


where 


_ [dBi ara, 


271 


(B"| Bas |B") (B"| Bo |G) (Bj| Bi |B") (0.5.20) 


with the rule given in (10.5.12) for computing the matrix elements on the 
right-hand of (10.5.19). 
In the sequel we use the notation 


(B"|_B|B") = B(B", 6’). (10.5.21) 


For a Hamiltonian H (al, ap;t), the time evolution operator U(t,t’), 
t’ — tis then from (10.5.12), (10.5.18), by repeated applications of (10.5.19), 
(10.5.20), given by 


U(t,t’) = / 97d AG" AA" 5 (8° 8-+8""8 1g) (a™| UB", B'st,t!) (10.5.2) 


271 271 
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where 
N-1 
‘ dG d Gr, 
* I, PN: oa l k 
U(6", 5; t,t’) eof (I = 


N 2 N-1 
x exp (>: (Gibe-s — peHt(Ge, sits) - > *%) 
ioe = (10.5.23) 


and 64 = 6, Bo= 6’, & =U+(k—1/2)e, k=1,...,N, t-# =Ne. 
We may add and subtract the term {%,6y in the exponential in (10.5.23) 
to obtain in the limit N — oo 
B* (t)=8* 


ver ate) = ff G0), ew(5"(0), 0) 


x exp jar (ina* (7) B(r) — H(8*(r), B(n); 7)) . (10.5.24) 


The exponentials in (10.5.24) may be combined in the form 


+ pt 
exp | dr (ihB*(r)B(r) — ihB* (7) B(7)5(7 — t+ 0) — H(B*(7), B(7);7)) . 
h t’ 
(10.5.25) 
As an application, we consider the Hamiltonian 


(ay, ap;t) = hwaljap — F(t)(ap + al) (10.5.26) 


(see also (6.2.2)), where for simplicity we have not included a zero point 
energy. 

For the Hamiltonian in (10.5.26), one is dealing in (10.5.23)/(10.5.24) 
with Gaussian integrals. These may be carried out directly from (10.5.23) by 
a method which will be explicitly worked out in detail for the corresponding 
case with fermions in the next section (see Problem 10.14). They may be also 
carried out by the method applied in $10.2 by considering the deviation of 
B*(r), B(r) from their classical solutions with boundary conditions 6*(t) = 
B*, B(t') = 2" by using, in the process, the expression in (10.5.24). These give 
the following explicit form 


2 t 
U(B*, B';t,t') = exp (B* pe") exp (je | dr ele) 
t’ 


; t 
x exp Gai dr owrle(r)) 
t’ 


1 t Fe ; ; 
x exp (-z/ ar | ar! F(a) Fe) (10.5.27) 
t’ if 
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With F(7) assumed to vanish outside the interval (t’,t), we may introduce 
the Fourier transform 


F(w) = / dr el" F(r) (10.5.28) 
and using the notation 
y= if dr i dr’ F(r)e 0-9) @ (7 — 7')F(r’) (10.5.29) 


we obtain from (10.5.27), the following expression for U(t,t’) in (10.5.22), 


* he Qt 
U(t,t') ew [ ISB AB*AE (6° 04+0"*6) oxy (arste™(t-*) 


271 271 


x exp (if*e" F(w)/h) exp (ip%el*" F*(w)/h) |B") (31. 
(10.5.30) 


In particular, this gives 
Y 
(0 |U(t, t’)| 0) = exp (-3) (10.5.31) 


(compare with (6.2.12)), where we have used (10.5.4) and the explicit inte- 
gral (10.5.15) which gives unity for the integral corresponding to the measure 
dG*d8/27i followed, again, by one for the integral corresponding to the mea- 
sure d3’*d@" /27i. 


For (n |U(t, t’)| 0), the d6’*dp’/27i-integral gives one, and hence 


= Rh? de*dé =g* + Q* —iw (8)” 
(n |U(t,t’)| 0) = e77/ [oe exp (iste MF (w) /2) Te 
iF —iwt /f, se 
= LF@e t/a" eh (10.5.32) 
vant 
where we have used (10.5.3) and (10.5.16), which is to be compared with 
(6.2.22). 
Finally, we derive a generating function which allows the determination 
of all the matrix elements (n |U(t,t’)| m). 


To the above end, we multiply (10.5.30) form the left by ((ct)"/vnl) (nl, 
from the right by |m) ((c)"/vinl), and sum over n,m = 0,1,..., to obtain 


S- Gals (n |U(t, t’)| m) ee =e / dé"dp de" dp" -(s"p+8'"6") 


] 271 271 


S 
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x exp (« (eens) ai 5o™"FW)) J 


x eho be exp G (peer ))) ; 
(10.5.33) 


The integrals may be carried out exactly thanks to the integral in (10.5.17) 
to yield 


S a (n |U(t, t')| m) an = au exp (etclewtt-#9) 


x exp Ga were) exp (Goer) .  (10.5.34) 


The matrix elements (n |U(E, t’)| m) are then obtained from (10.5.34) by 
carrying out n derivatives with respect to c* and m derivatives with respect to 
c’, then setting c* = 0, c’ = 0, and dividing by Vm!n!. This is to be compared 
with the expression for (nt| mt’) in (6.2.30) in the present notation. 


10.6 Grassmann Variables: Fermi Excitations 


In this section, we introduce anti-commuting c-variables, referred to as 
Grassmann variables, and learn how to differentiate and integrate with re- 
spect to such variables. Finally Grassmann variables are used to develop 
path integrals to describe Fermi-particle interactions with external sources. 
We consider in turn real and complex Grassmann variables. 


10.6.1 Real Grassmann Variables 
We introduce v anti-commuting real variables 71,..., 7y: 
{1j. nk} = 0 (10.6.1) 
j,k =1,...,v. Equation (10.6.1), in particular, implies that 
ni, = 0 (10.6.2) 


fork =1,...,v. 
As a result of (10.6.2), we may expand a function of (m,...,m) = 1 
involving at most the product of the v distinct Grassmann variables: 


FO) = FO feat he (10.6.3) 


k=0 21,..-,2% 
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where for k = 2,...,v, the coefficients cae 


;, are totally anti-symmetric in 


Uysesy Lk. 
The left-hand derivative with respect to a Grassmann variable 7; is defined 

by 

) ts 

15 
where 6; denotes the position of 1; in the product 7, ---7;-+- 7, from the 
left, and the variable 7; is omitted on the right-hand side of (10.6.4). If the 
former product does not involve n;, the right-hand side of (10.6.4) should be 
replaced by zero. For example, note that 


O 
On; J J 
and 


a) 
{an} ik (10.6.6) 


Similarly the right-hand derivative with respect to a 7;, denoted by 0/0n;, 
pas 


is defined by i 
May Tin BE = (1937 ni, iy (10.6.7) 
gat 


where 6; denotes the position of 7; in the product 7, ---7j +++, from the 
right, etc. 


For a fixed k in {1,...,u}, because of property (10.6.2), one has to inves- 
tigate the meanings of only the following two integrals: 


/ dine, / aie (10.6.8) 


Assuming translational invariance of the integrals for nm, — + Qrn = 
where ax is another Grassmann variable, which anti-commutes with nz, we 


obtain 
an = om (10.6.9) 


= om Nk (10.6.10) 


for arbitrary a,. Hence we conclude that 


fom =0 (10.6.11) 
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for any k in {1,...,v}. 
The second integral in (10.6.8), defined as a c-number, may be normalized 
arbitrarily, and we choose it to be 


om Nk = V20 = -[m dn (10.6.12) 


for any k, where in writing the second equality we used the fact that the 
nature of 7, as a Grassmann variable requires that dn, also anti-commutes 
with the Grassmann variables to make the second integral in (10.6.8) a c- 
number. The latter is consistent with the definition of the differential operator 
d= >> dn,.0/0m, satisfying the rule 

k 


d (ning) = dn ng + dN; (10.6.13) 
and, on the other hand, one has explicitly from (10.6.5) that 
d (nin;) = dy nj — Inj 1% (10.6.14) 


which upon comparison with (10.6.13) implies that 
{dni,n;} =0. (10.6.15) 


From the rule given in (10.6.12), we may define the multiple integral 


an m [an m fo fan Ny = (2r)°/? (10.6.16) 


fans fan Tym = (20)/?. (10.6.17) 


A function f(n,) of n, in {71,..-,7)} may be written as 


or 


f (7%) = co + em (10.6.18) 


where co, c are c-numbers. For any other Grassmann variable a,, which anti- 
commutes with nz, we have 


if dine (mu — ox) f(a) = VIE (co + co) 


= V2rf (ar) (10.6.19) 


which allows us to introduce the Dirac delta, in this context, given by 


(10.6.20) 
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The reader is asked to derive some of the immediate properties of 5(7,) that 
follow from (10.6.20) in Problem 10.15. 

From the rule of integration in (10.6.12), (10.6.11), we may then introduce 
the representation 


d 
S(m—ax)= | fe PR oipx(m—on) (10.6.21) 


271 


as is easily checked, where px is a Grassmann variable which, in particular, 
anti-commutes with nz, az. 

Since dn, 6 (7% — @,) commutes with all Grassmann variables, we have 
the useful property 


fan 6 (m — a1) +++ dy 5 (my — ay) (m)™* +++ (Mv) 


= (a1)"" +++ (ay) (10.6.22) 
where €; = 0 or 1, (-)° = 1, 
foo -++ (My)? da’ 54 (MW — &) = (a1) + (ay)? (10.6.23) 
and 
d°q = di - ++ djy—1dm (10.6.24) 
6% (1) — &) = 6 (My — Gy) 6 (My—1 — y—1)--- 8 (mM — a1) (10.6.25) 


and note the different orderings in d’n and 64(n — «). 

By using the property that a product of two Grassmann variables 
Pro, commutes with all Grassmann variables, we may infer from (10.6.21), 
(10.6.25) that 


a d4P ip 
64(n — a) = / aie pune) (10.6.26) 
where 
d%4p = dpydpy-1--+- dpi. (10.6.27) 


For an arbitrary function f(n), as given in (10.6.3), we define the trans- 
form 


= pro dn’ eiem, (10.6.28) 


Upon multiplying (10.6.28) form the right by (d%p/(27i)”) exp(—ip +n) and 
integrating we obtain 


) ae —ip-n _ = f fa’ d’n , dye) eip-(n’—n) 
[io (27i)” : J (27i)” 
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= f fla) arn’ 54(n/ =n) (10.6.29) 


or 


a 
n)= f F() oar -ien (10.6.30) 


where we have used (10.6.23), (10.6.22), (10.6.26). 

Finally we note that the rule for the change of integration variables in 
the evaluation of an integral turns out to be quite simple but surprising. 
Consider the change of variables 71,...,y — Q@1,-..-,Qy defined by the linear 
combination 


j = Cire (10.6.31) 


where the Cj, are c-numbers, and qj,...,a@, are Grassmann variables. To 
the above end it is sufficient to consider the integral of the product 7,..., 7. 
In this case, we have 


fan Meet = [rea J Ck, Ok, . Cok, Xk, (10.6.32) 
where J is the Jacobian of the transformation to be determined. Since 
Op, * Up, = A -+ yerr he (10.6.33) 


where ¢*1'"*» is the Levi-Civita symbol equal to +1, —1 if {k,,..., ky} is and 


even, odd permutation of {1,...,v}, respectively, and equal to zero if any 
two or more of the indices ky,...,ky are equal, and 
Cip, + Cone" * = det C (10.6.34) 


we immediately obtain from (10.6.32) that 
= (det C)~" (10.6.35) 


being the inverse of det C (!). 


10.6.2 Complex Grassmann Variables 


Out of two real Grassmann variables ap, ay satisfying {aR,a;} = 0, we 
may define a complex Grassmann variable a = ag+iay. Immediate properties 
which follow from this definition are 


{a,a}=0, f{a,a*}=0, {a*,a*}=0. (10.6.36) 
Also imposing a reality restriction on the product 
a*q = —2iajaR (10.6.37) 


implies that 
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(azaR)* = —ajaR 
= aRaI = —azaR (10.6.38) 


where in writing the second equality we have used the anti-commutativity 
of az, aR, and in the last one, we have used the reality condition of these 
variables. 

From (10.6.12), and upon writing 


[eonan = v2n = ( fear an) =~ f(dax)* ai = f(dan)* on 


(10.6.39) 
we may infer that 

(dar)* _ —dar (10.6.40) 

and similarly 
(day)* = — (day) . (10.6.41) 

From the definition 

da = dar + iday (10.6.42) 

it follows that 
(da)* = —da*. (10.6.43) 


Also from {da*,a*} = 0 and 


(f dae)’ = foe (any =~ f ae 
( [aa a) - [acer (10.6.4) 


To define integrations over complex Grassmann variables, we first note 


that for consistency with (10.6.11), (10.6.42), (10.6.43) 


one obtains 


Joo =0 (10.6.45) 


Also to obtain a consistent definition of the integral in (10.6.45), we consider 
the transformation of variables ap,ay > a*,a: 


e 7 ; (; = ee (10.6.46) 


define the Jacobian of the transformation given in (10.6.35) 


i ye ot: 
y= (act 5 (| =) =i (10.6.47) 
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and use the fact that arap = aa*/2i to infer that 


Qn = [eax daz ay aR = [aera (2i) (aa* /2i) (10.6.48) 


or 


da* da 
ete a 10.6.49 
V2n V2 ( ) 


Hence a consistent definition with (10.6.45) is 


da* da 
—o* =1= | —=a. 10.6.50 
V2 V 20 ( ) 
For integrations over v complex Grassmann variables a 1,...,Qy, satisfy- 
ing 
{aj,an}=0, {ai,az}=90, f{az,az,}=0 (10.6.51) 
consider the integral 
dajd da*da; 
I =| ie Raaese PW cae ES exp (—a; A;;a;) (10.6.52) 
27 27 


where Aj; are c-numbers. The commutativity of a} Aja1, for example, with 
all the Grassmann variables, and so on, allow us to rewrite the integral J as 
(since a? = 0,...,a2 = 0) 


os da; (—a%, Aj,101) da, da* (—a% Aj, vv) da, 
5 / 20 _ 20 


* xk * * : 
da; OF, Aj da*, as Ajyu 


Vir Vv2n 


da; dax , ? 

= = OR a Aju ++ Ajyt- (10.6.53) 

Upon using the properties 
GG, So eae (10.6.54) 

and oe 
ge AS fant Ajit = det A (10.6.55) 

we obtain 
dajd dada, 

/ oa bed —— exp (—a*Aj;a;) = det A. (10.6.56) 


For det A 4 0, (10.6.56) gives rise to another useful integral upon making 
the change of variables a; > a; + 9j;, aj > aj +7;,j =1,...,v where the 
7, 7; are Grassmann variables. This leads, from (10.6.56), to 
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pas da*xday, 


exp — (a; Ajja; + a7 b; + CFa;) 


I 27 
= (det A) exp (c}.A;,"b;) (10.6.57) 
where 
by = Ain; (10.6.58) 
cf =f Ay = (AZ) of. (10.6.59) 
For A a Hermitian matrix, 


10.6.3 Fermi Excitations 


We represent the Fermi operators ar, al, defined in (6.4.1)—(6.4.3), as 
follows 


d 
ap —> —— (10.6.61) 
da* 
al = a* (10.6.62) 


where a* is a Grassmann variable. The representation (10.6.61), (10.6.62) is 
obviously consistent with the anti-commutation relations (6.4.2), (6.4.3). 
The vacuum state and the single particle state may be then defined by 


(a*|0) = Po(a*) = 1 (10.6.63) 
(a*|1) = Y1(a*) = a* (10.6.64) 


where we recall that (a*)? = 0. We may rewrite (10.6.63), (10.6.64) in a 
unified notation 


(a? (n= dh (a") =(e)" > - = 051. (10.6.65) 


From (10.6.56), (10.6.50), we have 


| re exp (—a*a) (Wn (a*))" Wm (2*)) = Sm, n= 0,1. (10.6.6) 


The identity operator is defined by 


l= / sR exp (—a*a) |a*) (a*| (10.6.67) 
27 
and 
(n | m) = (n|1|m) = bnm (10.6.68) 


according to (10.6.66). 
An operator F’, which is a function of a', a, has the general structure 
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PS Se Cantal yl Cay" (10.6.69) 


n,m=0,1 
where the Cy, are c-numbers. It is easy to see that for N, M =0,1, 

(N |F| M) -S~ Cam 6(N —n,M —m) (10.6.70) 
where the summation is over all integers n, m,0<n<N,O0<m<M such 
that N-n= M—m. 

In the |N), N = 0,1, basis 


F= S° |N)(N|F|M) (M| (10.6.71) 
N,M=0,1 


with (N |F'| M) given in (10.6.70). In the representation (10.6.61), (10.6.62), 


(a*|Fla’*)= S> (a*)% (NFM) (a’)™. (10.6.72) 


N,M=0,1 
The latter is explicitly worked out, from (10.6.70), to be 
(a* |F| a’*\ = Coo + Co1a + Cioa* + C0"! + Cooa* al 


=> (Coo t Coa" t Ci9a* t Cy a*a"') exp (a*a’). (10.6.73) 


Comparing (10.6.73) with (10.6.69) we have the rule 
(a* |F\a’*) = exp (a*a’) F | +_,.» = F(a*,a’). (10.6.74) 


From (10.6.67), (10.6.72), (10.6.74), we have the following convenient rep- 
resentation for the operator F' obtained by writing F = 1F 1, 


dis ES da*da = 


(exp — (aa + 9"8)) |a")F (a*, 8) (B"|.  (10.6.75) 
For F' the identity, (10.6.74) gives 
(a®* | a’*) = F (a*,a’) = exp (a* a’) (10.6.76) 


coinciding with the expression obtained from the use of the completeness 
relation 


(a* |a’*) = SU (a* | n) (n | a’*) 


n=0,1 


I 


S~ (a*)"(a’)" =1+a%a' = expa*a’ (10.6.7) 


n=0,1 
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as expected. 
For two operators F>, F, which are functions of a!, a having the general 
structure in (10.6.69), we have directly from (10.6.75), 


Fahy = f Bee oxy — (ara + 8B) la") Fas (0°, )(0" 


27 27 
(10.6.78) 
where, using (10.6.75), (10.6.76) 


d md ‘d rq 1 
Fas (a",p)= f SE (exp — (aia + "6" - pa’) 
x F(a?) Fi(a!*, 8) (10.6.79) 


or upon carrying out the elementary integrations over 3’* and a’, we obtain 
in a convenient notation 
* dajdA, * * * 
Foi (a3, 80) = ) eS (exp —a7} 1) Fy (a3, G1) Fi (a7, 80). (10.6.8) 


For a Hamiltonian H(a',a;t), the time-evolution operator U(t, t’), t’ — t, 
is then from (10.6.74), (10.6.75), by repeated applications of (10.6.80), given 
by 


U (t,t) = 


joe de*dp (exp ee a) ja*)U (a*, 6; t,t") ("| 
27 Qn 


(10.6.81) 


N ; N-1 
x exp (>. Cus = cH (a4, ast) = oi] 


k=1 k=1 


and a%, = a*, 69 = 8, th =U +(k—-1/2)e,k=1,...,N. 

Of particular interest is the vacuum expectation value (0|U (t,t’) |0). 
Since (0 |a*) = 1 = (G* |0), we may immediately carry out the a, 3* inte- 
grations. This in turn leads to the explicit evaluation of the a*, 2 integrals 
in (10.6.81) and we may effectively set a* = 0, 69 = 0 in (10.6.82) recalling 
that we are integrating over Grassmann variables. That is, 


= ee 
(OU (t.t)10) = Jim [ (I soit) 
k 


= 
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cord (5 (0 i — 1) Bri sa (Gx — Br—1) tH (xi, ust) ) 
(10.6.83) 


where ay = 0, Bo = 0. 
Upon taking the limit N — oo in (10.6.83), we obtain 


a*(t)=0 


ow eeo= f(a) .aO)es5 far (5 lo atm 


t! 


‘)=0 
—a* (7) B(r) ] — H(a* (7), 8 (7) 1)). (10.6.84) 
As an application, we consider the Hamiltonian 
H (af. ar; = = hw (t) aap — n* (t) ap — alin (t) (10.6.85) 
(see also (6.4.11)), where w(t) is, in general, a time-dependent c-function, and 
n*(t), n(t) are time-dependent Grassmann variables, where for simplicity we 
have not included a zero point energy. 


The sum in the exponential in (10.6.83) or (10.6.82), with a3, = 0, 69 = 0, 
may be then written as 


N-1 N-1 ; N-1 
d kU —1 — > ah Ge + ge 2, (of (x41) Be + axn (i))  (10.6.86) 


where 


Yk = 1 —iew (tx) . (10.6.87) 


The expression in (10.6.86) is conveniently rewritten as 


vam: ay, Ags 3; = azdr = Ce BK (10.6.88) 
where j,k =1,...,N—-1, 
| 1. 070 0| 
eyo (L.A 
0 =F. 1 
A=[Ajj] = a (10.6.89) 
1 O 
0 0—-yn-11 


det A = 1, and 
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by =—Zen (te), oh = —Zen* (fess) (10.6.90) 


The inverse of A is given by 


1 0 0 0 
72 1 . 
At= "3 9 (10.6.91) 
1 0 


(ya'3°*: YN-1) (¥8Y4°*: YW-1) «°° (Yw-1) 1 


From (10.6.57), we then have for (0|U (t,t’)|0) the expression 


k 
Nim exp | 75 * (teaa ) Larges “yen (£5) (10.6.92) 


where, needless to say, y; = 1, and for 7 = k—1, replace ypy~ by Yr, and for 
j =k, replace yr+17% by one. 
For ¢ ~ 0, we may rewrite 


Vj+1--+:Vk ~ exp —ie (w (tj 41) ao wy (tx)) 


= exp —ie (w (t2) eee tw (tx)) exp le (w (t2) tere +wW (t;)) : 
Hence upon taking the limit N — oo in (10.6.92) we obtain 


(0|U (t, t’)|0) = exp (-= i: dr | dr’n* (7) ei2(7.7') (r-—7')n (r)) 
(10.6.94) 
where 


OG =} ad wir) (10.6.95) 


In (10.6.94), we assume that 7*(7), 4(7’) vanish outside the interval (¢, ¢’). 
For w(t) = w, we obtain the expression in (6.4.18) in the present notation. 
A very similar procedure as above (see Problem 10.16) shows, quite gen- 
erally, with boundary conditions a*(t) = a*, G(t’) = G, for the Hamiltonian 
n (10.6.85), that U(a*, 3; t,t’) is given by 


ho : : 1 
U(a*, 3; t,t’) = exp (ampere ) exp (ers) exp (;123) exp (-7) 


(10.6.96) 


where 
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hee pau n(r)e 27) (10.6.97) 
t 


i= ff dr n*(r)eWir#) (10.6.98) 
t! 
and Q(7,7’) is defined in (10.6.95). 

Upon using (n |a*) = (a)”", (6* |m) = (6*)™ with n,m = 0,1, in com- 
puting all the matrix elements (n |U (t,t’)|m) of U(t,t’) in (10.6.81), and 
carrying out the elementary integrations over the Grassmann variables a”, 
a, B*, @ we obtain from (10.6.96) (see Problem 10.17) 


(n|U(t,t’)|m) = |6nodmo + Snodmi sPs 4 indo P| enh? 


Pi 
+5n1 51 (ee) + |e ~/F (10.6.9) 


The reader is urged to show that for w(t) = w, (10.6.99) immediately reduces 
to the corresponding expressions in (6.4.20)—(6.4.22). 


Problems 


10.1. Follow the procedure given in §10.2 in deriving the path integral for 
the time-independent potential —F'x to derive the corresponding ex- 
pression for the time-dependent one —F'(t)x, and compare your result 
with the one given in §9.4. 

10.2. Verify that the integral equation in (10.3.17) leads upon taking the 
limit ¢ — 0 to the Schrédinger equation in (10.3.18). 

10.3. Carry out the multiple Gaussian integrals over the momenta in 
(10.3.27), for velocity independent potentials, to obtain the path inte- 
gral in (10.3.28) in coordinate space. 

10.4. Investigate the nature of the path integral of a free particle moving 
along a circle of fixed radius a as a non-constrained problem, i.e., by 
writing the Laplacian V” directly as (1/a?) 0?/0¢?. 

(i) Carry out a Fourier expansion of 6 (¢ — ¢’) and show that 


N—-1 2 


ma? we 
(gt |9't") = lim (sary) & = 00 I {> 


_m 
xexp | i So (byt — 65 + 2aky)” 
j=0 


ky=—06 
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where 6n = ¢,¢0=¢. 
(ii) By using the Poisson sum formula, and by appropriate change of 
variables in part (i), show that 


[o.e) 


(gt |¢'t') =a S$” (a(o+2rk),t |ad’,t’), 


k=—0o 


a(d+2rk) 


iy ss / PZ (ad (-)) exp i, [ (900) 


k=—0o ad! 


where (2, t|x,t’) is the free Green function. The expression above 

is an infinite sum of one dimensional amplitudes on the real line 

each characterized by a winding number k. Interpret this result. 
10.5. Consider the following formally Hermitian operator 


p ih 


2,2 
V(x) = 
2m rere ae Dam” pre) Ee 


where a? > 0 is a constant. We note that the operator p stands on the 

right-hand side which explains the notation Hr. 

(i) Show by using [z, p] = if, H may be rewritten with p standing on 
the left-hand side as 


Pp 1 2,2 
H= | | = Ay. 
2m a 2a2m? © Ve) " 


(ii) Carry out the path integral for this system with a velocity depen- 
dent potential as described in §10.3, to conclude that classically 
with x and p as c-numbers now 


1 2 
H, = =(Hp + Hy), =(1+5 +V(a). 
2 a? 


2m 
(iii) Integrate explicitly on the momenta of the corresponding functions 
to show that for the amplitude (a, t|x’, t’), the resulting integrand 
for the corresponding one in the exponential in (10.2.18) is not 
simply given by the classical Lagrangian. Here you will encounter 
a 6 (0) singularity well known in some field theories. 

10.6. Verify explicitly the properties of the Poisson brackets in (10.4.14)— 
(10.4.17). 

10.7. Investigate the nature of the transformation of the amplitude 
(qt |q’t’), with no constraints present, under a finite canonical trans- 
formation via a generator W. 

10.8. Show that for a theory with first and second class constraints ®g, Va, 
respectively, that from 


10.9. 


10.10. 


10.11. 


10.12. 


10.13. 
10.14. 


10.15. 


10.16. 


10.17. 


10.18. 
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{Dq, 0} = 02,8, + 07,We 


one may infer that be. = 0. 

(Hint: Use the Jacobi identity and (10.4.32).) 

For a function G (q(t), p (¢)) write down explicitly the expression for 
G = {G,Hr},,, where Hr is given in (10.4.35) and integrate the 
latter over ¢ in the neighborhood of ¢ = 0 for a given initial condition 
G at t = 0, to show that G depends, in general, on the undetermined 
Lagrange multipliers nq at t = 0. 

For the transformation q1, 92, q3,P1,P2,p3, H — qf, Q,¢, pi, P, ps, A, 
in (10.4.51), with a a constant, 

(i) show that 


Q=0H/OP, P=-0H/0Q, 
Gj =OH/Op;, = pi =—OH/Oq;, i= 1,3. 
(ii) Show in reference to (10.4.45), (10.4.48), 


6) 
eens = a 


For a free particle moving in a circle with primary constraint ¢1| = 
eg +y? — a”)| = 0, study the properties of the constraints arising 
from this apparently simple system. 

Investigate the nature of path integrals in the presence of second class 
constraints and their properties under variations 5y,, of the subsidiary 
constraints. [Ref.: Senjanovic (1976).| 

Establish (10.5.6) by working, for example, in polar coordinates. 

By a procedure similar to the one used for Fermi excitations in §10.6, 
carry out explicitly the multiple Gaussian integrals in (10.5.23) to 
obtain (10.5.27), for the Hamiltonian in (10.5.26). 

Derive some properties of 5 (7, — ax) for Grassmann variables as fol- 
low from (10.6.20). In particular, what is 6(0) for such variables? 

Use a procedure similar to the one given through (10.6.86)—(10.6.95) to 
derive (10.6.96) with the boundary conditions a* (t) = a*, 6 (t’) = 8B. 
Use the expression (10.6.96) in (10.6.82) to explicitly carry out the 
integrals over the Grassmann variables a*, a, 8*, 3 to obtain all of 
the matrix elements (n |U (t,t’)| m), as given in (10.6.99), with n,m = 
0,1. Verify that (10.6.99) reduces to the corresponding expressions in 
(6.4.20)—(6.4.22) for the case w(t) = w. 

Verify that (10.5.27), (10.6.96) may be obtained directly by substitut- 
ing the classical solutions (* (rT) ,G.(7)), (a% (7) , 8. (7)) in (10.5.24), 
(10.6.82) for the Hamiltonians in (10.5.26), (10.6.85), respectively. 
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The Quantum Dynamical Principle 


The quantum dynamical principle (QDP) provides a formalism for quan- 
tum physics which is powerful, easy to apply and is most elegant. In the 
form presented here, it gives rise to an expression for the variation 5(at | bt’) 
of a transformation function (at|bt’) from a B-description at time ¢’ to an 
A-description at time ¢, as arising from any changes made in the parame- 
ters of a Hamiltonian such as of the underlying masses, coupling constants 
(charges,...), prescribed frequencies, external sources, as introduced in 86.2, 
86.4 and also encountered in $10.5, §10.6, and so on. Typical transformation 
functions considered are (qt|q/’t’) and (qt| pt’) written in the (q, p) language. 
The subsequent analysis of the expressions for (at | bt’) provides then endless 
applications to all aspects of quantum physical problems. 

One advantage of the QDP approach over the path integral one, is that 
the former is based on carrying out functional differentiations in the theory, 
while the latter involves in carrying out an (infinitely uncountable) multiple 
functional integrals and it is relatively easier to functionally differentiate than 
to deal with continual functional integrals. The formal equivalence of both 
formalisms is, however, established in §11.4. 

The QDP is entirely due to Schwinger,' and remains to be an extremely 
powerful tool in the development of quantum physics and quantum field the- 
ory.” 

The purpose of this chapter is to show how to do quantum physics by 
using variations $(at| bt’) of transformation functions in a systematic way. 
In §11.1, we derive the QDP and obtain explicit expressions for transforma- 
tion functions in §11.2. So-called trace functionals are introduced in $11.3 
which, in particular, provide useful information on the spectra of Hamil- 
tonians. §11.4 deals with the connection of the path integral formalism to 


' Schwinger (1951a, 1953, 1960c,b, 1962); Lam (1965); Manoukian (1985). 

? For the reader who has some familiarity with field theory see, Manoukian (1986a, 
1987b); Manoukian and Siranan (2005), on how the QDP solves the quantization 
problem of the present gauge theories of the fundamental interactions of physics. 
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the QDP one. Bose/Fermi excitations within the light of this formalism are 
treated in $11.5. Finally, §11.6 develops an extension of the QDP, referred to 
as the closed-time path formalism of Schwinger,* to obtain directly expecta- 
tion values of observables at any time in an initially prepared state without 
first determining the underlying amplitudes. We will encounter applications 
of the QDP again later, notably, in scattering theory in Chapter 15. 

It took years before the path integral formalism was widely used after 
it was conceived.* Old habits die hard, however, we expect that the QDP 
formalism will be also widely used in the near future not only as a practical 
way for computations but also as a technically rigorous method for doing 
quantum physics. We hope that this chapter will have some contribution to 
this end. 


11.1 The Quantum Dynamical Principle 


Consider the general Hamiltonian given by 
H(t, ) = Hy (t) + Aa(t, A) (11.1.1) 


where H,(t), Ho(t, 4) may be time-dependent but H2(t, A) may, in addition, 
depend on some parameters denoted by A. Here stands for any parame- 
ters such as masses, coupling constants (charges,...), prescribed frequencies, 
external sources (such as, e.g., in §6.2) and so on. The time dependence in 
H(t, A) is assumed to come from a priori given time-dependent potentials 
and/or external sources (see, e.g., §6.2). 

The time evolution operator associated with the Hamiltonian H(t, A) will 
be denoted by U(#, A). From (A-2.5.2), (A-2.5.11), in the Appendix to §2.5, 


we have a 
ih Ut, A) = H(t, A) UE, A). (11.1.2) 
For the theory given in a specific description, say, the A-description (see 


§1.1, (9.1)-(9.4), (10.2.3)-(10.2.5)), 
itl 
ins, (at| = (at| H(t,d). (11.1.3) 


One may also work with the Hamiltonian H(t), introduce the correspond- 
ing time evolution operator U;(t), which are independent of X, satisfying 


d 
ih—Ui (1) = H(t) Ui(t) (11.1.4) 
3 Schwinger (1961a). 
4 This is not to mention of the much more dramatic situation involved with the 
Lagrange and the Hamilton formalisms for the time elapsed before they were 
broadly used in classical mechanics after they were formulated in turn. 
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and set, 


_d 
in (atl = ,(at| H(t), (11.1.5) 


The theory, however, is described by the Hamiltonian H(t, A) in (11.1.1) and 
the time evolution operator U(t, d). 
The physical states (at| are clearly related to the states ,(at| by 


(at| = ,(at| Uf (t) U(t,»). (11.1.6) 


This suggests to introduce the unitary operator 


V(t, d) = Ul (t) U(t, A) (11.1.7) 
to rewrite (11.1.6) as 
(at| = ,(at| V(t, d). (11.1.8) 
We note that r 
inTVitrA) = U} (t) Ho(t, 4) U(t, A) (11.1.9) 


where we have used the definition in (11.1.7). 

We are interested in studying the variations of transformation functions 
(cf. §1.2, §1.4) (at| bt’), with respect to the parameters 4, from, in general, 
one description to another. 

To the above end, we use the identity 


ine [VEN VITA V(r, XVI, YY] 
= V(t,d) [Ut 4) (Ha(r, 0") — Ha(r,)) Ur, ¥)] VEE) 


VEN [Ur(r, d) (H(r,’) — H(z, A) U(r, x) vi(tl,’) (1.1.10) 


where we have also used (11.1.7), (11.1.9), the unitarity of U;(7) and finally 
(11.1.1). Here X’ # X, in general. 

We may integrate (11.1.10) over 7 from ¢’ to t, and use the unitarity of 
V(t, A) 


V(ii,AVT(E,A)=1, VIG,A)V(E,A) = 1, (11.1.11) 
evaluated at equal times and for identical parameters, to obtain 


[Vex vtwe,») - Vibrate, )| 


= V(t.) | [ar 0) (HOX) ~ HOA) UC ) vii), 
(11.1.12) 
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Now we set \’ = \+ 5A, to get the variational form of (11.1.12) given by 


5 [vi viel, | 


e t 

= —-V(t,) | dr U(r, A) 8 (7, 4) U(r, d)| Vit, A). (1.1..1.13) 
t’ 

Using the (q,p) language, as in Chapter 10, suppressing the indices speci- 

fying the various degrees of freedom, the Hamiltonian H(7, A) may be written 


as 
H(t, X) = H(q,p,7, A) (11.1.14) 
5H (r, X) in (11.1.1) refers to the change of H(7, A), with respect to A, with 


q, p (and Tr) kept fixed. 
We define a Heisenberg representation of H(r, X), at time 7, by 


IH(7, A) = Ul(r, A) H(q,p,7; A) U(7, A) = H(q(r), p(t), 73). (11.1.15) 


This allows us to rewrite (11.1.13) as 


5 a d) vice’, )| 


= ~2V(t,2) [ar 5H(q(7), p(T), 73) | VI, A). (11.1.16) 


provided the variation 5 of H (q(r), p(r), 7: r), with respect to X, is carried 
out with g(r), p(7) kept fixed, in conformity with (11.1.13), since q(r), p(r), 
given by 


a(t) =Ul(7,A)qU(t,»), p(t) = UT (7, \) pU(r, d) (11.1.17) 


will, in general, depend on X. The g(r), p(r) are Heisenberg representations 
of q, p. 

Now we take the matrix elements of (11.1.16) with respect to , (at|, |bt’).,, 
and use (11.1.8) to obtain 


d(at| bt’) = -; [a (at |5H (q(7), p(T), 7; d) |bt") F (11.1.18) 


This is the celebrated Schwinger’s dynamical (action) principle or the 
quantum dynamical principle. It is expressed in terms of the physical states 
|at), |bt’) which depend on X. Needless to say, q and p in (11.1.18) may carry 
indices corresponding to various degrees of freedom. 

We recall that in (11.1.18), the variation of H, with respect to A, is taken 
with q(T), p(r) kept fixed. Also a and 0 are kept fixed. After all operations 
associated with the variations and subsequent integrations with respect to 
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the parameters . in (11.1.18) are carried out, these parameters may be set to 
have a priori chosen fixed values corresponding to the physical situation of 
the problem, thus obtaining the expression for the transformation functions 
(at | bt’) one is seeking. 

Of particular interests, are the transformation functions (qt|q't’), (qt| pt’), 
(pt|p’t’). For example, (11.1.18) gives 


5(qt|q't’) = -3 [ow (qt |SH (q(7), p(7), T;A) |q’'t’) (11.1.19) 


and 
5(qt| pt’) = [a (qt |5H(q(r), p(T), 73 r) |pt’) : (11.1.20) 


We will make much use of these two equations in this chapter. 
As an immediate application of (11.1.19), consider the Hamiltonian 


H(q,p,7; F(r), S(r)) = H(a,p,7) — oF (r) + pS(r) (1.1.21) 


where Fr), S(7) are numerical (i.e., c number) functions of 7 referred to as 
external sources (see also 86.2), and H(q,p,7) is independent of them.® The 
minus sign multiplying gF'(r) is chosen for convenience. 

Using the definition of the functional derivative 


oro = 6(t-7) (11.1.22) 
OL = (1.1.23) 

we obtain from (1.1.21) 
srt ann F(r), 8(7)) = —q6(t — 7) (11.1.24) 
ropiGlaa F(r), S(r)) =pé(t — 7). (11.1.25) 


With \ replaced in turn by F(r) and S(7), (11.1.19) gives the important 
results that 


(i) gry (atlat’) = (atla(r)lat) (1.1.26) 
GP) sar (ata) = (at e(r)Ia') (1.1.27) 


5 In the Heisenberg representation, H(q(r),p(r), T) will depend on Fr), S(7), and 
if we perform variations with respect to F(7), S(7), then according to (11.1.18), 
(11.1.19), g(7), p(7) should be kept fixed. 


654 11 The Quantum Dynamical Principle 


for the matrix elements of the Heisenberg operators q(T), p(T), for t! <7 <t, 
where it is understood that (gt|q't’) and the matrix elements in (11.1.26), 
(11.1.27) depend on the external sources F’, S (unless they are set equal 
to zero). For the simplicity of the notation only, we have suppressed this 
dependence on F’, S in (11.1.26), (11.1.27). 

For more examples of functional derivatives, consider the functional (i.e., 
function of function(s)) 


t t 
G|F, S$] = [av [ow F(t’) A(z’, 7”) S(7”) (11.1.28) 
t! t! 


where A(r’,7”) is independent of F, S. Then for t! <7, <t, t'<t2<t 


t 
G|F,S] = [art age S(7") (11.1.29) 
t’ 


G[F, S] = A(71,72) (11.1.30) 


Similarly for ’ <tr <t 


5 [ t 
— art far" F(t’) A(7’, 7") F(r” 
sry far far’ POD AG FO) 


t t 
= en A(t, 7") F(r”") + i dr’ F(r') A(7’, 7). (11.1.31) 
t t! 


More generally, let G[F] be a functional of F’. Replace F(r’), wherever 
it appears in G[F], by F(r’) + £6(7 — 7’), then the functional derivative of 
G|F], with respect to F'(r), is formally defined by 


° _.. GLF(’) +<6(r — 7’) - G[F(r’)] 
a ee Z 


(11.1.32) 


keeping in the numerator terms of order ¢ only on the right-hand side of 
(11.1.32) before taking the limit « — 0. 

Before considering detailed applications of (11.1.18), we generalize the 
latter further. To this end, consider an arbitrary function B(q,p,7; A) of the 
variables indicated, and define its Heisenberg representation at times 7 by 


U'(r, A) B(q, p, 7; A) U(r, A) = B(q(r), v(7), 73.4) = Br, A). (1.1.33) 
We note that 
V(t, A) IB(r, A) V(t", A) 


= V(t, d) Vi(r, A) UT (r) B(q,p, 7; A) U1 (7) V(r, A) VEE’, A). (1.1.34) 
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Accordingly, from (11.1.13), and (11.1.34) we have 


5 [vie d) B(r, A) VT(t", | 


F t 
= -2V(t,») i dr’ 8H(r’, X) IB(r, ) Vie’, d) 


+ V(t, A) SIB(r, A) V4 (#", A) 


i 


5 V(t d) [ew IB(r, 4) 8H (7’, A) Vi(t', A) (11.1.35) 


where, according to (11.1.34), the variation in 5IB(r, A) = 6B(q(r), p(7),7; A), 
with respect to A, is out carried by keeping g(r) and p(r) fixed. 

We may use the definition of the chronological time ordering in (A-2.5.6), 
to combine the first and the last terms on the right-hand side of (11.1.35), 
thus obtaining 


5 [ve d) B(r, A) VI(t", )| 


= ~2V(t,9) fer (IB(r, A) 8A (7’, A), VI(t', A) 


+ V(t, A) 5IB(r, A) VT(t’, 2). (11.1.36) 


Upon taking the matrix elements of the above equation with respect to 
,(at|, |bt’), and using (11.1.8), we obtain 


5(at |IB(z, A) |bt’) = -; [ow (at| (IB(r, 4) SIH(7’, A) | | bt’) 


+ (at |8IB(r, \) bt") (11.1.37) 


with all variations taken by keeping g(r), p(7) for all 7 in the interval from 
t’ to t, as well as a, b, fixed. 
In particular, (11.1.37), in reference to (11.1.19), implies that 


(ih) eciy (atlat ela’) = (it 


= (at| (a(7’) (7), |e’) (1.1.38) 
and there is no additional term, as in (11.1.37), since g(r) in (11.1.38) should 


be kept fixed, when varying F'(r’). 
Repeated applications of (11.1.38), give 
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& . $b 


(i So (IM Spey OM gga OM gg (atl) 


S(T) 
= (at| (a(71) +--+ a(t) v(T1) «PCT n)) , |e’) (11.1.39) 
for t' < T1,---5TnyT]+++sT, < t. For the time ordering of several operators 


see (A-2.5.7). Note that all the functional derivatives operations on the left- 
hand side of (11.1.39) commute. 

In particular we note that in considering the matrix elements of the prod- 
uct of non-commuting operators evaluated at coincident times such as in 
(qt | p(T) q(T) |q’t’), the latter is obtained by functional differentiations in the 
following limiting way 


(qt |q't’) (11.1.40) 


(—ih) 


: 5 
Mca) ore 


with « — +0. For replacing such matrix elements of the product of non- 
commuting operators, evaluated at coincident time 7, by functional differen- 
tiations with respect to their corresponding external sources, the time para- 
meters in the arguments of the latter should be infinitesimally displaced, as 
done in (11.1.40), to preserve the initial order of the operators in question. 
Thanks to relations such as in (4.5.31)—(4.5.37), the Hamiltonian does 
not always have to be a polynomial in the dynamical variables. For a typical 
Hamiltonian as in (10.3.26), the question of the ordering of non-commuting 
operators (at coincident times) does not arise. A matrix element such as 


(qt| (q(r))° qt’) may be considered to be obtained as the limit of a symme- 
tric average of (qt| (q(71) q(t2) + q(72) a(71)) /2 |q’t’). 


11.2 Expressions for Transformations Functions 


Given a Hamiltonian H(q,p,t) we derive expressions for transformation 
functions. As before indices corresponding to various degrees of freedom will 
be suppressed. 


1. For a given Hamiltonian H(q, p, t) = H, we introduce a new Hamiltonian 
by multiplying H by a parameter \ and by adding to it source terms as in 
(11.1.21): 

H! = dH — qF(r) + pS(r). (11.2.1) 


From (11.1.19), 


“ (g|qt’), = -; [ar (at\H (ar) 2).7) ae), (11.2.2) 
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where® H(q,p,T) is independent of \. Hence one may apply (11.1.39), by 
keeping q(T), p(7) fixed for all t’ < 7 < t, to rewrite (11.2.2) as 


a Wr a  [ pg, Oe ogee 0 HI 
py (tata = ; en Ui Osa (qt|q't'),. (11.2.3) 


We may integrate this equation over \ from A = 0 to A = 1, to obtain 


(qt|q't’) = exp [-+ far a( ings ihe (qt|q't’), (11.2.4) 


where (qt|q't’) = (qt|q't’),_,, and (qt|q't’)), with A = 0, is determined (see 
(11.2.1)) from the simple Hamiltonian 


HA = —qF(r) + pS(r). (11.2.5) 


The Heisenberg equations (2.3.61), (2.3.62) following from this Hamil- 
tonian are 


q(T) = S(r) (11.2.6) 
P(r) = F(r). (11.2.7) 


These equations may be integrated to 


q(T) = q(t) — fe O(7’ — Tr) S(7’) (11.2.8) 


p(t) = p(t’) + [a O(r — 7’) F(7’) (11.2.9) 


and taking the matrix elements between (qt| and |pt’) for \ = 0, we obtain 


(qtla(7)|pt’) 9 = c = [ew Q(r' — 7) si) (qt|pt’)o (11.2.10) 


(qt|p(7) |pt’) 9 = lp+ [a @(7r — 7’) F(r)| (qt | pt’), (11.2.11) 


where g and p within the square brackets on the right-hand sides of the above 
two equations are c-numbers, and we have used the relations 


o(gt| a(t) = ¢ ofat| (11.2.12) 


p(t’) |pt')) = p|pt’), (11.2.13) 


SH (a(r), p(t), T) involving operators in the Heisenberg representation developing 
in time with the Hamiltonian H’ in (11.2.1), however, depends on \. 
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for A = 0 at coincident times. Equations (11.2.10), (11.2.11) may be rewritten 
as 


~ih (gt |pt’)) = la far (7! — 7) sr) (qt|pt'), (1.2.14) 


es (qt|pt’)o = l>+ [ar O(r — 7’) F(r)| (qt|pt')g - (11.2.15) 


These equations may be integrated to yield 


atv y= em [a fae 20] en [Eo far 50] (0) 
x exp -5 | ‘dr | ‘ar! S(7) Ol = )F(r)| (1.2.16) 


where the exp(igp/ h) factor is to satisfy the boundary condition for F = 0, 
S =0, ie., for H — 0 in (11.2.5). 

Finally to obtain the expression for (qt|q't’)), we multiply (11.2.16) by 
(pt'|q't') = exp(—ig'p/h) and integrate over p, with measure dp/2mh, to 
obtain” 


ltla't)9=5(a—a' - far str )) exp | af arr | 


x exp E | ‘dr | ‘dr S(r) (7 — 7’) F(r)| (11.2.17) 


which is to be substituted in (11.2.4). This latter expression will be, in par- 
ticular, very useful when we make contact with the path integral formalism 
in $11.4. 

2. Consider the Hamiltonian 


2 
Pp 

Hh = — -4qF (tr S(r). 11.2.18 

j= 2 —aF(r) + pS(r) (1.2.18) 

Repeating the analysis given above for the transformation function (gt| pt’) 


we readily obtain 


(qt| pt’) = exp sa fe (ins) | (qt| pt’), (11.2.19) 


where (qt| pt’), is given in (11.2.16). We are interested in evaluating (qt| pt’) 
n (11.2.19) for S(t) = 


” This equation is also valid for several degrees of freedom with an obvious nota- 
tional adaptation. 
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To the above end, set 


fo O(7 — 1’) F(r’) = F(r) (11.2.20) 


then the expression depending on S(t) in (11.2.16) may be rewritten as 
. at 
exp 5 [a S(r) (p+ Fr))| (1.2.21) 
t/ 


and since we will consider the limit S(t) + 0, we may replace (5/5S (r))? 
in a 2. 9) when operating on the functional in (11.2.21) simply by [—i(p+ 


(r)) /A]? . It is then straightforward to obtain (Problem 11.1) from (11.2.19), 
a 2.16) 


(qt| pt’) = exp F (w- ae = r))| 
xem |e far F(r )(1- 2e-n)| 


. t t 
x exp aa [ar far ro (t—7s) F(7’)| (1.2.22) 
2mh t! t! 
where 

TS max(G7 ) (11.2.23) 

with (11.2.22) corresponding to the Hamiltonian 
a 
2m 
To obtain (qt|q't’) for S = 0, we multiply (11.2.22) by (pt'|q’t’) = 


exp(—iq'p/ h) and integrate the elementary “Gaussian” integral over p with 
measure dp/27h giving® 


(qt t’) 1\2 
do la sae x | ela q) | 


xexp |r | fore 1G +291) 


ues ay eS a 
(11.2.25) 


Ho) = — — ¢F(r). (11.2.24) 


8 Needless to say this expression is easily rewritten when one is dealing with sev- 
eral degrees v of freedom. In particular, (m/2nihT)/ * would be replaced by 
(m/2nihT)”’?. 


660 11 The Quantum Dynamical Principle 


(compare with (9.4.21) — see Problem 11.2), corresponding to the Hamil- 
tonian Hq) in (11.2.24), T=t-t. 

3. By an almost identical procedure as obtaining (11.2.4), the transformation 
function (qt|q't’) ,, for the typical Hamiltonian 


H=/4V(@) (11.2.26) 


is worked out to be 


i 6 (0) 
qt|q't') , = exp -+ | dr v(- inser | qt\q't’ 11.2.27 
(atld't) y ah sray)| welaty| an227 
with (qt|q't’) given in (11.2.25). 
One may use the identity (see Problem 11.6) 
F=0 F=0 


for two functionals G1, G2, to rewrite (11.2.27) as 


m 
’t' 1\2 
(tldtn = \/ oar ° “a q) 


ih 
re E T 6F (1) 8F (7) 
i ort 
x exp 7 dr v(Fin)| (11.2.29) 
" F=0 
Since 4 | / 
- 5b 
dr |q +24 t! 11.2. 
exp if a + 7p (Tr ) al ( 30) 
represents the translation operator of F(r) by [q’+(q—q)(r—-t’)/T], (11.2.29) 


becomes 


a 1\2 
Hah m im(q —¢q ) 
(GOO = oninT exp | nT 


Pr occ ees 
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x exp i farv(a+ a) (7 —t') 4 vier) 


where we have made the substitution F(r) —~ Vh Q(r). We note that 
Vh Q(r), in the argument of V in (11.2.31), takes into account the devi- 
ation of the motion of a particle from the one given by the straight line 
[a + (q—¢)(7 — t')/T]. The effect of this deviation is obtained by the 
functional differentiations provided by the exponential factor involving the 
functional differential operators (/5Q(r)) (6/5Q(7’)). 

4. We apply the quantum dynamical principle in (11.1.20) to the forced 
harmonic oscillator problem (§9.4) 


p 


1 
a mu — qF(r) (11.2.32) 


2 
by a method which is rich enough in that we consider variation not only of 
the external field F'(7) but also w, i.e., the variation of a physical parameter 
in the theory. 

Heisenberg’s equations (2.3.61), 


(2.3.62) are given by 
) 


q(T) = me) (11.2.33) 
p(T) = —mw?q(r) + F(z). (11.2.34) 


For t' < 7 <t, the solutions of these equations are given by 


q(t) = ate a 7] 


coswT' mw coswT 


t 
+ = far Kina ERG) (11.2.35) 
™m Se 


p(t) = —mu ate 


sinw(t —t’) p(t’) cosw(t — T) 
cos wT’ mw  coswT 


t 
d 

+f dr — Kiar) Fr) (11.2.36) 
t dr 


, 


where T = ¢ —?’, and the c-function K(r,7’) satisfies the conditions: (t’ < 
7,7’ <t) 

() 
=0 (11.2.37) 
(i 
= (11.2.38) 
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(iii) 
d? 2 / U 
so that G(r) + w¢(7) = F(r)/m, ie., 
d? 2 / y 
— + | K(7,7') =0 for TAT (11.2.40) 
dr2 
and as a function of T 
d 7’ +0 
—* / = 
qh (nr) a 1. (11.2.41) 


It is not difficult to show (Problem 11.7) that (73 = max(7,7’), Te = 
min(r, 7’)) 
sinw(t — Ts) cosw(T< — t’) 
K = ; 11.2.42 
us w cos wT ( ) 


Upon taking matrix elements of (11.2.35) we obtain 


(qt\q(7)|pt’) = sto + ~ / dr’ K(r, r') F(r)| (qt | pt’) (11.2.43) 


t 


where 


(11.2.44) 


i= | cos w(t — t’) D we 


coswI’ mw coswl 


is a cnumber function. From (11.1.20), we may rewrite (11.2.43) as 


“hep (qt| pt’) = a(n + 5 fe K(r,7’) F(r)| (qt|pt’) (11.2.45) 


which upon integration gives 


(qt| pt’) = exp F [ow dc(T) F(7) 


j t t 
Bre [ar Fo) K(1,7') F¢r)| (at|Pt) po. 
(11.2.46) 


To solve for (qt|pt’) »-), we carry out another functional differentiation 
of (11.2.43) with respect to F'(7”), using (11.1.38), and then set F’ = 0 in the 
resulting expression to obtain 


(atl (a(7") 4(0)) Ie) pag = [ sel) ace") = ECs") (att eno 
(11.2.47) 
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we take the symmetric limit 7” © 7, as the average of 7” = 7+0, 7” = 7-0, 
to write 


ih 
(at?) pao = [2O)—2KG.7)] (ative pag (11.248) 
and integrate over T. This gives 
: 2 ’ : 2 ih ’ 
(qt] J dt g°(7) |pt!) poo = | fat ae (7) — — dr K(r,7)] (qt | pt!) poo - 
t’ tf. t’ 
(1.2.49) 


Now we invoke the quantum dynamical principle (11.1.20), in regard to 
the mw?q?/2 term in the Hamiltonian, to obtain 


fe aes) 
iD ae (atl Pe) = (atl far ate hi) ae (11.2.50) 


With g(r) defined in (11.2.44), and K(r7,7’) given in (11.2.42), we have 


sinw(t — 7) cosw(7 — t’) 


K = 11.2.51 
(757) wcoswT ( ) 
leading to the integrals 
/ z 1 
[a K(t,7) = ae tanwT = sop cos (11.2.52) 


t 
1 O p? tanwT 
dr g2(r) = —— |q?— iim Pik a 
¢ T q(T) mu Dw es 5 tanwT' — qpsecw TP eas 
(1.2.53) 


From (11.2.50), (11.2.49), and using the integrals (1.2.52), (11.2.53), we 
obtain upon integration over: w, and noting that 0/Ow? = 0/2wdw, 


(qt| pt’) pop = 


1 i 
———. ex 
VcoswT : | h 


tanwT 
Seem Oe )I. (1.2.54) 
2m w 


This also satisfies the boundary condition 


2 
i P 
(gt Pt") r-0 —S exp | (w aa (11.2.55) 


The expression in (11.2.54) is to be substituted in (11.2.46). Since 


[ dt qe(T a / oe F(r) cosw(t —t’) 


~ coswI 
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t 
P . 

— —— ]/ dF 3 t— 11.2.56 

mw coswT i: TE eee o) ( ) 


is linear in p, we may, after multiplying (11.2.46) by (pt’| qt’) pp dp/27h, 
integrate over p, the elementary “Gaussian” integral to obtain after re- 
arrangement of terms the explicit expression 


mw 
2rihsin wl 


_ mw ((q?+q") / 
xe T 
mi leas wT ( Be ere oad 


(qt|q't’) = 


1 t 
+ — [a F(r) [qsinw(r — t’) + q' sinw(t — 7)] 
TMU Jer 


= — i ‘dr i ‘a! Fr) sin w(t — 7) 
x @(r — 7’) sinw(r’ — t’) F(r))| (11.2.57) 


which coincides with the expression in (9.4.32). 

5. Finally we close this section by making contact with the analysis of the 
dynamics given in the Appendix to §2.5. For a given Hamiltonian H(q,p,t) 
introduce the scaled Hamiltonian 


H'(q,p,7; ) = AH(G,P,7) (11.2.58) 


then as in (11.2.2), we have 


ch (qt\q't') = -; [oe (qt|H (q(7), p(7),7) |q't’) (11.2.59) 


and since H(q,p,T) is independent of \, we have® by repeated application of 
(11.1.37), 


3) (qt|q't’) 


= (-7) [an [owe [am (at| (IM(r1) «+ H(7»)) , lat”) 


(11.2.60) 


where 


° See (11.1.33)-(11.1.37). 
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IH(r) = H(q(r), (7), 7) (1.2.61) 


in the Heisenberg representation of H(q,p,7), developing in time with the 
Hamiltonian AH’ in (11.2.58). For \ = 0, IH(7) = H(q,p,7), (at| — (OI, 
\q’t’) — |q’0) on the right-hand side of (11.2.60). Accordingly (qt|q’t’) for 
X = 1, is obtained by a Taylor series about \ = 0 using (11.2.60), and is 
given by 


- pt 
(qt|q't’) = (q0| (exp 7 [a H(qp, »)) |q/0) . (11.2.62) 
t’ + 
In particular, for a time-independent Hamiltonian H(q,p), 


(qt|q't') = (q0| exp 5 —t) Han) |q'0) (11.2.63) 


as expected. 


11.3 Trace Functionals 


Of particular interest is the transformation function (gt|q't’) for which 
q=q, ie., for 
a(t) =4'(t) (11.3.1) 
as it may be used to obtain information on the spectrum of the underlying 
Hamiltonian. 
Actually, we consider the trace of (gt|qt’) defined by 


x 
[ (qt | qt’) (11.3.2) 


where L will taken to be arbitrarily large. To study the properties of the 
object in (11.3.2), we carry out a Fourier series analysis, 


q(T) = pu dn Xp — r) (11.3.3) 
F(r) = ; > Fy, exp er - 2) (11.3.4) 
S(r)= 5 pp Cae ire : 4) (11.3.5) 
BO = 3 exp =: (7 v) aa (11.3.6) 
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and note that from 


6) 
or, i” Onm (11.3.7) 
we have 
& pe oy ae ee i2an(t — 7’) 
ara 7D o| P d(7 — 7’) (11.3.8) 
Similarly, we may write 
5 -i2nn(r —t’)] OA 
sey Y | 7 aS, (11.3.9) 
6) 


For a given time-independent Hamiltonian H(q, p), we may, in particular, 
use (11.2.4) and (11.2.17), to write 


[ia (tlat) = fi dg exp [+ fara ( ingots | 
<a far sin) eo [ta far een 
xexp[—i far far’ si Q(r’ — 7) FO). 


From (11.3.4), (11.3.5) 


t t 
[a F(r) =F, [ou S(T) = So (11.3.12) 
! t! 


and (see Problem 11.8) 


SoFo  . <a (SnFo + SoFn) 1 
forse ) [ar (rT) = 5 is? on PD Sipe re 


n#Z0 n4~0 


Therefore, 


[oaas( forse) eo[ ta far 
xexp[-i far far’ str) @(7’ — 7) Fn) 
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= (27h) 6(So) 6(Fo) exp E S sah ' (11.3.14) 
nA#0 


For the harmonic oscillator, for example, we clearly have 


) 1 O 0 mw? O 6) 
ps. (sae OS_, 2 OF, 7) 


= exp 
0 


/ dq (qt| qt’) 


n 


1 1 
x (2h) 5(S0) 6(Fo) exp | = S- Sn5— Fn 
n4~0 0 
(ii. 15) 


where lo means setting the external sources equal to zero after the functional 
differentiations are carried out. 
We note that 


1 Ory? if mw? ( @\? 
2m OSo 2 OFo 
foe) foe) . 2 G 2 2 
= nsf ar, | dAg exp - | exp |- ei 2) 
Pa eee 2m 2 


(Q27)h [m | 1 1 
= = ; 11.3.1 
(27) VihT Vihmw?T  iTw Beene 


On the other hand, 
a lors: (= 55. 55 a a st: x) exp E ss sah 
0 
= Ilo (ea) exp (nts? 2 ae) 
ae E (Suge Sango F-n)| , 


= Tle ae ee ex sa Sn S 
~ Pim 88 OSen an)? PO 


Upon writing 


exp iAT 


(27h) 6(So) 6(Fo) 


So—0, Fo-—0 


(11.3.17) 


0 


exp (—i85y Sn) = f ar | CAE gfhide gta Sn gr 1801 Su (11.3.18) 
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with 8 = Tmw?/h(27n)?, it is easily established that the right-hand side of 


(11.3.17) is given by 
-1 


lee) Tw 2 
Lo | 3. 
(=) (11.3.19) 
n=1 
which is the infinite product representation of (Tw/2)/sin(Tw/2) 
_ Tey | ites er 1 
i f - (=) = =iTe exp [it he (n+ 5) : 
(11.3.20) 


From (11.3.15), (11.3.16), (11.3.20), we then have 


[4 (qt| qt’) = Yow [-F hw (n+5)| (11.3.21) 


recognizing the spectrum of the harmonic oscillator Hamiltonian. 
To find the number of eigenvalues < € of a given Hamiltonian H, we may 
use (9.7.24) and (11.3.11) (see also (4.5.4)) to write for the former as 


1 fe ar, 
H: iéT/h 
NUE) ie 


Qri 


x exp 


ond ar S(r .j a( ar Flr «) 
Rep -j | ae i ar’ S(1') (2! — 2) F(r) 


On the other hand, for the sum of the eigenvalues < €, with € not in the 
spectrum of H, we have from (9.7.32), (11.3.11), the expression 


(11.3.22) 


0 


1 Oe Gd = 
eifT/h 


T — ie 
-} [ar n/ ayaa) 


T T 
x (27h) a( dr st”) a( dr re) 


271 Jo 


O 
x than exp 
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aT oe 
x exp if ar | dr’ S(r') O(7' — 7) F(7) (11.3.23) 
0 0 
0 
By using (11.3.14), we may rewrite the latter 
1 f° dates, 5 5 
mee if T/A i i 
Imi J_., T—ie © i INSET) inssr) 
T 
. 6 5 
x exp -if dr a says) 
1 1 
x (2h) 6(So) 6(Fo) exp |= 5° Snx—Fn| | - (11.3.24) 
h oa 27n 


0 


For the harmonic oscillator, for example, this may be written as 


Lf AP 3 ad itu 1 
= eee AE nr. é 
Wiss Foe "OF We ew | ; v(n+5)| 


y ee eS pape cs iT , 
=m (+3) oa | Fe ex F |e n(n+5)| 


= tu (n+5) o(¢—nmw(n+5)) (11.3.25) 


where we have used (11.3.17)—(11.3.20) and (11.3.16). Here € may be taken 
to fall between two consecutive eigenvalues. 


11.4 From the Quantum Dynamical Principle to Path 
Integrals 


Consider the transformation function 


(gt et jo = a(« -qd- [ow s(7)) exp F afer Fin) 


+ pt t 
x exp [5 far rer) far s(7)| (11.4.1) 
t’ 7! 
derived in (11.2.17) for the Hamiltonian 


H = —qF(r) + pS(r) (11.4.2) 
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where 
(tlaH=aiatl, at)\|d’t)=d |dt’). (11.4.3) 


We divide the time interval t’ to t into N subintervals and introduce the 
telescopic representation 


t teqi 
[a S(t) = S- / dr S(7) = € Shay (11.4.4) 
é k=0 % tk k=0 
with 
to =U, ty =t (11.4.5) 
t-?t 
E= te — tr = ( N ) (11.4.6) 
and 
1 tet 
Sh = -/ dr S(T) (11.4.7) 
tk 


as a mean of $(7) on the subinterval from ty, to ty41. 
It is easily verified by explicit integrations over q1,...,q@n—1 that 


(a-¢- arse) 
= [an [ qe: faa 1 0(q1 — d — 51) 


x 6(q2 — 41 — €S2)---6(q — qn-1 — ESN). (11.4.8) 
Note that the N Dirac delta distributions imply that 
(81 b+ Sy) = 4-4 
e(Sg+---+ Sw) =a-H 
(11.4.9) 


eSn =4-4N-1- 


We are interested in making the subdivisions of the time interval from t’ to 
t finer and finer by taking the limits e — 0, N — oo. 
To the above end, we effectively have 


[avec [arse S(r we € Fpyi€ (Seppo +...+ SN) 
t! us k=0 


11.4 From QDP to Path Integrals 671 


N-1 
= € Fe4i(qn = Ok+1) (11.4.10) 
k=0 
and 
t N-1 
af ar’ FO’) =an 9° € Frat (11.4.11) 
é k=0 
with 
Iw=q% d=. (11.4.12) 


Hence the right-hand side of (11.4.1) may be formally defined as the 
N = o limit of 


& “iy . N-1 
1 N 
[oa -+-dqn-1 ooh ae ah 2. Pr+i(Gk+1 — Ck — eS) 


N= : 
x exp 5 i € Feii(dn — Ge41 | exp i = "Feats (11.4.13) 
k=O 


by introducing, in the process, the integral representation of the N Dirac 
delta distributions in (11.4.8), or equivalently as the N — oo limit of 


dp dp ic q q 
1 N k+1— 4k 
| AION Ge ae OR i oz ek ( é ) 


. N-1 SNES 
1 iL 
x exp i S EQr4iF e441 | exp -j S cmussSto : (11.4.14) 
= k=0 


By taking the limit N — 00, (qt|q't’)) may be rewritten in the form 


‘tf a(t)=4 
latla't)y =f (al-).20)) 
a(t')=a' 
i t 
x ex | far (ole) alr) + alr) PO) — 07) 80] 
t! 
(11.4.15) 
where Y(q(-),p(-)) is defined as in (10.3.25), and, needless to say, q(t), q(t’) 
here denote c-numbers. 


For a given Hamiltonian H(q,p,t), one obtains from (11.2.4) the path 
integral expression 


Vig ( 
= Aaq(-), p(- 
S=0,F=0 q(t')=q' ( ) ( ) 


(qt|q't’) 
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exp |e far (oleate) ~ H(alr).20%).7)]| 
(11.4.16) 


thus establishing the formal connection between the quantum dynamical prin- 
ciple and the path integral formalisms. 

As before for a Hamiltonian in the form as in (10.3.26), we may integrate 
over the momenta to obtain 


a(t)=4 i rt m@?(T) 
t\q't’ ~ ) Aa = f dp Say 
COL A2 | Darter ah pee (a(-)) exp { ar | 5 (a(7)) 
(11.4.17) 
where Y(q(-)) is defined as in (10.1.4). 
11.5 Bose/Fermi Excitations 
Consider the Hamiltonian 
H = twa, ag — al, F(r) — F*(r) ap (11.5.1) 


where ap, ay are Bose annihilation, creation operators (§6.1), with 


ap|0)=0, apgl|n)=Vn|n—-1), ak|n)=Vn4+1|n41) (11.5.2) 


and F'(r), F*(r) are external sources (§6.2), operating within a time interval 
from time t’ to t, and are zero at the boundaries tT = t’, 7 = t.!° 
Let U(r) denote the time evolution unitary operator and set 


(n| U(r) = (nz| (11.5.3) 
The Heisenberg representations of ap, al, are given by 
at (7) = Ut (r) a® U(r) (11.5.4) 


where # corresponds to the operator ag or its adjoint al. 


The Heisenberg equations are 
ap(T) + iwap(T) = = F(r) (11.5.5) 
al,(r) — iwal,(r) = —= F*(r) (11.5.6) 


with solutions 


19 See also (8.7.21), (12.6.33). 
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ap(T) =e #-*) ap +i ff dr! e#(-7) @(7 — 7) F(r’) (11.5.7) 


al,(r) =e —iw(t—T) al, ( a / ele(t— 7! O(r' —T)F*(r ae (11.5.8) 


These give the matrix elements 


t’ 


: t 
(0t| ap(r) |Ot’) = F | ar! eM Or — 7) FOr] (Ot|Ot’) (11.5.9) 


: t 
(0t| ab(r) |ot’) = li fo" ee(t—7') F(z!) O(7! -7)| (O¢|Ot’)  (11.5.10) 
t/ 
and hence from (11.5.9) 


“heer (Ot | Oz’) = if ; ‘dre pu ol -1) FOr) (0¢| Ot’) 


(11.5.11) 
leading to 


t t 
(Ot | Ot’) = exp l= | dr | dr! ei#@-7) F* (7) O(7 — 7’) F¢r)| 
a (11.5.12) 
to be compared with (6.2.12), where the latter was given for a real source. It is 
easy to see that (11.5.12) is also consistent with (11.5.10) (see Problem 11.10). 
Of particular interest is the amplitude of excitations (nt|0t’). To obtain 
this amplitude, note that in this case (11.5.9) is to be replaced by 


(nt| ap(r) |Ot") = F [ure “ie(t—7') Q(r — 7) F(r | (nt|Ot’). (11.5.13) 


For 7 = t, we may use (11.5.2) to rewrite the above equation as 


VarT (n+ de[0") =| [ere Rami! NG Fr) (nt|Ot’) (1.5.14) 


providing a recurrence relation in n, whose solution is elementary and is given 


by 2, 
lk [ave lee Fr) 
Vnl 


This is to be compared with (6.2.22) and (10.5.32). 
The transformation functions (nt|n’t’) are obtained in a similar fashion 
(see Problem 11.11) and may be compared with the expression in (6.2.30) 


(nt | Ot’) (Ot | Ot’) . (11.5.15) 
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using the present notation (see also (10.5.34)). For the study of temperature 
dependence of Bose excitations see §6.3, $11.6. 
For Fermi excitations, consider the Hamiltonian 


H= hw al, ap — °(T) ap —al n(7). (11.5.16) 
The external sources (7), 7*(7) are Grassmann variables (§10.6, §6.4), 
ie., 
5 5 
or =0 11.5.17 
{n(r). (7 )} ) 1m ” 8n#(r') \ ( ) 
and 


fee rr} =o. fee itr} = a0—7, (11.5.18) 


The sources also anti-commute with ap, ah. 

Apart from these properties of the external Fermi sources, and the anti- 
commutation relations of the ap, al, (§6.4), the analysis is almost identical 
to the Boson case with n restricted to 0 or 1 above, 


ag|0)=0, arll)=|0),  a4|0) =]1), (11.5.19) 


and due to the anti-commutativity of the sources with ap, ah,, the commu- 
tator 


[av(r),ab(r) n(r)| = {ar(r) ,ab(r)}n(r)=n(r), (1.5.20) 


for example, gives rise to an anti-commutation relation as indicated. 
Equation (11.5.15) is now replaced by 


‘ ¢ n 
(nt| Ot’) = fe owt) n(n] 
t’ 


1 t t . : 
x exp = [ar far e #7) n* (7) O(r — 7’) n(7’) 
t! t! 
(11.5.21) 


where n = 0 or 1, for the fermionic case. 

Needless to say interaction of bosons may be described by adding a 
AM, (a',a) term to the Hamiltonian (11.5.1) in the presence of the exter- 
nal sources followed by an application of the quantum dynamical principle. 
We will not, however, go into these details here. For the interaction of bosons 
and fermions see Problem 11.12. 
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11.6 Closed-Time Path and Expectation- Value 
Formalism 


In the present section, we apply the quantum dynamical principle to de- 
termine expectation values of physical quantities for systems prepared in 
initially specified states. 

Consider, for example, the bosonic system described by the Hamiltonian 
in (11.5.1) in the presence of external sources F’, F*, and suppose that one is 
interested in computing the expectation value (0t/ lat (7) |02’) for some 7 > 1’. 
This, however, cannot be simply obtained by a functional differentiation of 
(Ot’ | Ot’) since the latter is equal to one — a constant independent of F’, F*. 
One may, however, use the completeness unitarity expansion 


(Ot’ |Ot’) = S (Ot’ | nt) (nt | Ot’) (11.6.1) 


n=0 


and note that for t > t’, the amplitude (nt|0t’) corresponds to time evolution 
in the positive sense of time, while the amplitude (0¢’|nt) corresponds to 
time evolution in the negative sense of time from t back to t’. Accordingly, 
to generate expectation values for a system which started initially, say, in 
the state |0¢’), one may introduce, a priori, different dynamics with pairs of 
external sources (Fi, F*) and (i, F*) for the two segments in the positive 
(t’ — t) and negative (t — t’) senses of time, respectively. By doing so (11.6.1) 
becomes replaced by 


CO 


(ot! |0t’) = S— (Ot" | nt)_ (ne | Ot’) , (11.6.2) 


n=0 


which is different from one for different pairs of sources, where + refer to the 
dynamics in the two segments with pairs of sources (Fs Fi), Expectation 
values are then readily obtained from (11.6.2) as shown below as we have now 
generated two Hamiltonians H,, H_, arising from (11.5.1), corresponding to 
the above two mentioned segments. 

Using the fact that (0¢’|nt)_ may be obtained from (nt|0t’) , by complex 
conjugation together with the replacements (Fy, Ft) by (F_, F*) in the 
latter, it is easily seen from (11.6.2) and (11.1.18) that 


5(Ot’ |Ot’) | = at coe [ar [5IH4.(7) — dIH_(7)] Oe’) (11.6.3) 


and for t’ <7 <t, 


CO 


_ Stone ater), | 


n=0 


—ih Ot’ | Ot") 


sy 
SF (7) 
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= (ot Jal (r)|00’) | 


5 
= j,A—__ i tf 
ih Gp (Oe Oe!) 


= J~ (08 |at(r)|nt)_ (nt | ot") , | (11.6.4) 
n=0 


where the bar | means to replace (Fy, F_), (haan) by common source 
functions F', F*, respectively, after functional differentiations are carried out. 
The adjoint operation as applied to the product of operators appearing in an 
amplitude such as (nt|0t’)_ upon complex conjugation of the latter giving 
(Ot’|nt)_, reverses the order of the operator multiplication. For simplicity of 
the notation we have written a* for ae above. 

Accordingly, 


& 
(ih) 5F*(n) (in) SF) (Ot’ |Ot’) = (O#"| (a(71) al (72))_ Ot’) (11.6.5) 
where now (-)_ denotes the chronological time anti-ordering introduced in 
(A-2.5.13), with operators at earlier times move to the left as indicated in 
the latter equation. 

In (11.6.2), one is dealing with two dynamical systems, one in the posi- 
tive sense of time t’ — t, followed by one in the negative sense of time from 
t back to t’ forming a closed-time path. Expectation values of physical quan- 
tities at any time t’ < 7 < t, for a system in a specified initial state, may be 
then obtained by functional differentiations with respect to, say, F',, Ff fol- 
lowed by replacing (High); (Pa #) by common source functions F’, F*, 
respectively, which in turn may or may not be set equal to zero depending 
on the physical system into consideration. This simple, though elegant way, 
of computing expectation values is referred to as the closed-time path and 
expectation-value formalism. 

From (11.5.7), (11.6.2), the sum over n in the latter is easily carried out, 
giving the expression 


t t 
(0t' | Ot’) = exp E | dr if dr! {ertetr-7) [Fi (r) O(7 - 7’) Fx(7’) 
a t! 


+F* (1) O(7! — 7) F_(1') — F*(7) Fy (7) } . 
(11.6.6) 
This satisfies the condition 


(0t’|Ot’)=1 for Fy =F_ (11.6.7) 


11.6 Closed-Time Path & Expectation Value Formalism 677 


as expected. More conveniently, (0¢’|0¢’) in (11.6.6) may be rewritten in the 
compact form 


(0t" | Ot’) = exp Pa [o [er pie eae) Ko] re 
where K(r) = ae) (11.6.9) 


As an application, consider the harmonic oscillator problem. From (6.1.8), 
(6.1.10), we recall that 
h 


J/2mw 


Suppose at time ¢t’ = 0, the system is in the state |0). To compute the 
expectation value of any functions of the displacement operator q(T) at any 
time 0 < 7 < t for the system initially in the state |0), we consider the 
expectation value of the operator exp [ikq(r)], where k is an arbitrary c- 
number. From (11.6.6), (11.6.11) 


oan) 


(ata). (11.6.11) 


kh? : A / 
=ex dr’ 0(r —T 
pear 72? | Gene far" ote" = 7) 


‘ Gara | =a) (010) | (11.6.12) 


These functional differentiations operations give rise to translation oper- 
ators leading to (see Problem 11.13), 


(0 0) mop = (0 0) Lsteg exp (a5 lin) (11.6.13) 


where 
hk? 7 t 
pear er ss | des i: dre 


(0 
x6(71 — T) O(m1 — 72) 6(72 — n| (11.6.14) 


eika(r) 


eika(r) eika(r) 


eika(r) 


0) 


G[F;7] = | dr’ [F* (7’) Pag as F(z’) eler'-0)] (11.6.15) 
0 


and we have used, in the process, that 1 — O(7’ — rT) = O(r — 7’) to simplify 
the expression in (11.6.15). 
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The expression in (11.6.14) may be unambiguously evaluated by not- 
ing that an elementary application of the Baker-Campbell-Hausdorff formula 
(Appendix I) 


exp(A + B) = exp (5 [A.B]) exp(B) exp(A) (11.6.16) 


with A = ihka/V2mw, B = ihkat/V/2mw, yields for the left-hand side of 
(11.6.14) exp [—h?k?/4m?w?], from which we here adopt the definition that 
0(0) = 1/2 on the right-hand side of (11.6.14). Thus we obtain 


(0 0) leer = exp (-45) exp (ee) . (1.6.17) 


mw? J2mw 


In particular for the expectation value of the displacement operator at 
time T, we have 


i : * iw =F iw(r’—7+ 
(0a(r)]0) |= — yom far! [Errore — Ber) eee) 


_iG|F;7] 
V2mw 


as a linear response to the external source F’, and note that the expression 
on the right-hand side of (11.6.18) is real as it should be. 

On the other hand, upon multiplying (11.6.17) by exp(—ikq) dk/27, where 
q is a c-number, and integrating over k from —oo to oo, gives 


eika(r) 


(a(7)) (11.6.18) 


mew? mw? 


2 
(0 l5(q = q(T)) |0) Pas ear exp -73 (4 = (a(r)) ) (11.6.19) 
as the probability density for the displacement of the harmonic oscillator at 
time 7, which initially, at time ¢’ = 0, is in its ground-state, where (q(7)) is 
given in (11.6.18). 

The transformation function (nt! | nt’) for a closed-time path for a system 
which started initially in the state |nt’) may be derived by methods similar 
to the ones used in 86.4 as given below. 

To the above end, we write 


Fa(r) = FP (1) + FP (7) (1.6.20) 


where Fi) (7) is switched on only within an interval from t” to t’, and Fe) (7) 
is switched on only within an interval from ?’ to t. F°)(r) and FY (7) cor- 
respond to the negative sense of time from t back to ¢’ and t’ back to t”, 
respectively. 

To obtain the expression for (nt’|nt’), we note from (11.6.6), that 
(ot” |0t”)"* may be rewritten as 
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(ot” |ot”)"* = (0t” |ot)"~ (O| 08”) "* 


1 ft t ; 
x exp Laz [ar far’ Bie eT Fe) 
t"! t"! 
(11.6.21) 
The decompositions of the sources in (11.6.20) with their causal arrange- 


ments with FO) (r), for example, switched on after FO (7) is switched off, 
allow us to rewrite (11.6.21) as 


PF?) 


(ot”’ | ot’) "* = (ot! ot") AL A+ (ot' Jot”) "* eALB QB At (ot! |ot’) "2 


(11.6.2) 
where 
ae Oates 0 

Ay = if dre ero D R(T) (11.6.23) 

t”’ 

- at 
B= - | dr eiv(t’-7) [FP (7) — F? (7)). (11.6.24) 

t’ 


Equation (11.6.22) is to be compared with the one obtained from a com- 
pleteness unitarity expansion 


" i — " " FO ! jE ! ny EO 
(ot"” |ot"”)” = S> (oe |Nty~ (Nt | Met)"* (Mt |0t”)"* (1.6.25) 
N,M=0 


(a) 
where, for example, (Mt’| ot”) P+ may be written down directly from 
(11.5.15) with an obvious change of notation, and is given by 


(Ay)™ 


A (oe jot”) Fe (1.6.26) 


(1) 
(Me |ot”)*+ = 


and similarly, 
po) (AE) 


VN! 
FO) 
To obtain the expression for (nt’|nt’),* for arbitrary n, we expand the 


exponentials exp(A* A;), exp(A* B), exp(—B*A,) in (11.6.22). 
To the above end, 


" nF _ " / 
(ot” | Ne’) = (Ot" |02’) (1.6.27) 


eALA+ pALB p-B*AL 


SS 4) yet VFO G+ ol (By (-BY 
: a,b,c=0 V(a +b)! JV(a +c)! a! b! au = 
(11.6.28) 
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Upon setting a+b = N, a+c= M, selecting the diagonal element 
N = M = nin (11.6.25) we obtain by comparing (11.6.22) and (11.6.25) and 
using in the process (11.6.26)—(11.6.28), that?? 


(nt’ |nt!)"* = (ot' jot’) S~ ale (-|BI?)”* (11.6.29) 
ao |(n—a)!] al 


now for arbitrary sources F', F_ in operation within the interval from ¢’ to 
t and t back to t’, respectively, 


2 


p= 2 [oe e--7) (Fy (7) — F_(r)) (1.6.30) 


~ Ae 


and we have set (0t’|0t’)** = (Ot’|Ot’) as in (11.6.6). 

It is physically interesting to consider the initial state to be a thermal 
mixture of energy states. This is obtained by multiplying (nt’ | nt’) in (11.6.29) 
by the Boltzmann factor exp(—fhwn/kT) (see also §6.3) and summing over 
all n, 


(1 — ec Me/A) S> eoen/kT (nt! [nt )* = |)? (1.6.31) 
n=0 
where (1 — ea hw/ ee) is the normalization constant for the Boltzmann factor. 


The summation in (11.6.31) is carried out by following the steps in (6.3.5)— 
(6.3.8) to obtain for a thermal mixture at temperature T’ 


1 1 
h2 (chw/kT — 1) 


(t'|t!)” = (Ot |Ot’) exp 


x 


if dr e#(—7) [Fy (7) — F_(1)] 


| (11.6.32) 


where (0t’ | Ot’) is given in (11.6.6). Equation (11.6.32) generalizes the expres- 
sion for (0t’|0¢’) in (11.6.6) to finite temperatures with the initial state of 
the system being a thermal mixture. 

(t’|t')” may be rewritten in a compact form as in (11.6.8). Applications 
of (t/|t’)” such as in (11.6.18), (11.6.19) are given in Problem 11.18. 

Interactions between bosons (and bosons/fermions) may be considered 
by the addition of interaction Hamiltonians to the free oscillators, in the 
presence of external sources, and consider the dynamics in the positive sense 
of time ¢/ — t and then in the negative sense from t back to ¢t’ followed by 
an application of the quantum dynamical principle in the two segments. We 
will not, however, go into these details here. 


4 Similar methods are used in field theory, see: Manoukian (1991). 
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Problems 


11.11. 


11.12. 


11.13. 


11.14. 


11.15. 


11.16. 


11.17. 
11.18. 


. Obtain an expression for (qt | pt’) 


. Work out the details specified below (11.2.19), involving the functional 


differentiations (6/5S(7))’, to obtain the equality in (11.2.22). 


. Carry out the p-integration leading to the result in (11.2.25) and show 


its equivalence to one in (9.4.21). 

(9) corresponding to the Hamiltonian 
Ho) = p?/2m — qF (rT) + pS(r), for S(r) 4 0, generalizing the one in 
(1.2.22). 

Use (11.2.22) to determine (pt |p't’) for the Hamiltonian in (11.2.24). 
Generalize the expression in (11.2.57) for the forced harmonic oscilla- 
tor by adding a term pS(7) to the Hamiltonian in (11.2.32). 
Establish the identity in (11.2.28). 

Verify that K(7,7’) in (11.2.42) satisfies the conditions (i)—(iii) in 
(11.2.37)-(11.2.41). 


. Use the Fourier transforms defined in (11.3.4), (11.3.5) to derive 


(11.3.13). 


. Show that the double Fourier integral in (11.3.18) leads from (11.3.17) 


to (11.3.19). 

Verify that (11.5.12) is also consistent with (11.5.10) for the creation 
operator a'(r) by taking the functional differentiation —ihS/5F (rT) of 
the former equation. 

By a method similar to the one used to obtain (nt|0t’) in (11.5.15), 
derive an expression for (nt|n’t’), and compare your result with the 
one in (6.2.30). 

Interactions of bosons and fermions were considered in (6.5.18), 
(6.5.30) with the latter being supersymmetric. Investigate the nature 
of the transformation functions for such interactions by adding cou- 
plings to external sources in the Hamiltonians. What is the significance 
of the state (07; A] = (0,0| U(r, A) for 7 large, given that |0,0) denotes 
the ground-state of the free Bose-Fermi oscillator in (6.5.8)? 

Show that the functional differentiations in (11.6.12), as translation 
operators, lead to (11.6.13). 

Find (0|p(r)|0) , and (0 |6(p — p(r)) |0) E in analogy to the results 
in (11.6.18), (11.6.19), respectively, for the momentum description of 
the oscillator. 

Spell out the details leading from (11.6.28) to (11.6.29) for the trans- 
formation function (nt! |nt’)"*. 

Follow the steps through (6.3.5)—(6.3.8) to obtain the expression for a 
thermal mixture in (11.6.32) starting from (11.6.29). 

Rewrite (t/|t/)” for a thermal mixture in a compact form as in (11.6.8). 
Find the average displacement a Ja(r)|t’)” and the probability 


density (t'|5(q—a(r))|t")”, generalizing the results in (11.6.18), 
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(11.6.19), for an initial state as a thermal mixture with transformation 
function given in (11.6.32), at finite temperature T. 


12 


Approximating Quantum Systems 


Several approximations were already carried out in earlier chapters. In this 
respect, for example in Chapter 7, we have considered relativistic corrections 
to the energy levels of the hydrogen atom, also the non-relativistic Lamb shift, 
and treated the atom in external electromagnetic fields. In Chapter 8, we have 
discussed the validity of the exponential law in a two-level system (88.1) and 
related approximations involved. In the same chapter, radiation loss in spin 
precession was studied (§8.4), a computation of the anomalous magnetic mo- 
ment of the electron was made (§8.5), the problem of quantum decoherence 
by the environment was considered (§8.7, §8.9) and the so-called geometric 
phase in the adiabatic approximating regime was investigated (§8.13). Some 
approximations are also given in Chapter 15 on quantum scattering, just to 
mention a few of the applications of approximation methods. The present 
chapter supplements these studies by investigating the nature of several ap- 
proximation procedures, some of which are related to the above applications 
in other chapters. Accordingly, the latter material may be read in conjunction 
with the present one. 

Sections 12.1, 12.2 deal with conventional time-independent perturbation 
theories, followed by one on variational methods. High-order perturbations 
and related divergent series as applied to an anharmonic oscillator potential 
is the subject matter of 812.4. In 812.5, we study the so-called semi-classical 
WKB approximation. Time-dependent perturbation theory is treated in §12.5 
dealing, in particular, with the sudden and adiabatic approximations in which 
a Hamiltonian may change rapidly in time in a very short time interval and in 
the other extreme a Hamiltonian may change very slowly during a long time 
span, respectively. In the last section, we study, in the density operator for- 
malism, the response of a system, into consideration, to another system, such 
as the environment. In this section, we derive the master equation describing 
the dynamics of the reduced density operator of the system of interest after 
having traced the density operator of the combined two systems over the 
variables of the other one. 
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12.1 Non-Degenerate Perturbation Theory 


Consider a Hamiltonian H° which, as part of its spectrum, has a discrete 
non-degenerate one with eigenvalue equation 


H® |n)) = ER |n)q (12.1.1) 


where the E? are non-degenerate and the eigenvectors |7), are orthonormal. 
We add to H® a term Hj, referred to as a perturbation, which is in some 
sense small in comparison to H®, thus introducing the Hamiltonian 


H=H°+M. (12:59) 
We suppose that for the new system, we have an eigenvalue equation 
Hf |n) = E,, |n) (12.1.3) 


and that E,, is near E® for a given quantum number n. The shift in energy 
due to the addition of the perturbation H; to H® is defined by 


AE, = E,; —E° (12.1.4) 
and we set 
|n) = |n)o + |n)’. (12.1.5) 


Upon multiplying (12.1.3) from the left by 9(n| and using (12.1.1)— 
(12.1.5), we obtain for the energy shift the expression 


AEn = aaa (12.1.6) 


This is invariant under phase transformations 
|n) > |n) e7 (12.1.7) 


as the phase factor cancels out from the numerator and the denominator in 
(12.1.6). 

To first order in H,, we may replace |n) by |n), in (12.1.6), to get for the 
energy shift 


AE), = o(n|Hi|n) 9 (12.1.8) 
or 
En, = Eo + Ei (12.1.9) 
with 
Ey, = 0(n|Hi|n),- (12.1.10) 


To first order, we write 


In) = |r)o + Ir), (12.1.11) 
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and use the normalizability of |n), |n), 
1= (n|n) ~1+o(n|n), +1(n|n)5 (12.1.12) 


to infer that 
o(n|n), =—o(n|n);. (12.1.13) 


That is, 9(n|n),, if not zero, is pure imaginary. Now we invoke the invariance 
of AE,, in (12.1.6) under phase transformations as in (12.1.7) to carry out 
the transformation 


In) — |r) = |n)e'? ~ |n), (L+i7) + |n), (12.1.14) 
and choose 
y =io(n|n), (12.1.15) 
to note from (12.1.14) that 
o(n|n) ~1+iyto(n|n), =1. (12.1.16) 
The latter means that if we write 
|n) ~ |n)o + |r), (12.1.17) 


then 
o(n|m), =0. (12.1.18) 


Hence from now on, we may choose |n), such that 
o(n|n); =0 (12.1.19) 


using the same notation as in (12.1.11). 

A non-homogeneous equation satisfied by |n), is readily obtained from 
(12.1.3) by using (12.1.9), (12.1.11) together with the eigenvalue equation 
(12.1.1) for the unperturbed system giving 


(H° — E?) |n), = (EZ, — M1) |n)o- (12.1.20) 


Since E}, is determined from (12.1.10) and |n), is supposed to be known, this 
equation may be used to obtain |n) of the perturbed system to first order. 

If the spectrum of H° consists only of a discrete non-degenerate one, then 
we may rewrite the right-hand of (12.1.20), by using in the precess of the 
completeness of the eigenvector of H°, as 


(Ey, - H,) In) = En In) 9 _ S- |™)¢ o(m|Hy|n) 9 


= Ex, |n)o — En |r)o — D2 Im) 0(m|Hiln), 
meén 
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=— S© |m)o0(m|Ai\n), (12.1.21) 
meén 


where in writing the second equality we have used (12.1.10). From (12.1.20), 
(12.1.21) gives 


In), =aln)yo + S> |m)o \ Tm) o(m|Hi|n)o (12.1.22) 


meén 


where a is a constant. In general, the vector a|n),g is introduced since for a 
Hamiltonian with a discrete non-degenerate spectrum, |n)), up to a multi- 
plicative constant, is the only vector annihilated by H°® — E® in (12.1.20). 
Upon multiplying (12.1.22) by 9(n| and using (12.1.19), we may infer that 
a=0. 

To first order, we then have from (12.1.11), (12.1.22) 


=In)ot+ >> Im), Eom o(m|Hi|n)o. (12.1.23) 


meén 


If o(m|H,|n), in (12.1.10) is zero, one may go to second order in H;. To 
this end, we note from (12.1.6), (12.1.5), that the expression for the energy 
shift may be rewritten as 


olr|Ailn)y | o(n|#ilny’ 
1+o(n|ny’ 1 +o{n|n)" 


AEn = (12.1.24) 


With the orthogonality relation in (12.1.19), AE, up to second order in Hy, 
is then given by 


or 


Ey, ~ E+ o(n|Ai|n)9 + 0(n|Hi|n), - (12.1.26) 
Using (12.1.23) this may be rewritten as 


lo(m vile |? 
En = EX + o(n|Hiln)y + >> Bo eds (12.1.27) 
meén 


1 


As an application, though a formal one,” consider the Hamiltonian 


py ww 
Pot — + Ax? (12.1.28) 


with the interaction term H, = \x? added to the harmonic oscillator Hamil- 
tonian §6.1 in one dimension. 


' Here it is sufficient to note that for \ < 0, x — oo or for A > 0, > —oo, the 
interaction term — —oco being unbounded from below in either cases. 
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Since Hj is odd in «, and the harmonic oscillator wavefunctions ¢,(x) 
(§6.1, (6.1.39), (6.1.41), (6.1.42) have definite parities, i.e., |Up(x)|? is even, 
we obtain from (12.1.10) 


Be =): (12.1.29) 
To second order, we have to evaluate 
2 
2_ yr lolm|AaIn)o! 
E? = ye ~~ Bo (12.1.30) 


where E? = hw (n+ 1/2). By taking advantage of the completeness relations 
o(m|x?|n), = S_ o(mlalm’)y o(m’ |x?|n), (12.1.31) 


and so on, and using, in the process, such matrix elements as in (6.1.45), we 
obtain the selection rules m = n+3, m = n+1 for the non-vanishing of 
o(m|x?|n),, leading to 


3/2 
o(n+3|a°|n), = (==>) J(nt1)(n+ 2)(n +3) (12.1.32) 
3/2 

o(n — 3|a9|n), = (+) n(n — 1)(n — 2) (12.1.33) 
hh 3/2 

o(n+1|2°|n), =3 (=*:) (nt+1)Vn+1 (12.1.34) 
3/2 

o(n—1|2*|n), =3 (==) n/n. (12.1.35) 


From (12.1.30), (12.1.27), one readily obtains 


3 2 
En & hw (n+ 5) ( 7 ) ies ae (12.1.36) 


2 2mw hw 


to second order in 4. 
For the eigenvector |n) to first order in A, we have from (12.1.23) and 
(12.1.32)-(12.1.35), 


3/2 
|n)  |n)y - 5 (4) J (n+ 1)(n + 2)(n + 3) |n+ 3), 
Or Ge es 
aan (4) n(n — 1)(n — 2) |n — 3), 


a3 ee (n+ 1)V/n+1|n+1)o 
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d d 3/2 
ae. (4) nJ/n|n—1)o (12.1.37) 


and we note that i” | n), = 0 where In), consists of the last four terms in this 


equation. 

The series in (12.1.27) may be extended to arbitrary orders for a Hamil- 
tonian with a discrete non-degenerate spectrum. Such expansions are referred 
to as Rayleigh-Schrédinger series. 


12.2 Degenerate Perturbation Theory 


Suppose that H° in the discrete spectrum of a Hamiltonian H® is k(n)-fold 
degenerate, 
H In,v(n))o = Ex, |n, U(n))o (12.2.1) 


where the eigenvector |n,v(n))), v(n) = 1,...,k(n), may be chosen to be 
orthonormal. Now we add a perturbation H, to H®, defining a Hamiltonian 
HT, and consider the eigenvalue problem 


H|n) =, |n). (12.2.2) 


When the perturbation Hj is let to go to zero, |n) may not necessarily go to 
one of the particular eigenvectors |n,v(n))) but, in general, to some linear 
combination of them. Accordingly, we introduce an eigenvector of H® of the 
form 


k(n) 
In) = Me Gy (n) In, y(n))o (12.2.3) 
y(n)=1 
with some expansion coefficients a,(n), v =1,..., k(n). 


Upon multiplying (12.2.2) from the left by 9(n,v(m)| and using (12.2.1), 
we obtain 


AE, o(n,v(n) |n) = o(n, v(n) | Ai |n) (12.2.4) 


where AE, = E,, — E®. To lowest order in Hj, we take |n) to coincide with 
|n)9, and use (12.2.3) to rewrite (12.2.4) as 


S> [Eko — o(n, v(m) |Hiln, v/(n)) 9] avr(n) = 0. (12.2.5) 


v'(n) 


where AE, = E} to first order. 
To have a non-trivial solution for the a,(,) it is necessary that 


det |E) du — o(n,v(n)|Hy|n, v'(n)) | = 0 (12.2.6) 


otherwise one may be able to invert the matrix 
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[Mi] = [En ov — o(n, v(n)|Hi|n, v'(n))o] 


and get zero for the a,(n). Equation (12.2.6) is of a k(n)" order one in term of 
the matrix elements 9(n,v(n) |Hi|n,v‘(n)),9 and is called the secular equation. 

From (12.2.6), we may infer that if, a@ priori, the eigenvectors |n,v‘(n)) 5 
are properly chosen, by considering suitable linear combinations of eigenvec- 
tors of H® with eigenvalue E}), such that [o(n,v(m)|Hi|n, v/(n)) 9] are diago- 
nal, them the matrix [M,,] would be also diagonal and the analysis leading 
to (12.2.6) would simply give the solutions E},, = (n,v(n)|Hi|n,v(n)). 

As an application, consider the Hamiltonian of the hydrogen atom in a suf- 
ficiently strong uniform magnetic field such that the fine-structure contribu- 
tion Vr may be neglected as given in (7.9.13), and we treat the e?A2,, /2Mc? 
term as a perturbation. That is, we set 


H=HH°+H, (12.2.7) 
where 
eB 
H® = Ho —- —— (L.4+ 28, 12.2. 
C auc | +25.) ( 8) 


(12.2.9) 


(see (7.9.5)), where B = (0,0, B), A = B(—y, z,0)/2, and He is the hydrogen 
atom Coulomb Hamiltonian. The eigenvalue of H® were given in (7.9.15) 


ehB 


a s) = EC ms = 
(n,m,m ) n + (m+ m ), 7 Mc 


Ss (12.2.10) 


ms = +1/2, and the eigenstates are given in (7.9.14). Here EY = —Ry/n?. 

Consider n = 2. In particular, we note that the eigenvalue E° in (12.2.10) 
is degenerate for (¢ = 1,m = 0,m, = 1/2) and (€ = 0,m = 0,m, = 1/2), 
corresponding to eigenstates 


Ros (r)¥10(®) @ =), — Roo(r)Yoo(2) (4) =6, — (12.2.11) 


with eigenvalue E° = EY +n. We note that (2? + y?) = r?sin? 6, and from 
the identity (5.8.41), used twice, that cos? Yom is a linear combination of 
Ye+2.m and Yp,. Accordingly, from this, or by direct computation, we may 
infer that (Yoo (1 — cos? 0) |Yeo) is diagonal for @, ’ = 0,1, and from (5.8.43) 
or by direct evaluation 


2 
(Yoo | (1 — cos? 8) |Yoo) = (Yio |(1 — cos? 6) |Yio) = z: (12.2.12) 


2 
3 d 
On the other hand from (7.3.35), 
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(R20 |r?|R2o) = 4205, (Roi |r?|Ro1) = 30 a6 (12.2.13) 


where ag is the Bohr radius. 
The matrix M in (12.2.5), (12.2.6) is then in diagonal form leading from 
(12.2.12), (12.2.13) to the splitting of the level EY + 7 to 


C e*B*\ C eB?’ 


12.3 Variational Methods 


Variational methods will be used in Chapter 13 in minimizing a functional 
of the electron density in the so-called Thomas-Fermi atom (§13.1) as a first 
step in obtaining an expression for the ground-state energy of atoms as a 
function of the atomic number Z, and in Chapter 14 in the investigation of 
a “no-binding” theorem in the process of establishing the stability of matter. 
The variational method considered in the present section in its simplest form 
is the following one. 

Given a Hamiltonian H, the problem considered is to choose a trial wave- 
function W, depending, in general, on one or more parameters, to minimize 
the following expectation value 

(U | AY) 

F[Y] (U]U) (12.3.1) 
considered as a functional of WV. In practice, such a minimizing is achieved 
by optimizing FW] over the parameter(s) on which a priori chosen V may 
depend. 

From the very definition of a ground-state energy, or just the lowest point 
of the spectrum of a Hamiltonian, such a procedure cannot provide a lower 
bound to it. Thus given a trial wavefunction, it will provide, in general, an 
upper bound? to the exact ground-state energy Ep (Chapter 4). The assess- 
ment of the accuracy of such a bound as an estimate of Eo is, in general, not 
always an easy task. On the other hand, if one also derives a lower bound to 
Eo and it turns out that both bounds are close to each other, then such a 
procedure would provide an excellent way of estimating Eo. 

As an example, consider first the inequality in (3.1.8) rewritten, for the 
simplicity of the notation, in terms of a variable z in such units that in one 
dimension 


L, e5 d? 1d 
a ea 3. 
7 (0) < (-F5 - $e Gua) (12.3.2) 
For example, for 


? That this method leads to an upper bound for the ground-state energy, in general, 
is not sufficiently emphasized in the literature. 
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g(z) =44 22? (12.3.3) 
the inequality (12.3.2) leads to 


Vee (-5 + v(2)) (12.3.4) 


generating an anharmonic potential with fixed couplings (see also Prob- 
lem 12.8), 
V(z)a 2? +424 +28. (12.3.5) 


As a trial wavefunction for the ground-state of the Hamiltonian 


d2 
we may choose 
U(z) = : fi 12.3.7 
(2) 7 (27) 1/4g1/2 exp Ao? ( a ) 


where the parameter o > 0 will be chosen optimally. Upon substituting 
(12.3.7) in (12.3.1), (12.3.4), we obtain 


1 
2<Ey< FW =75 +07 #120" + 150°, (12.3.8) 
oO 


The expression on the extreme right-hand side of (12.3.8) is minimized for 
o? about .188, leading to the satisfactory bounds 


2 < Ey < 2.0416. (12.3.9) 


The above interesting method was used in obtaining the exact ground- 
state energies of the harmonic oscillator (86.1), of the hydrogen atom (§7.1) 
and for the Dirac delta potential (§4.2), by appropriately choosing, in the 
process, the trial wavefunctions in such a way that the resulting upper bounds 
coincide with the respective lower bounds, and this procedure may be applied 
to other cases as well (see also (3.3.6), (3.3.7), (4.2.83), (4.2.84)). 

It is easy to apply the above method to generate, a priori, potentials with 
variable couplings. For example, consider the function 


g(z) = z a —“+e4+V2X2z (12.3.10) 


n (12.3.2), where A > 0,c > 0 are constants. This leads to 


+ (20(2))? — 5 (z(2))' = 5 (« =) | (f , >») 2 


r r 
+ (2 ez? + 37. (12.3.11) 
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Using the elementary bound 


AX 3 je cz Died e, HO 
2 = < 3. 
[eee 2( 32) ($)<}: 2 (12.3.12) 


5 (2a(2))? - ; (zg(z))' < ; (« +) + ; (2 + ) 2 +dz* (123.13) 


we have 


thus obtaining 


1 Ar 2 age AA sy 4 
< oO. 
5 (« a) <f i 5 (2+=): az!) (12.3.14) 


generating an anharmonic potential V(z) = \oz? + Az*. The usefulness and 
limitations of this inequality in deriving bounds on the ground-state energy 
is the subject of Problem 12.9. 

As another example, consider the hydrogen molecule in which one of its 
electrons has been stripped off forming an ion of net charge +|e|. To gener- 
ate the potential energy for the latter, we introduce a vector field (see also 
(3.4.22)—(3.4.24)) 


e2 ( x—-R/2 x+R/2 
F(x) = 5 (A5 tiga a) (12.3.15) 


where x denotes the coordinate of the electron and +R/2 denote position 
vectors of the two protons, R = (0,0, 8). Now invoking positivity 


| (z + vee | x) 20 (12.3.16) 
we obtain the simple bound 
- a (E?) x (2 -v-F) (12:3.17) 
and the explicit equality 
_V.F= e e (12.3.18) 


Ix —R/2|  |x+ R/2| 


On the other hand, F in (12.3.15) being equal to e?/2 times the sum of 
two unit vectors, we have 
F? < e4. (12.3.19) 


From (12.3.17)—(12.3.19), we obtain 
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Ime?  €? p” 
< 3. 
ats (F = i) (12.3.20) 
with R = |RI, 
e? e? e? 


V(x) = 


K-R/| +R R ae) 
where in writing (12.3.20), we have added the term e?/R on both sides of 
(12.3.17). V(x) denotes the potential energy of the one-electron (ionized) hy- 
drogen molecule, and p?/2m + V(x) denotes the corresponding Hamiltonian 
taking into account the fact that the mass of the proton >> mass m of the 
electron. 

As atrial wavefunction for the electron, we choose the normalized function 


3/2 
W(x) z lex ( B\x >) + exp (-2 


x + =|) (12.3.22) 


~ \/anN 

where 
2 

N=y,/1+ (1 +A+ *) exp-A , A= GR (12.3.23) 

and the parameters 3, R will be chosen optimally. 
If we set 
3/2 3/2 
o1 = exp ( B\x =|). ba = ew ( B\x4 >|) (12.3.24) 


then from symmetry 


(V |W) = [br [Hbn) + (62 Lb) (12.3.25) 


The computation of the above matrix elements is straightforward (see Prob- 
lem 12.10) but tedious and are explicitly given by 


2 2 
(¢1|H|¢1) = ae E é4 (1 5) | (12.3.26) 
2 2 2 2 
(¢2|H|¢1) = Fae E (: + A we 2f (1+ A)e 4 (N 5 He 
(12.3.27) 


where ap = h?/me?, and we have introduced the parameter, 
€ = Bag. (12.3.28) 


From (12.3.25)—(12.3.28), we then have the following upper bound for the 
ground-state energy of the molecule 
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e 1 £2 
Eo < ao N70) E A()) £B(0)| (12.3.29) 
where 
A(A) =1+ (1 +rA- *) e (12.3.30) 
B(A) =1- (1 + 7 eA + (> +1- 7 e (12.3.31) 


and N = N(A) is defined in (12.3.23). 
Optimizing the right-hand side of (12.3.29) over € gives 


€ = B(A)/A()). (12.3.32) 
Upon substituting (12.3.32) in (12.3.29), we obtain 


e? ~— B?(X) 


Eo < . 
OS 2a N2(0) A(A) 


(12.3.33) 
Optimizing the right-hand side of (12.3.33) over \ yields \ ~ 2.48 leading to 


2 
€ 
Eko <-— 1.1 12.3.34 
0 5a 73 (12.3.34) 
and € = 1.238. From € = Gao, = GR, these give R = 2.003 a. All told, 
(12.3.20), (12.3.34) lead to the bounds 


me?* me* 


— Gyr 3 < Box 


See PAT: (12.3.35) 


By introducing a more complicated trial wavefunction depending on more 
parameters, one expects that the upper bound in (12.3.35) may be further 
reduced. The control of the lower bound, however, is more difficult. 

Needless to say, the variational procedure may be carried out, optimizing 
the expectation value in (12.3.1) over the parameters that a trial wavefunction 
may depend, even if no lower bound to the exact ground-state energy Epo is 
available. When both upper and lower bounds to Ep are derived one may, 
or may not, be able to assess this estimation depending on how close, or 
not close, these two bounds are to each other, respectively. The first excited 
energy level to a given Hamiltonian may be also estimated by the variational 
procedure by optimizing the expectation value in (12.3.1) by choosing a trial 
wavefunction which is orthogonal to the one chosen for the ground-state, 
and extend further the analysis in a similar fashion, for other excited energy 
states. 
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12.4 High-Order Perturbations, Divergent Series; Padé 
Approximants 


In this section, we investigate the nature of the high-order terms in the 
Rayleigh-Schrédinger perturbation series for the so-called anharmonic oscil- 
lator involving a x* term in the potential energy in addition to the harmonic 
oscillator x? one. This interaction turns out to be important as the pertur- 
bation series diverges and provides a prototype for other divergent series and 
has also been useful in clarifying related aspects in field theory. We first de- 
rive a dispersion relation that relates the ground-state energy for positive 
coupling, for which the theory is defined, in terms of an integral restricted to 
negative coupling. This integral leads to a formal expression for the pertur- 
bation coefficients of the ground-state energy to any order. We then carry out 
a qualitative study of these coefficients, using path integrals, which clearly 
shows that they grow factorially with the order for high orders implying the 
divergence of the perturbation series. Finally we comment on a procedure 
which re-arranges the perturbation terms, referred to as the method of Padé 
approximants, that converges, in the limit, to the actual energy value. 

Consider an analytic function f(£’) of a complex variable €’ in the complex 
cut-plane shown in Figure 12.1. Cauchy’s Theorem implies that 


ey re 


where € lies within the contour away from the cut axis. Assuming that there 
is no contribution to the integral from the circles of radii R — oo, 6 — 0, we 


obtain ; ‘ Dise f(é') 
, Disc 


where Disc f(£’) is the discontinuity of f(&’) across the cut, i.e., 


(12.4.1) 


Disc f(é’) = f(é’ tie) — f(é —ie), ée— +0. (12.4.3) 


The variable € will be chosen to be real and positive for which f(€) is real. 
Equation (12.4.2) is referred to as a dispersion relation relating f(€) to the 
discontinuity of f(€’) across the cut. 

One may subtract from f(&) its value at € — +0, obtaining the once 
subtracted dispersion relation 


adpyvect | eae DIE) 
f(E) — FQ) = 5- [gee ae — 8) (12.4.4) 
A formal series expansion 


fO-fO = >> fxlO* (12.4.5) 


K>1 
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Fig. 12.1. Contour of integration C in the complex cut-plane €’ used in 
(12.4.1). The radii of the circles are taken to be R > 00, 6 > 0. 


then leads to the following expression for the expansion coefficients 


0 ’ 
Hoe =|. ayn Disc f(£’) (12.4.6) 


€5 =e (s) (12.4.7) 


K>1 
in (12.4.4). 
In the notation of (12.3.6), we consider the Hamiltonian 
d2 
H=-—5 4274+ 2", A> 0. (12.4.8) 


dz? 
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The nature of the potential V(z) = z* + Az* is quite different for A > 0 and 
A <0 (see Figure 12.2). In the latter case, the potential develops unstable 
states with energies involving imaginary parts and decay through the poten- 
tial barriers of finite widths. Such an imaginary part for the ground-state en- 
ergy E(A), when A is continued to |A|exp(+iz) from » > 0, is what is needed 
to obtain the expansion coefficients Ex in a formal perturbation expansion 
(A > 0), 


E(\) — E(04) = So Ex(\)* (12.4.9) 


K>1 


using an integral as in (12.4.6). In the present notation used E(0+;) = 1, 
corresponding to the harmonic oscillator potential. 


Fig. 12.2. Shapes of the potential energy for positive and negative A. For 
A <0 the potential develops unstable decaying states with energies involving 
imaginary parts. The potential energy in the latter case is also unbounded 
from below. The figures are not based on actual numerical values. 


For <0, the potential is unbounded from below, with the properties 
of the system quite different from that with \ > 0, and the theory cannot 
just be extended from > 0 to A < 0. Technically, this corresponds to the 
fact in the complex A-plane, E(\) may be analytically continued at most 
only to a cut-plane with the cut along the negative real axis of X. This, in 
particular, leads to a discontinuity of E(\) along the negative real axis, i.e., 
[E(A +10) — E(A — i0)] 40 for A < 0, and gives rise to an imaginary part 
for the energy E(A) as discussed above. 

Assuming the analyticity property of E(A) in the cut-plane and that no 
contribution arises from integrations over the circles of radii R — co, 6 — 0 
in Figure 12.1 in the complex A-plane, 


E(\ +i0) — E(A —i0) = 2ilm E(A + i0) (12.4.10) 


for the discontinuity across the cut, we may infer from (12.4.6) that 
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Im B( 
=f dd me) (12.4.11) 


for the coefficients Ex in (12.4.9), where Im F(A) = Im E(\ + i0) approaching 
from above the negative axis. We will not, however, go into the proof of the 
analyticity property of E(A)® for which (12.4.10), (12.4.11) are assumed to 
hold. 

The perturbation series for E(A) in (12.4.9) also diverges. We will provide 
a qualitative analysis of this fact by using path integrals. Before doing this, 
however, we will show how the ground-state energy may be formally extracted 
from a time evolution analysis if time t is replaced by —if@ and the limit 
(2 — oo is taken. 

To the above end, suppose that E,; > Eo denotes the energy level of a 
given Hamiltonian H just above the ground-state energy Eo, then 


(ole Wy) = faPx ep 
Eo 


= oP | Palo) ol? + ff || P(A) ¥||° aa 
; (12.4.12) 


where Py(A) is the spectral measure of the given Hamiltonian ($1.8). Upon 
taking the limit G — oo, we obtain 


(ah |e~P# |p) —s e~ 9 || Pr(Eo) |” (12.4.13) 


for Px(£o) |t) not the zero vector. 
Consider also, for example, the Green function of the harmonic oscillator 
given in (9.2.23). For 2’ =a, T — —ihG, one has explicitly, 


mw i 2 
2rih sin wT 


imwa? 
x exp 


1/2 
_, gA(hw/2) (=) a (-*.") (12.4.14) 
TT 


G(«T; x0) one | ( 


[cot wT’ — csc wT] ) 


T=-ihp 


for 3 — co, where, again, we recognize the coefficient of (3, in the first expo- 
nential, denoting the ground-state energy. 

To obtain the imaginary part of F(A) for A < 0 in (12.4.11), (with A taken 
to approach the negative axis from above), we consider the imaginary part 
of the trace of the operator exp(—3H) for @ large, with the trace normalized 


3 For the relevant details see, Loeffel et al. (1969), and references therein. 
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with respect to the corresponding one for the harmonic oscillator one. For 
K — oc, we may also consider \ ~ 0_. That is, we investigate the nature of 
the function 


1 
F(A, 8) =Im (Z Tr [-*"]) , A<O (12.4.15) 
where : 
C= Tre "2 "| (12.4.16) 
0 d? 2 
Pot (12.4.17) 


in the above mentioned limits. 


For @ large and A ~ 0_, 


e-B[1+ O(a) +iAE] 
) (12.4.18) 


F(A,8) ~ Im (—— 


where O(A) denotes the perturbative real correction to the harmonic oscil- 
lator ground-state energy equal to 1, in the units used in (12.4.8), (12.4.17), 
and AEF is the imaginary part of the corresponding unstable energy (see 
Figure 12.2) arising for \ < 0. We will see that, self consistently, AF is ex- 
ponentially small for 4 ~ 0_, and hence we may write 


F(A, 8) ~ —BAE. (12.4.19) 


By a path integral representation of F(A, 3), we will learn that for @ large, 


A~ 0_, F(A, B) leads to 
2/3A 


vV—r 
for some constant c of order 1 independent of \ (and, of course, independent 
of 3) in the above limits. 

Upon substituting the expression in (12.4.20) for Im AEF in (12.4.11) and 


explicitly carrying out the A-integration, we obtain (K — oo), up to an overall 
numerical factor of order 1,4 


AEw~c (12.4.20) 


Ree (pe (3) rcs 3); (12.4.21) 


where I'(/ + 1/2) is the gamma function with integral representation 


r(K+3)= | du e~*(u)*-1/2, (12.4.22) 


4 This numerical factor is given by 2,/ 6/73 ~ 0.88. This factor is not essential in 
showing the growth of Ex factorially in the order K. The explicit behavior in 
(12.4.21) was first derived by Bender and Wu (1971). 


700 12 Approximating Quantum Systems 


To carry out the integral (12.4.11), first make the substitution 4 — —A, and 
then introduce the integration variable u = 2/3. The coefficients Ex grow 
factorially for K — oo leading to a diverging Rayleigh-Schrédinger perturba- 
tion series. 

To derive (12.4.20), we first write the path integral expression for F(A, 3) 
in (12.4.15) with \ < 0,° 


F(),8) =m (Z Tr 4 


a (z Tr fale P(2(+)) exp Hew |- if va (2 +2(r) +4 rte) 


with the boundary condition z(—G/2) = z(G/2), and we consider the limit 
3 — co, where formally (see (10.1.4)) 


nN. \N/2 (Nz 
aim (<5) (11 a) ' (12.4.24) 
w=1 


9 (z(-)) = li 


Let z(t) = y(t) + 52(t), where 7(t) defines classical path(s) and 5z(t) the 
deviations of z(t) from 7(t). n(t) satisfies the differential equation® 

(4 

mo) — 2n(t) — 4An3(t) = 0 (12.4.25) 


with boundary condition (—6/2) = n(G/2), 6 — oo. By direct substitution 
n (12.4.25), the solutions, for G — oo, are readily verified to be given by 


+1 
V—X cosh [2(t — to)] 


where 7(—6/2) > 0, (G/2) — 0 for BG — ~, to arbitrary. 
Now for 1 ~ 0_, we consider the contribution due to derivations about 
the classical paths in (12.4.26). To this end, we first note that 


Jim, exp (- [ioe oe +9(7) + wwe] = exp (=) (12.4.27) 


and for further reference that 


n(t) = (12.4.26) 


> We note that classically the Lagrangian corresponding to the Hamiltonian 
p? + 27 + Az’ is given by (27/4) — z? — Az* which upon the substitution t > —it, 
leads to the exponential factor in (12.4.23). 

6 We use a method developed in: Zinn-Justin (1981). 
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|| = (a #0) eae (12.4.28) 


both of which follow directly from (12.4.26). 
In terms of the deviations 5z(t) about 7(t), with boundary conditions 


52(+3/2) = 0, 


lie (2 + 27(r) + Azt(r)) ~ —— + A(5z) (12.4.29) 


eee 1d? 6 
A(d =) dt 8 + 1 bz. 12.4.30 
2) 8/2 “\ dae cosh? [2(t — to) ° ( ) 


Here we have used (12.4.25), (12.4.26), integrated by parts over t using 
the boundary conditions on 5z(t) mentioned above, and noted that the co- 
efficients of the terms involving (5z)?, (5z)* vanish for 4 — 0_. We note 
that fo(t) = 7(t)/||7|| is a normalized eigenvector of the operator M = 
[—d?/4dt? + 1 — 6 cosh” ?(2(t — to))] with eigenvalue 0, i-e., M fo(t) = 0. 
The arbitrariness of the parameter to in (12.4.26), corresponding to mul- 
tiple solutions of (12.4.25), gives rise to an associated degree of freedom in 
the problem. Instead of working with the parameter tp, we may consider the 
scaled one —||7)||to for a reason which will be clear below. We carry out a 
change of the path-integration variables in (12.4.23) as follows. We supple- 
ment the normalized function fo(t) = 77(¢)/||7|| by an infinite set of functions 


fit), fo(t), ... such that we may generate a mutually orthonormal set of 
functions fo(t), fi(t), fo(t), ..., and expand n(t) + dz(t) as 
2(t) = n(t) + S— dz: fi(t). (12.4.31) 
21 


In particular, we note from (12.4.26), (12.4.31) that 


dz(t) 1 (Fe) 7(t) 
Cees — TN) = pt 12.4.32 
Alto) — Tat (to) = Tay = PO ee 

Oz(t) . 

—— = f(t >1 12.4.33 
0(dz;) Fi ( ), J ( ) 

and we have thus carried out a transformation : 
(2(t1), z(t2),---) ———> (—ll"llto, 521, 5z2,.-.) : (12.4.34) 
Since fo(t), fi(t), ... are chosen to be orthonormal, the Jacobian of the 


transformation is one. 
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From (12.4.23), (12.4.28)—(12.4.30), (12.4.34), we then have for 3 — oo, 
Aw OL 


Im ( 5 Tr [e~P# Sein Sal aloe Ald 
m(Z r [e i) as G rf. of (52(-)) exp [—A(5z)] 

(12.4.35) 
up to a multiplicative numerical factor independent of \ and (@ for \~ 0_, 
Bo. 

The purpose of the exercise in the transformation carried out in (12.4.34) 
was two-fold: (1) It was to show that the degree of freedom associated with 
the parameter to, corresponding to multiple classical solutions 7(t), lead to a 
multiplicative factor ||7|| ~ 1//—A as an eztra factor to exp(2/3\), coming 
from (12.4.27), as spelled out in (12.4.35). (2) The action in (12.4.35) being 
quadratic, we may infer, for example, from a generalization of the Gaussian 
integral in (9.8.20), that the path integral in (12.4.35) ~ (det M)~!/?, and 
det M does not involve the zero eigenvalue corresponding to fo, which has 
been already extracted in the infinite to-integral for 3 — oo, and is indepen- 
dent of to in this limit. 

The to-integral in (12.4.35) may be then explicitly carried out giving an 
overall @ factor in (12.4.35) for 3 — oo. Since for \ < 0, the system is unstable 
(see Figure 12.2) (det M)~1/? must have an imaginary part and, otherwise, 
all the coefficients in (12.4.11) will be zero. Technically this happens” because 
M has also a negative eigenvalue making det M negative. 

Because of the normalization factor 1/C, the leading net 3, \-dependent 
multiplicative factor for 8 > co, A ~ 0_, in F(A, Z) in (12.4.35) is then just 
—Bexp(2/3A)/V—A, simply up to a numerical factor of order 1, indepen- 
dent of A, @ in the above mentioned limits. Upon comparison of the behavior 
F(A, 8) ~ —Bexp(2/3A)/V—A with (12.4.19) gives (12.4.20) thus obtaining 
the result stated in (12.4.21). Although the “action” in (12.4.35) is quadratic 
and hence the path integral may be explicitly carried out, the actual mag- 
nitude and sign of this overall \-independent factor, for 4 ~ 0_, are clearly 
not essential to establish the factorial growth in K for K — oo as given in 
(12.4.21) (see also Problem 12.12). 

Over years, several investigations were carried out in re-summation meth- 
ods and/or re-groupings of various terms in perturbation expansions of diver- 
gent series, as of the type studied above, leading in some cases to convergent 
results. One of these methods is that of the Padé approximants one discussed 
below which leads to a convergent result for E(A). 

In the Padé approximants method, one formally replaces a_ series 
Ye @KA*, representing a function F(X), by a double-sequence of ratios of 
two polynomials 


” See Zinn-Justin (1981); see also Auberson et al. (1978). 
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N 
Se eu 
n=0 
M 
SS Bn” 
n=0 


Bo = 1, and chooses the (N + M +1) coefficients Ao, Ai,..., An, Bi, ..., 
By in such a manner that the first (N + M + 1) terms in the Taylor series 
of P!N-™1()) coincide, term by term, with the first (N + M +1) terms of 
the power series 0, aK A*. PIN-M1() is referred to as a Padé approximant 
associated with the original formal power series in question. 

The interesting situation arises if P'!“1(\) converges for N, M — oo to 
the actual function F(A), even if >. a«KA* diverges. 

As an example of a Padé approximant, consider the Rayleigh-Schrédinger 
series for the ground-state energy of the anharmonic oscillator with Hamil- 
tonian given in (12.4.8). To second order in A (see Problem 12.1), 


PINMI()) = (12.4.36) 


21 
E(\) =1+ oA _ ra +e (12.4.37) 


and the Padé approximant P!!:4()) is given by 


[1,1] Le 3A 
PEN (Xd) io Th (12.4.38) 
vA 


as is easily verified (see also Problem 12.13). 

The above method has been successfully used in the literature® showing 
that the Padé approximants P!-%](\) converge for N — oo to the actual 
value for E(\). For example, P!°1°1(0.1) = 1.065 285 509 535, P?°9l(0.1) = 
1.065 285 509 543, while the actual value (0.1) = 1.062 285 5 + 0.000 000 5. 
And P!91](1) = 1.392 102 495 074, P!?°29l(1) = 1.392337 481 861, while the 
actual value (1) = 1.392 751 + 0.000 620. 

As mentioned above, there have been many other re-summation proce- 
dures introduced recently and the reader may wish to consult the relevant 
journals for related details. 


12.5 WKB Approximation 


12.5.1 General Theory 


Consider the Schrédinger equation for a stationary state of energy E, in 
one dimension 


8 For the relevant details see, Loeffel et al. (1969), and references therein for values 
of E(A). 
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t= nn re) W(x) =0 (12.5.1) 


where 


p’ (x) = 2m(E — V(a)). (12.5.2) 


Within an interval, when V(x) is constant, the formal solutions of (12.5.1) 
are of the form exp(+ixp/h). More generally, one may formally introduce, 
locally, a scale A(x) = h/|p(a)| — referred to by some as a reduced lo- 
cal de Broglie wavelength of the particle in question. We consider potentials 
V(a) which vary slowly over distances of the order X(2) in the neighbor- 
hood of x. By setting AV = AzxV'(x), then for a change Az of the order of 
X(x), for a slowly varying potential over X(x), we take, as a rule of thumb, 
|AV| ~ | A(x)V"(a)| « h?/(m X?(x)). The latter condition may be equiva- 
lently rewritten as |A A] ~ | A(x) A’(x)| < | XA(x)| signifying that the approx- 
imation sought corresponds to cases where A(x) varies slowly over a distance 
of the order A(x) itself. This constraint in turn, may be rewritten formally 
in terms of the local classical “momentum” |p(x)| as |p'(x)/p(x)| « |p(a)|/h.9 
Such a restriction will be derived more precisely below. 

To find the solutions of (12.5.1) under the above stated condition, we set 


1 
W(x) = a exp S(z) (12.5.3) 


Na 


where S, and its derivative, are unknown. The specific way of writing W(x) 
in this form turns out to be convenient and simplifies the analysis to some 
extent. In any case S(x) is unknown and will be, self consistently, determined 
by substituting the expression for w(a) in (12.5.1). We explicitly have 


I cua eee ie cae 
uO E (=) 3 (=) 7 oF ro 
leading from (12.5.1) to 


(S')? 4 ao ! oy 5(3) =o. (12.5.5) 


For p(x) a constant, we have seen above that S(x) is linear in x, and 
hence S’(a) is a constant. More generally for slowly varying 5S’, we neglect 
the last two terms in (12.5.5) and, in turn, investigate the nature of this 
approximation. To this end, we obtain 


sea e (12.5.6) 


° For p(x) 4 0, the condition in question, written in this last form, corresponds 
formally to a h ~ 0 analysis. 
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Upon substitution of (12.5.6) in (12.5.5), we may infer that this approxi- 
mation may be carried out provided 


«< Pe (12.5.7) 


and obtain the asymptotic solution 


wa) ~ a exp (; fae plo) +2 exp (=; [ax v(e)) (12.5.8) 


where C1, C2 are some constants. This approximation method is referred to 
as the WKB approximation. !° 

Clearly, the approximation breaks down at such points, called turning 
point,!! where p(x) = 0. A turning point is shown in Figure 12.3, where 
V'(a) > 0. 


Fig. 12.3. The figure shows a turning point at x = a, where V(a) = E, and 
hence p(a) = 0. At this point, the approximation in (12.5.8) certainly breaks 
down. Here V’(a) > 0. 


To find out how close can x come to the point a before the approximation 
breaks down, we carry out the expansion 


(x—a)? 


p*(x) ~ —2m |V'(a)(x — a) + V"(a) 5 


(12.5.9) 


where, here, V’(a) > 0. 
For 


10 WKB stands for three of several contributors to this method, namely, G. Wentzel, 
H. A. Kramers and L. Brillouin. 

11 This terminology is taken from classical mechanics, where p(x) = 0 means that, 
at such a point, the kinetic energy of a particle is zero as the particle reverses 
its motion. 
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|V" (a)(a — a)| « 2|V'(a)| (12.5.10) 


we have 


p’(“) ~ 2mV'(a)(a— 2). (12.5.11) 


In the neighborhood of the point a for which (12.5.10) is satisfied, we may 
use (12.5.9) to rewrite (12.5.1) as 


Ee eG = 2) BEG (12.5.12) 


where we have denoted the wavefunction in this region by x. Upon making 
a change of variable x to 


K = (2mV"(a) /h?)*/3 (x — a), (12.5.13) 
we obtain for (12.5.12) 
d2 


The special functions satisfying this differential equation are called Airy 
functions? and a pair of linearly independent solutions are denoted by Ai(r), 
Bi(x«) having, in particular, the following properties: 


: Bi(0) _ 5 
Ai(0) = — =3-27(2 12.5.15 
(0) Fa (3) ( ) 
and for & > 0, 
1/4 
Ai(x) ——> RE (*) exp (-F0°”") (12.5.16) 
a Des 
. 2 an ies /2 
Bi(x) pees VE (=) exp & ) ; (12.5.17) 
On the other hand for « < 0, 
i fir" (; 7 
Aili sin { =|«|9/2 + *) 12.5.18 
om || +00 (na) gi" 4 
tapes (; 7 
Bi(k cos ( =| [8/2 + *) 12.5.19 
© oe ve la) gl +g a 


The general solution of (12.5.14) is 
, 1/3 , 1/3 
vie)= ani ( (22) a) + ami ( (22) (a — a) 


2 Cf. Abramowitz and Stegun (1972), p. 446. 
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with a, 3 some constants. We recall that in (12.5.14), x satisfies (12.5.10). 

Bi(«), unlike Ai(«), grows exponentially in (12.5.17) for x > a in the 
region & < V, and on physical grounds (see Figure 12.3) it is necessary to 
choose 6 = 0 in (12.5.20). Hence 


mV'(a)\ 18 
x(2) = a Ai (2) (Ce 0) : (12.5.21) 


In particular, we learn from the property in (12.5.15) that the solution is 
finite at the turning point a. 

Now we have to find a common region to the left of the point a in which 
the solutions in (12.5.21) and (12.5.8) are valid. Hence in the region, we are 
seeking, both solutions must coincide. 

To the above end, we note that the condition in (12.5.7) must be satisfied. 
This leads to 


(3)" <1< ae (a— ax) = |k\. (12.5.22) 


In turn (12.5.10) requires from (12.5.22) that 


m(V'(a))4 1/3 
JK] < va) (7 vt )) ) (12.5.23) 


for V" (a) # 0, for the justification of the neglect of the second term in (12.5.9) 
in comparison to the first one in a Taylor expression. 

To the left of the point a, ie., for x < a, conditions (12.5.10), (12.5.22), 
(12.5.23), then give & > 0, 


2 2m(V'(a))4\ 13 
1<|K]< IV"(a)] ( A ) (12.5.24) 


for V"(a) 4 0. From (12.5.18), we then write y(«) in (12.5.21), with « < 0, 
|x| satisfying (12.5.24), 


a | (2mV"(a) ue 7 vale 
ie) ~ (FE) ) 
| 2V2mvV"(a) 2,7 
x sin (3 ae a)3/? 4 *). (12.5.25) 


Now we compare this solution to the one (12.5.8) for z < a, subject to 
the expansion of p?() in (12.5.9) and the condition (12.5.10). To do this, 
we further approximate the solution in (12.5.8) by using the expansion in 
(12.5.9), (12.5.10) and note that 
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Lf? _ _22mV"'(a) 2, 1 | 2m V"(a)\(a- 2)? 
al de p(z) = ~~ (a— 2)” "0 \ V@ ih 


which may be rewritten as 


a da p(x) ~ — FIs)? {1 = ae (a- o} (12.5.27) 


From the condition (12.5.10), the second term in the curly brackets is 
very small in comparison to one and we may take 


expe [ dap(x) ~ exp (= (=) (=) VInVTa}(a—2)*) . (12.5.28) 


Hence from (12.5.7) and (12.5.10), we have for k < 0, V”(a) £0, 


2  (2m(V"(a))4\ 
1< |r| < Va) ( 7 ) (12.5.29) 


and the solution ~(a) in (12.5.8) takes the form 


w(x) ~ CO; (2mV"(a)(a — 2) /* exp (- (=) (=) 2mV"(a)(a — a) 


+ Co (2mV'(a)(a — 2)) 1/4 exp (+ (=) (5) 2mV"(a)(a— os!) ' 


(12.5.30) 


In writing (12.5.30), we have finally used (12.5.9), (12.5.10) again, and have 
used a lower limit a of integration for the integral in (12.5.8) and adjusted, 
accordingly, the coefficients Ci, C2. Obviously for the common region in 
(12.5.24), (12.5.29) both solutions in (12.5.25), (12.5.30) should coincide, ice., 


we must have 


aic,ei7/4 =~“ (amv'(a)ny’® =e 12.5.31 
1 an 
T 

2iCpe7i7/4 = a (2mV'(a)h)'/® =e. (12.5.32) 


These conditions on C1, Cz may be now used in (12.5.8). 
In summary, we have found the following approximations 


pay a5 des (; a da aia\ee *) (12.5.33) 


for!? x <a, (a—2x) > (A? /2mV"(a))*/8, 


3 Note that in writing the expression in (12.5.33) we have used the fact that 
sin(z — 7/4) = —cos(z + 7/4). 
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mV"(a)\18 
x(x) = a Ai (7R2) (a — ») (12.5.34) 


for ja — al < 2|V’(a)|/|V"(a)|, and the coefficients a, c are related through 
(12.5.31)/(12.5.32). 

Now we consider the situation that when we move to the right of point 
a, we encounter a second turning point, say, b, where V’(b) < 0. This is 
illustrated in Figure 12.4, where for a < x < b, V(x) > E, ie., p(x) is 
imaginary, and beyond the point b, V(a) < E. This will allow us next to 
study the problem of tunneling through the potential barrier. 


12.5.2 Barrier Penetration 


The expression in (12.5.33), may be rewritten as 


v(e) = ale) ~ ew (5 [ae ne) 


ae exp ( aa ae p(e)). (12.5.35) 


Hence we recognize the first and second terms in (12.5.35) as corresponding 
to amplitudes of incidence on and reflection off the potential barrier. 


Fig. 12.4. The figure shows two turning points at x = a and x = 6, at which 
V'(a) > 0 and V(b) < 0, respectively. For a < x <b, V(x) > E. 


Repeating the analysis leading to the approximation in (12.5.8), we may 
infer that for (a — a) > (h?/2mV‘(a))'/8, in the region 2, the approximate 
solution corresponding to that in (12.5.8) is given by 


tho(x) = 2— exp ( i | aclo(o)!) (12.5.36) 
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where we have used the fact that in region 2, p(x) is imaginary and we have 
selected the damped solution required by the physics of the problem. w2(2) is 
not valid at turning point b (and also at a). Below we will follow the procedure 
leading to (12.5.20) to investigate the nature of the solution near the point 
b. Before doing this, we investigate the approximate solution in the region 3. 

In the region 3, p(x) is real and only an amplitude of transmission may 
arise. Accordingly for |x — b| >> (f?/2m|V"(b)|)1/, 


b3(x) = ae (; [ae ve)). (12.5.37) 


p(2) 


In the neighborhood of the point 5, i-e., for |z — b| < 2|V’(b)|/|V’(0)|, for 
V(b) £0, the approximate solution of the Schrédinger equation (12.5.1) is 
obtained in a similar way as the one (12.5.20), except now V’(b) < 0, giving 


‘V(a) = al Ai (- Cog (x ») 3! Bi ( E2¢ (x — ») 
(12.5.38) 


and for 2|V/(b)|/|V"(b)| 2> 2 — b > (h2/2m|V"(b)|)/3, 2 > b, we have ac- 
cording to (12.5.18), (12.5.19), 


1/4 
\(2) > (=) (=) a’ sin (Fiat + 7) + 2" cos (Fis + *)| 
(12.5.39) 


where now |«| = (2m|V’(b)|/h?)!/3(a — b). 

By expanding p(x) in (12.5.37) about the point z = b, and finding a 
common region of the validity of the resulting solution with one in (12.5.39), 
carried out in a similar way as before, we obtain 


in/4/ 
po WRU 
awake (12.5.40) 


Pe. ne tite 
p= QmhiV"() ps" (12.5.41) 


On the other hand, for « < 6, 2|V’(b)|/|V(b)| > |a — b| > 
(hi? /2m|V"(b)|)1/3, we have from (12.5.16), (12.5.17) the leading contribution 


toy (x) . a 
‘x(z) > (=) (=) exp (Fin) (12.5.42) 


where |«| is defined below (12.5.39). The solution in the region 2 damps out 
as we move from a to the right on the way to b and, of course, the opposite 
arises as we move from 0 to a, as indicated in (12.5.42). 

To compare the solution in (12.5.42) with the one in (12.5.36) in a common 
region of their validity, we first rewrite (12.5.36) as 
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ae 7 7? 
wo(x) = exp (-if da i) pte) exp (i / dx pt) . (12.5.43) 


As a function of z, in the neighborhood of b (a < 6), 


1 1 f° 
=== ex => dz x 
( = (Ff inl ) 


1 2 
~ ya exP (sv 2mi Ole - sp) . (12.5.4) 


(2m|V"(b)||a — b)) 


Upon comparison of (12.5.43)/(12.5.44) with that in (12.5.42), in their 
common range of validity, we obtain 


b / 1/6 
an (5 i cote} p= CORE OIE y = neg (125.45) 


where we have also used (12.5.41). 

Finally, the comparison of (12.5.21) with that in (12.5.36) in the neigh- 
borhood of point x = a, in their common region of validity, by using in the 
process of (12.5.16) and expanding |p()| in (12.5.36) about the point a, gives 


= (2mhV"(a))/6 —— = © 12.5.46 
1 = QmbV'(a)) M5 = § (12.5.46) 
where we have also used (12.5.32). 

Upon setting c/2 = 1 in (12.5.35), and taking advantage of the equalities 
(12.5.41), (12.5.45) and (12.5.46), we may infer that 


1 7° 
c =e'7/4exp (-if da pt!) (12.5.47) 


Finally, from (12.5.35), (12.5.37) for the expression of the wavefunctions in 
regions 1 and 3, and (12.5.47) we obtain for the transmission and reflection 
probabilities the leading expressions 


T ~ exp (-7 i dxV/2m(V (a) — ®) (12.5.48) 


R~1 (12.5.49) 


respectively. Obviously, these are valid if the exponential term in (12.5.48) 
is small. The fact that R turns out to be equal to one is of no violation of 
the conservation of probability (R + T = 1) as both results for R and T are 
leading contributions to these probabilities and the value “1” for R dominates 
over any small correction of the order T as given in (12.5.48). 
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Next we investigate the nature of quantization rules set up by the WKB 
approximation and make contact with the so-called old quantum theory. Be- 
fore doing this we note that the WKB approximations of the wavefunctions 
vi(x), ve(x), v3(x) obtained in regions 1, 2, 3 as given, respectively, in 
(12.5.35), (12.5.36), (12.5.37), by making use of the constraints on their co- 
efficients in (12.5.45), (12.5.46), (12.5.47), may be rewritten, away from the 
turning points a and b, as 


V1(2) = RG cos € [a p(x) + 4 (12.5.50) 


W2(x) = dx (0!) (12.5.51) 


ones ae 


(12.5.52) 


We have seen, how by examining the solution of (12.5.1) near the turning 
point x = a, we were able to find the connection between the coefficients C}, 
Cy in (12.5.8) and ¥ in (12.5.36), as given in (12.5.31), (12.5.32), (12.5.46), 
going from the damped solution (12.5.36), in region 2, to the oscillatory one 
n (12.5.8), in region 1. The connection between 71(x) and 72(x) on the left 
and right of point x = a, respectively, as indicated by the direction of the 
arrow in.!4 


_ ( 5 f aelote))) : cos (; [a (2) +) 
xp x|p(ax)| ) x p(x) + — 
2 |p(x)| Ada p(x) Ada . 
(12.5.53) 
is called a connection formula and shows the correspondence between regions 


2 and 1, remembering that in the physics of the problem we have chosen 
incidence on the potential barrier from left to right. Upon rewriting 


oxo (—j fae!) =e (5 [arteent) ew (—j [arate 


(12.5.54) 
n (12.5.51), where we have used the fact that p(x) = ilp(a)| in region 2, 
we see an obvious correspondence rule between the WKB wavefunctions in 
(12.5.51), (12.5.52) in region 2 and 3. 


12.5.3 WKB Quantization Rules 


Consider the potential energies depicted in Figure 12.5. Referring to part 
(A) of the figure, we may obtain the WKB approximation in the region 


14 Note that in order to avoid confusion with the notation of a limit, denoted by 
—, we have used the notation + in (12.5.53). 
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a<a« <b by two methods. One is to match its oscillatory solution with the 
exponentially decreasing one on the left-hand side of point a, and another one 
by matching it with the exponentially decreasing one as one moves away from 
the point b. By requiring that the solutions obtained by these two methods 
to be the same, leads to a quantization rule as follows. 


(A) 


Fig. 12.5. (A) A potential energy with two turning points at a and b. (B) A 
potential energy with a turning point at b, and the potential energy is taken 
to go to infinity at a. This is equivalent in working in half space. 


To the left of the turning point b, in part (A) of Figure 12.5, we may use 
the solution in (12.5.50) with a in it simply replaced by b leading to the WKB 
solution for a < x < b given by 


(a) = (55) cos (; [x p(x) + *) (12.5.55) 


for some constant c. A solution in a < x < b obtained by matching it with the 
exponentially decreasing one on the left-hand side of point a, may be directly 
read from the one in (12.5.55) by merely replacing f,'da p(x) by fda p(x) 
in it, and replacing the coefficient c, in general, by some other constant, say, 
d. This second method then leads to 


(2) = (=35) cos (; is dx p(x) + *) ; (12.5.56) 


We first note that for the cosine function in (12.5.56), we may write 
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= cos ( [x p(a) + 7) cos (jf p(x) — :) 
“sin (; a da p(x) + *) sin ( P da p(x) — 5) . (12.5.57) 


Requiring that the solution (12.5.55), (12.5.56) obtained by the two meth- 
ods coincide leads from (12.5.57) to 


1 a 
=f dx p(x) — ie nT, d=(-1)"c (12.5.58) 
A Sy 2 


giving the quantization rule 


; fe p(x) = 1 ¢ " 5) (12.5.59) 


where the n are non-negative integers. Since in the range a < x < b, p(z) is 
real and positive (see Figure 12.5 (A)), n cannot take on negative integers. 
The quantization rule in (12.5.59) brings us into contact with the old quantum 
theory of Bohr and Sommerfeld. We recall from (12.5.48), that the validity of 
the WKB approximation, requires, in particular, that such exponential terms 
as in (12.5.48) be small. The latter implies that (12.5.59) is strictly valid for 
large positive integers. The expression in (12.5.59) is, nevertheless, useful in 
predicting the discrete spectrum for some Hamiltonians (see Problem 12.16). 
By considering the wavefunctions near the turning points a and b, using 
(12.5.59), and formally writing the cosine function in (12.5.56), for « — 
b — 0, as cos[(a/4) + na], n would correspond to the number of nodes of 
the wavefunction in a < x < b, since each time, 7/4 is translated by 7, the 
wavefunction on its way to the new value cuts the x-axis. 

In part (B) of Figure 12.5, the wavefunction is to have a node as x > a+e, 
for ¢€ — +0, at which point the potential is defined to be infinite, and the 
WKB solution in (12.5.56) is to be replaced by 


O= ( =| din Gf, de o(«)) Bee (12.5.60) 


for a < x < b. Upon comparison of (12.5.55) with (12.5.60), we arrive at the 


quantization rule 
i 3 
if dz p(x) =7 (» + 3) (12.5.61) 
a+0 


for non-negative integers. 
It is interesting to point out that for the elementary problem of a particle 
in a box with impenetrable walls at a and b, the WKB solutions are to have 
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nodes at b—0, a+0 and the solutions in (12.5.55), (12.5.56) are to be replaced, 
respectively, by 


Cl ee CC 


(12.5.62) 
which upon their comparison leads to the quantization rule 
1 e-0 
dx p(x) = n(n +1). (12.5.63) 
h a+0 


[With p(x) = V2mE, (12.5.63) leads to the familiar energy levels E, = 
h?n?(n + 1)?/2m(b — a)?.] 


12.5.4 The Radial Equation 


For a spherically symmetric potential V(r), the radial equation is given 
by 


cam Aa eles Gees h 

@+1)4+V E| R(r) =0, 12.5.64 

| 2m (= dr’ =) 2mr? ( ) (r) (r) ( ) 

where for simplicity of the notation, we suppressed the dependence of R(r) 

on €=0,1,2,..., and other quantum numbers. We may take advantage of the 

analysis carried above to reduce (12.5.64) to the form (12.5.1). To do this, 
we set 


f(r) 
= 12.5. 
R(r) We (12.5.65) 
to rewrite (12.5.64) as 
d? d 2m 1 
2 | | 2 | 2 — 
r ape T are T he r (E V) (é T 5) f = 0. (12.5.66) 


Finally, let a@ be a conveniently chosen scale, and introduce the variable 
ni —0O <n < ow, by 
r = ae"/?, (12.5.67) 


In terms of this new variable, (12.5.66) becomes 


dP Po 
apt * he f= 0, 


—00 <1) <0 (12.5.68) 


where 


BG) = in (e ei: (ae"v*)) e2n/a _ Ey + 5 (12.5.69) 
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Thus provided, the WKB approximation may be satisfied, we may infer, for 
example, from (12.5.59) that 


: i aoe =a (w 4 5) (12.5.70) 


a 


and when the integral is rewritten in terms of the variable r, we have 


a ryan (E-Vi(r)) E (e+ ;) =r (n+ 5) (12.5.71) 


where rg, Tp are the zeros of the integrand. 
We may thus conclude that to apply the WKB procedure in the one- 
dimensional radial equation in the form. 


lie i 2 (ame uu) mal | 1))| u(r) =0 (12.5.72) 
where 
R(r) = u(r)/r, (12.5.73) 


it is necessary to replace ¢(€+1) init by (€+1/2)?. This is true even for 0 = 0. 
The reader is asked to investigate conditions for the justification of the WKB 
asymptotic procedure for the attractive Coulomb potential in Problem 12.17 
and investigate the eigenvalues resulting from (12.5.71). 


12.6 Time-Dependence; Sudden Approximation and the 
Adiabatic Theorem 


The time evolution of quantum systems for time-independent and time- 
dependent Hamiltonians was investigated quite generally in §2.5, and the 
general theory of quantum decay and its related time-dependence in §3.5. 
Exact treatments of transitions between harmonic oscillator states, driven by 
external time-dependent sources at zero and finite temperatures, were give in 
§6.2-§6.4. Chapter 8 was almost entirely devoted to time-dependent studies 
of spin 1/2 and two-level systems. A detailed account of Green functions and 
their time-dependence was given in Chapter 9. Path integrals and the quan- 
tum dynamical principle, as time-dependent problems, were, respectively, the 
subjects of Chapters 10 and 11. Scattering theory in a time-dependent set- 
ting will be carried out in Chapter 15, and relativistic dynamical aspects in 
Chapter 16. In the present section, we consider some time-dependence aspects 
which call for valid approximations as discussed below. The above mentioned 
material may be also consulted for completeness since some of the topics are 
related to the present ones as will be specified below. 
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On a time scale set up by the physics of a problem, such as one given, 
for example, by a typical quantum mechanical period of oscillations, we in- 
vestigate the dynamics corresponding to two extreme cases of Hamiltonians. 
For one case, suppose that a Hamiltonian varies from one form to a final 
form during a very short time 7 in comparison to a time scale set up in a 
problem. The approximation corresponding to such a sudden change (t — 0) 
of a Hamiltonian is called a sudden approximation. For the other extreme 
case, consider a Hamiltonian which varies very slowly in an interval [0,7] in 
comparison to a time scale set up in a problem, such that one may effec- 
tively consider the limit T — oo. The corresponding approximation is called 
the adiabatic approximation (see also §8.13). In both cases, the underlying 
potential energies need not be weak. 

Before going into the above details, we first treat another important sit- 
uation, where a weak perturbation is added to a given Hamiltonian during 
a given interval of time (0,7), and we investigate the nature of transitions 
that may occur between different states of the given Hamiltonian after the 
perturbation is switched off. 


12.6.1 Weak Perturbations 


Suppose we are given a time-independent Hamiltonian H®°, and during a 
time interval (0,7), we switch on a weak perturbation H1(t), generating the 
Hamiltonian!° 

H(t) = H°+ H(t) (12.6.1) 
where H'(t) =0 fort >7,t <0. 

We are interested in investigating transitions that may occur between 


states of H® caused by the perturbation H1(t) after the latter is switched off. 
A state |¢(t)), at time t, satisfies the Schrédinger equation 


in (t)) = HC) (0) (12.6.2) 


and just prior to the switching on of the perturbation, we specify an initial 
condition |~(t))|,~9 = |Y(0)). 

Let Pa(A), A = (Ao, A1,---), be the spectral measure associated with 
commuting set of spectral measures of operators A = (H 0 Ad,.. ze thus 
including the Hamiltonian H°, in question, to specify a state at t > T. 

For measurements carried out on the system at time t > 7, after the 
perturbation is switched off, 


I (Y(t) |dPa(A)ld()) = )|Pa(A)lY@) (12.6.3) 


15 The switching on and off of the perturbation in time may be carried out in a 
smooth manner as done in (8.7.21), (8.7.23), (12.6.33). 
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represents the probability of the measurements of the observables in A = 
(H°, Aj,...) to have values in the set A, i.e., for which A € A, with |7(0)) 
denoting the initial state of the system. Here Pa(A) denotes a projection 
operator. 

To consider actual transitions of the system under the action of the per- 
turbation, we choose A such that 


J, aPa(X')10(O) = Pa(d) 10(0)) = 0. (12.6.4) 


The physical significance of the condition in (12.6.4) is clear. It states that the 
values obtained in the measurements of the observables in A = (H DA iaced ne 
specified by the condition A’ ¢ A, at least one of them is different from its 
initial value as a result of the application of the perturbation thus the latter 
causing the system to make a transition. This obviously does not necessarily 
exclude transitions which conserve energy (i.e., for which X59 = Ag) since not 
all of the components of X’ need to be different from those of A so that 
NEA. 

To obtain the expression of the transition probability in question, we first 
set 


[as(t)) =e #°/" az) (12.6.5) 
and 
H(t) = et P/E (pe HHO/A, (12.6.6) 
These lead to 
|e(t)) = |@(0)) — a dt’ H*(t') |d(t’)) (12.6.7) 
|2b(0)) = |6(0)) . (12.6.8) 


From (12.6.3)—(12.6.8), we then have for a weak perturbation, i.e., to the 
leading order in H1(t), 


(Y(t) |Pa(A)lb@) = (7) |Pa(A)lY(7)) 


(0)) 
(12.6.9) 


= af uv favo |FORore 


where we have made explicit use of (12.6.4). 

For the sequel, we consider H(t), for 0 <t< 7 in (12.6.1), to be 
time-independent. 

Suppose that initially, 


[b(0)) = |W (AO, Ai,---)) = |W’) (12.6.10) 
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for which, according to (12.6.4), A’ ¢ A. One may then write the double 
integral in (12.6.9) as 


1 1 7 ! - 11 i(Ap—Ao) (t’—t”) /h 
[ woo rars ey woo) [ae [arte 


2 1 7 sin” [(Ao — Ao) T/2A] 
<7 f (MO Pa 2) 80 TS BaP 


(12.6.11) 


where we have multiplied and divided the resulting integral by 77. 
We formally introduce a density of states p(A), for values of the observ- 
ables in A = (H®, Aj,...) lying in the range (A, A+ dA) in A, and write 


(H'1h(0) |dPa(A)|H1(0)) = |(W(A) |AYW(A)) [dA pl) (12.6.12) 


where n specifies the number of observables in A, and we have used (12.6.10). 
Suppose that Ao, in A, varies within a range I(c) = (Ao =e\o+ €), and 
A = IUA\, where A, corresponds the range of values for 1, A2,...- 

The function 


sin? [(Ao — Nb) T/2h 
7? [(Ao — Ap) /2h]” 


in (12.6.11) peaks at Ap = AQ of width ~ 27. For vy € Ife), this 
peak falls within the Ao interval of integration in (12.6.11). We consider 
(Ao — Ap —€) T/2h K —m, (Ao — AG +e) 7/2h > mw, which, in particular, 
implies that er/h >> 2. For such an ¢, but the latter, nevertheless, suffi- 
ciently small, the integrand |v (A) | |b (r’))|? p(A), for the o-integral, 
may be evaluated at the central point Ag = Ap as the main contribution to 
the Ag-integral comes from this point. 

By using the value of the integral 


= sinc? [(Ao — XQ) T/2h] (12.6.13) 


Mya 
a i Bo sinc? Qo— Ai) ~ Tt (12.6.14) 
cc. 


we obtain from (12.6.9)—(12.6.13), the approximation 


(Y(7) |Pa(O)|¥(7)) 


2 
=e n Dias VOY, Mec WO oboe) 
(12.6.15) 


The approximation in (12.6.9) will be adequate if the integral in (12.6.15) is 
much smaller than h/2aT. 

Since the factor multiplying 7 on the right-hand side of (12.6.15) is in- 
dependent of 7, we may introduce the transition probability per unit time 
by 
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(7) |Pa(A)lY(7)) 


T 


2 
== hs det D Onoaes At WOW os) pO: 
(12.6.16) 


This expression emerging from work of Dirac is called the Golden Rule.'® 

The treatment of the case when AQ ¢ I(e) is left as an exercise to the 
reader (see Problem 12.19). 

Of particular interest in the application of (12.6.16) is to the (elastic) 
scattering of a particle of initial momentum p’ to momentum p via a potential 
H' = V(x) conserving energy. Then by setting p’?/2m = p?/2m = E, 
|p|/m = v and 


d3 3 
P= — 4 0 dE dN = p(B,O) dB ao (12.6.17) 
(27h) (27h) 


(see also (9.7.35)), where dQ is the element of the solid angle about the 
vector p, with (W (Ao, A1) |H4|W (Ao, 4) identified with (p|V |p’), we obtain 


for the transition probability per unit time from (12.6.16) the expression?” 


(8,6) € Ai) 
20 noi2 m2 Ae 
yh dQ |(p|V|p')|" p(B, Q) = Anke dQ |V(p — p’)| (12.6.18) 
Ai mh At 
where V(p — p’) is the Fourier transform of V(x): 
Vip —p’) = [ex eo (P-P')/F V(x) . (12.6.19) 


For additional and related details to equations such as (12.6.16), the 
reader is referred to Chapter 15 (see, for example, (15.8.39)). 


12.6.2 Sudden Approximation 


Consider a change occurring in a Hamiltonian during a very short time 


_fM@,t<o 
H= es 6 (12.6.20) 


This may be equivalently rewritten as 


16 This name was coined by E. Fermi. 

'” Compare this with the one in (15.3.13) for later reference in Chapter 15. The 
approximation leading to the one in (12.6.18) is referred to as the Born approx- 
imation which will be discussed in detail in Chapter 15. 
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H(t) = Hy + V(t) (12.6.21) 


where 


Vit)= i cae (12.6.22) 


0, #>0. 


Here V(t) need not be a weak perturbation to Ho. 

At time t = 0, just prior to the “sudden” change in the Hamiltonian 
described above, when the Hamiltonian of the system is still given by Hj, we 
prepare the system in a state, say, |v). For t > 0, the latter state develops 
in time via the Hamiltonian H(t) thus satisfying the Schrédinger equation 


inS, a(t) = HCE) (0) (12.6.23) 


with H(t) given in (12.6.21) and |1(#))|,29 = |¥1). 
After a time t > 7, when the Hamiltonian is given by Ha, a state |q2) 
develops in time via Ho, i.e., 


|wo(t)) =e ¥H2/F aby) . (12.6.24) 


We are interested in investigating the nature of the time-dependence of the 
transition probability |(q2(t)|w1(t))|? under a sudden change of the Hamil- 
tonian as given in (12.6.20) during a very short time T. 

To the above end, let 


[wi (t)) = e*#2/" |b, (t)) (12.6.25) 


VG err Ve ere (12.6.26) 


in a similar approach as done in (12.6.5), (12.6.6). Then 
1) fe soap 
Jou(t)) = 161) — 5 fat! VE) Jove) (12.6.27) 
0 
1) = |) - (12.6.28) 
These equations lead to the transition amplitude 


(alt) |ulO) = (Walaa) — 5 fat Wale)IVEDWE).— 12.6.29) 


We expect that if the Hamiltonian changes during a very short time 7, 
then the second term in (12.6.29) will be vanishingly small for 7 — 0 since 
V(t’) > 0 for t’ > 7, and obtain that 


(t2(t)| di(t)) & (d2| v1). (12.6.30) 
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That is, the transition amplitude does not significantly change for a rapid 
change of the Hamiltonian. To compute this amplitude, one may then evalu- 
ate the corresponding expression on the right-hand side of (12.6.30) which is 
time-independent. Note that |), |¢1) develop in time with different Hamil- 
tonians and (12.6.30), although approximate, does not simply follow from 
unitarity. 

Clearly for 


1 


h h 


fo wale )IV DIC) < a dt’ ||V(t')va(t’)|| <1 (12.6.31) 


and |(w2|wv1)| not too small, one may neglect the second term on the right- 
hand side of (12.6.29) in computing the corresponding transition probability. 
The corresponding approximation is referred to as the sudden approximation. 

Intuitively, if |V|, as an order of magnitude of the effective potential op- 
erating only during a short period of time 7, then from (12.6.31), we ought 
to have r|V|/h < 1, as a rule of thumb, for the validity of the sudden ap- 
proximation.!® 

Ideally, one may rewrite the Hamiltonian H(t) in (12.6.21) as 


H(t) = Hy + (H, — Hz) (-t) (12.6.32) 


where ©(—t) is the step function of negative argument corresponding to a 
change of the Hamiltonian in time tT — 0. More realistically and for a rigorous 
analysis, however, one may replace O(—t) by a smooth “function of negative 


argument”:!® 
il t<0, 
@,(—t) = (1 = cee"7/*) , 0<t<r, (12.6.33) 
0, TK. 


This remarkable function is not only continuous but has a continuous deriv- 
ative as well, and is non-vanishing for t < 7 only. O,.(—t) and its derivative 
are plotted in Figure 12.6. 

Typically, if |~2) is an eigenvector of Hp, then 


al dt’ ||V(t’)o(t’)|| = if dt! ||V(t’)ol|. (12.6.34) 


For the change in the Hamiltonian occurring smoothly during a non-vanishing 
time interval (0,7] with V(t) taken now as (H, — Hz) O.(—t), the above is 
bounded above by T]||(41 — H2)w2\|/h, where we have used the fact that 


'8 For example, for a charged particle moving at high speeds during its interaction 
with an atom of atomic radius ~ a, we may roughly take tT ~ a/c, V ~ e/a, 
thus TV/h ~ e?/hc = a which is of the order of the fine-structure constant. 

19 See also (8.7.21). 
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oa Bi Se (12.6.35) 
0 


(see also (8.7.21), (8.7.22)). 


(a) (b) 


Fig. 12.6. (a) Plot of the function ©.(—t) and in (b) its derivative given by: 
t<0 
O0<t<rT 


d0-(—t) t/t 


= ee 
dt 
T<K<t 


Both functions are continuous and rigorously vanish for t > T. 


As an elementary example, suppose that a uniform electric field & is sud- 
denly applied to a charged particle in a harmonic oscillator potential, in one 
dimension, being initially in the ground-state. In this case, 


2 2.2 
p Mw x 
2 ae 12.6.36 
2m 2 ( ) 
2 2.2 
p Mw x 
Ay = —+ — €&X. 12.6. 
har 5 e&u (12.6.37) 
The latter Hamiltonian may be rewritten as 
2 2 2 202 
p mw ef le€& 
Hy = : 12.6.38 
ya 2 (« =) 2 mw? ( ) 


The transition probability, for example, that the harmonic oscillator will 
be excited to the first state is according to (12.6.30), (see (6.1.37), (6.1.38)) 
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1/2 oe ; 


where a = e&/mw?. The latter is easily integrated to yield the probability 
a?(mw/2h) exp(—mwa? /2h). 


12.6.3 The Adiabatic Theorem 


Consider a time-dependent Hamiltonian H(t) which varies slowly in time 
t in an interval [0,7], measured by a large value of the length T of the 
time interval (i.e., T — co). The slow variation is made in comparison to a 
time scale set up in a problem. Such a time scale may be a typical quantum 
mechanical oscillation period ~ w~' (see, for example, (8.13.27)). 
For each fixed t, let |7,(t)) be a normalized eigenvector of H(t) with 
eigenvalue E,,(t), 
H(t) |im(t)) = Ent) m(0)) (12.6.40) 


The spectrum of H(t) need not to consist only of (discrete) eigenvalues. 
Given an eigenvector |n»(t)), we may, conveniently, introduce the equivalent 
eigenvector defined by 


Iba(8)) = Ina(t)) exp (: il dt! (na(t’) rin (¢)) 12.641) 


(see also §8.13, (8.13.5)). The eigenvector |¢,) resulting from |n,(t)) by a 
phase transformation ((7,| im) is real), has the advantage over |7,(t)) in 
that 


(Ont) | dn(t)) =0 (12.6.42) 


as is easily verified. 

We will assume that no crossing takes place between £E,,(t) and other 
eigenvalues for alO <t<T. 

Let |~(t)) be any state of the dynamical system satisfying the Schrodinger 


equation 
nT [w(6)) = AC |). (12.6.43) 


Upon integrating the elementary equation 


d i t / , . i 7 

pan Ree n(t’) | __ ie ! ! 

3 [ent Heeb a” B0] = (4,00) ] He) ex (5 ff aven(¢)) 
(12.6.44) 

where we have used the fact that 


(on(o| Ayo elearencnr) 
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= =F (Salt) |(H(t) — En(t))|(t)) ef a” Bole /™ 


=0, (12.6.45) 


we obtain 


(no [ui@)exp (1 fat maleate») exp = fate 


w(t) exp ( ie we.) : 


(12.6.46) 


= (nal) /0(0)) + fae! (Bult) 


In writing the expression on the right-hand side of (12.6.46) we have used 
(12.6.41). Equation (12.6.46) is exact. 

Provided that for T — oo, the second term on the right-hand side of 
(12.6.46) becomes negligible, for (7,(0)|w(0)) 4 0, we obtain 


(nal [u@)exp (1 fad male) |ainte))) exp (Ff ae'En(¢)) 


~ (7m (0) |(0)) (12.6.47) 


which is the content of the Adiabatic Theorem. Equation (12.6.47) states the 
fact that in the adiabatic regime of a very slow change of the Hamiltonian 
H(t) in the interval [0,T] (T — oo), the scalar product (n,(t)|¢(t)), with 
the appropriate phase factors as given on the left-hand side of (12.6.47), 
essentially remains invariant in time in the above mentioned interval. 

To investigate the nature of the second term on the right-hand side of 
(12.6.46) suppose for simplicity that the spectrum of H(t) consists only of a 
discrete non-degenerate spectrum for all 0 < t < T. Quite generally, one may 
expand [1i(E)) = Dom lrtm(t)) (rim (t) [W(4)). For |¥(0)) = Ina(0)), for a given 
n, then (12.6.47) implies that (7,(t)|¢(t)) is approximately one times the 
complex conjugate of the phases multiplying it, and otherwise (7,,(t) |¢(t)) ~ 
0 for m # n, since (m(0)|n(0)) = 0 for m # n, and the second term on the 
right-hand side of (12.6.46) is expected to vanish for T — oo. Hence, 


WCE) = Inn (€)) exp (: [ae tnt) tint) ) 


- at 
x exp (-7/ wE,(¢)) +... (12.6.48) 
0 
plus terms which are expected to vanish for T — co, given that 


1(0)) = |1m(0)) - (12.6.49) 
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That is, the state |W(t)) evolves together with the Hamiltonian satisfying 
approximately the eigenvalue equation (12.6.40) for allO <t<T. 

We note that for any state |(t)), of the dynamical system, the second 
term in (12.6.46), with the above discrete non-degenerate assumption of the 
spectrum of the H(t), may be rewritten as 


y i at! (bn(t’) 


bralt!)) (malt) (E)) exp ( i w'E,()) 


(12.6.50) 
where we have used (12.6.42). 
Now upon taking the time derivative of 
(n(t)| A(t) = En(t) (bn(t)| (12.6.51) 


and multiplying the resulting equation by |¢,,(£)), for m 4 n, we obtain 
(on (t) | 5¢-A(O)|om(t)) 
(Em (t) = En(t)) 


expressed in terms of the change of the Hamiltonian. 
Also note that as in (12.6.45), 


(dn(t) | dm(t)) = (12.6.52) 


£ (Iweppet area) = —* aria) — By(t)) WO)ek Mae'Bale) 
(12.6.53) 
from which 


1 
(A(t) — En (t)) 


= |x(t)). (12.6.54) 


w(t) ek oO) = ih 


S (lw(o) et 8 are) 


That is, we may write 
» nt 
(8n|ém) omlv)exp (+ fav En(¢)) 


a (On| lb) d 
= ~in ital at Pe? (nl ae XC) (12.6.55) 


for the integrand in (12.6.50) with t > Uv’. 
Upon substitution of (12.6.55) in (12.6.50) and integrating by parts over 
t’, the latter equation becomes 


t 


en (+ f avec) eve) 


0 
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-fae(s (t’) wil) exp Gf we.) (12.6.56) 


pees t) |OH(t)/Otlbm(t)) 
= nye CCEA: (Om(t)| - (12.6.57) 


where 


The reason for integrating by parts is that if we make the change of 
variable t = sT in (12.6.56) where 0 < s < 1, and introduce the notation 


\E(t)) = |E(s)), |w(t)) — |v(s)), E(t) — E,(s), we obtain a multiplicative 
factor 1/T in (12.6.56): 


[om (z 7 fas Ba(s!)) (ls) 1000) 


- fase é(s") () exp (; a aba) (12.6.58) 


as is easily checked. This is unlike the expression in (12.6.50) in its original 
form as the latter involves the t/-integration variable and one time derivative 
n (dn(t!) (t’ )). and the substitution t = sT is not helpful in that form. 


Therefore provided the expression within the square brackets in (12.6.58) 
remains bounded for T’ — oo, (12.6.47) holds up to a correction of the order 
1/T. 

For a detailed application of the Adiabatic Theorem, see (8.13.19)-— 
(8.13.27) where the correction to the adiabatic limit is given by the ratio 
of two frequencies. 

The derivation of a sufficiency condition for the Theorem as obtained 
in (12.6.58) with a O(1/T) correction may be also carried out even if the 
spectrum of H(t) does not consist only of a discrete non-degenerate one 
by a very similar procedure as given above and further generalizations are 
possible.?° 


s 


12.7 Master Equation; Exponential Law, Coupling to 
the Environment 


Suppose we have two interacting systems described by a Hamiltonian 
A = Hoi + Hog + Ay (12.7.1) 
20 See Kato (1951b). There has been much interest in this Theorem recently with 


various degrees of rigor, cf., Avron et al. (1990); Wu and Yang (2004), and ref- 
erences therein. 
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where Ho is the free Hamiltonian of system 2, and Ho, is the Hamiltonian of 
system 1, which may include interactions terms between its sub-systems. H, 
is an interaction term between system 1 and 2. The Hamiltonians Ho,, Ho2 
are assumed to commute. We carry out an investigation of the dynamical 
process described by the Hamiltonian in the density operator formalism. Our 
interest is in the dynamics and aspects of system 1 only in response to system 
2. Accordingly, we trace the total density operator py(t) over variables of 
system 2, thus introducing the reduced density operator of system 1, defined 
by 


p(t) = Trler()] (12.7.2) 


where 
pr(t) =e *#F/h on (O)eitA/®, (12.7.3) 


To study the nature of the reduced density operator in (12.7.2), we first 
introduce the operator 


m(t) = elon Hoa) /F pa (phe i Hont Hoa) /h (12.7.4) 


(the so-called density operator in the interaction picture), to obtain, the 
equation 


d i 
—m(t) = —= [Ay (t), m(t 19.7. 
qa n(t) = — = [An (t), m(@)] (12.7.5) 
and 
Hy(t) = e't(Hort+Hoa)/P Fy e—it(Hor + Hoa) /P (12.7.6) 


In terms of 7;(t), the reduced density operator p(t) is then clearly given 
by 
ple) =e irl hg (ayeltoulh (12.7.7) 
where 


n(t) = Tr [m(6)] (12.7.8) 


We first develop a differential equation for 7(t) based on some approxima- 
tions. The resulting equation derived is referred to as the master equation. 
This is followed by applications to the exponential law presented in the analy- 
sis carried out after (8.1.67), and also to the concept of quantum decoherence 
as a result of coupling to the environment elaborated upon at the end of §8.7, 
notably in (8.7.47). 


12.7.1 Master Equation 
We may integrate (12.7.5) to obtain 


i 


m(t) = m(0) — > | dt’ [A(¢’), m(€)] (12.7.9) 


where (0) = pr(0). 
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A formal perturbation expansion in H(t), gives 


m(t) = m(0) + 5° (FZ) fa. [ dtjaiss. 


n>1 


x fat anos L Filta), m(0)] (12.7.10) 
0 


We set 
m(0) = mr(0) = p1(0)p2(0) (12.7.11) 


where the initial density operators of the two systems are taken to be uncor- 
related. This suggests to introduce the operator 


Qt) =14+ > (ZY fate [Peta 


n>1 


x ie dey Tr [Hil), ..., [Hi(t1), p2(0)e] ... (12.7.12) 
0 


to write 7(t) in (12.7.8) as 


n(t) = Q(t)p2(0) (12.7.13) 
giving 
7(t) = Q(t)p2(0). (12.7.14) 
The latter may be conveniently rewritten as 
H(t) = Q()Q7*(t)n(t). (12.7.15) 


The operator Q(t)Q~!(t) may be referred to as a time development gen- 
erator. 

We carry out a weak coupling expansion of the generator Q(t)Q71(t) in 
Hy. To this end, to second order 


Q) =F Hie). n(04] — 55 ff ae TLC). LHe). o2(]. 


(12.7.16) 
In the sequel, we consider, interaction Hamiltonians such that 


Tr [Hi(t) p2(0)] = 0 (12.7.17) 


and this property will be explicitly verified below. Therefore to go up to 
second order for the generator Q(t)Q7!(t), we may take Q~'(t) simply to be 
the identity operator 1. That is, to a second order expansion of the generator, 
we obtain from (12.7.15), (12.7.8) the equation 


730 12 Approximating Quantum Systems 


0 =—F [ ae BLO, LEW), mono]. (12.7.18) 


The types of interaction Hamiltonians Hj(t) to be investigated are of the 
form 


Hy(t) = Atel" S* Agdye et + Aew" S ~ dZ bh el! (12.7.19) 
k k 


where of, by, are creation, annihilation operators of excitation energy hw, 
associated with system 2, with the free Hamiltonian Ho2 of the latter system, 
given by 
Ho2 = D> hwy bj.be (12.7.20) 
k 


corresponding to an infinite set of independent harmonic oscillators, omitting 
the zero point energies. 

At, A are creation, annihilation operators of excitation energy hw per- 
taining to system 1. 

We take the system 2 to be initially in the ground-state, i.e., 


p2(0) = |0), (Ol (12.7.21) 
hence 
= fo i 
{0lb«l0), = 0 = ,(0 io), (12.7.22) 
We also recall the following basic relations (see §6.1, (6.1.30)), 
bf |ni,...,ne,-.-) = Ve + 1|ni...,ne +1,...) (12.7.23) 
by \ni,-+-, Mk, +--+) = ng |n1...,n~ —1,...) (12.7.24) 


and the orthogonality property of the different states |n,,m2,...) and, in 
particular, the orthogonality of |0) and the excited states 


(ny, nz, ...|0) =0. (12.7.25) 


It is straightforward to evaluate the trace Tr in (12.7.18) (see Prob- 
lem 12.23) giving 


n(t) = -{ at an(( + n(t) At AI* (t) — An(t) At (I(t) + ri) (12.7.26) 
where 


i= - | at! (0|zi(t) Hi(t!)|0), 
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= 2M i E dt! elev) (t—#') (12.7.27) 


and lA )10), = 0 satisfying (12.7.17). 

As we are summing over an infinite number of modes, we will replace 
the sum over & in (12.7.27) by an integral over the frequency w, — w’, and 
introduce, in turn a frequency density n(w’), to rewrite I(t) as 


1 co 
1) = | du! |X(w’)|? n(w i dp eee). (12.7.28) 
0 
We will refer to (12.7.26) as the master equation,?! and 


(ol(e) (to), = f des! |A(w’)P? n(w') #9) (12.7.29) 
0 


depending on the time difference (t—t’), as the correlation function of system 
2, to which we will return shortly. 
After carrying out the time integral in (12.7.28), we obtain 


Re{I()} = a =f aie) nw ere (12.7.30) 
ee ‘asain (12.731) 


In the applications to follow, the real part of I(t) will be of central im- 
portance. To evaluate the latter, we note that it may be rewritten as 


Re{I(t)} = af. da |d (w( aes =) n(w(1+ —)) me (12.7.32) 


(see also (8.1.90)). The function sin /ax peaks at the origin, and is concen- 
trated mainly in the region |x| < 7. We make the Markov approximation by 
assuming that |A(w’)|? n(w’) is slowly varying around the point w’ = w, and 
hence for wt > 7, it may be taken outside the integral evaluated at w’ = w, 
ie., at x =0, with increasing accuracy for wt >> 7. This gives 


Re{I(t)} = AG) Ente) ie Go (12.7.33) 


hh? x 
which for wt > 7, 


Re{ I(t)} = aie n(w) 


1 This name is usually given to (12.7.26) after some simplifications are made in 
the expression for I(t). 
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Ne) 


(12.7.34) 


using the notation in (8.1.93). 

Actually, the equality in (12.7.34) is a familiar one as following directly 
from (12.7.30), which for sufficiently large t, sin(w’ — w)t/(w’ — w) acts like a 
delta function with support at w. We may also rewrite the correlation function 
in (12.7.29) as 


0 | H(t) M(t')|0), = i dx |\(w + wa) |? n(w + wx) cos(wra) 
- if- dx |A(w + wa)|? n(w+we) sin(wrx) (12.7.35) 


where t = t—t’. Assuming the integrability of |A(w+wa)|? n(w+w2) over 2, 
then for w7 sufficiently large, the so-called Riemann-Lebesgue Lemma implies 
that both integrals in (12.7.35) become small. 

That is, for rT >> 1/w, the correlation function (0 | H(t) A(t’) |0} be- 


comes small, and the latter contributes mainly to the time integral in 
(12.7.27)/(12.7.28) for 7 not large. As a matter of fact, if we formally replace 
the correlation function by the highly localized expression in rT: y/2 6(7), then 
this gives from (12.7.27) the result that I(t) = y/2 consistent with (12.7.34). 
This is the content of the Markov approximation. The operator Hj(t’), in 
0 |Hy(t) Ay(t’) |0),. creates single excitations from the vacuum (ground-state) 


at time ¢’ which are annihilated by Aj(t) within a short interval of time of 
length 7 = t —#’, ending up again by the vacuum. Such fluctuations occur in 
short periods of time of the order 1/w, referred to as the correlation time. On 
the other hand, in (12.7.34), we have taken t >> 1/w, and the corresponding 
approximation requires that t is long compared to the correlation time of 
system 2. Finally, for the validity of the second order perturbation theory 
in (12.7.18), we may infer from (12.7.26), (12.7.34) that ¢ should be short 
compared to y~!. As shown below 7 is a constant associated with the decay 
of system 1 in response to system 2. 

What is the significance of the small imaginary part in (12.7.31)? This, in 
general, contributes to an energy shift (as in the Lamb shift) for the system 
1. This is easily seen from the dynamical equation for p(t) in (12.7.2), which 
from (12.7.7), (12.7.26) is given by 


p = o fe! fo + Al; Al A)eitHor/h q 


_ {Alp + pAtA— AvAt \ (12.7.36) 


where 


Al Sq Hou/h Alton! (12.7.37) 
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I, =Re{I}, I; =Im{I}. (12.7.38) 


In (12.7.36), note that the Hamiltonian Ho; of system is displaced by the 
quadratic term hJ; At A, involving the energy correction hJ;, as a result of the 
interaction of systems 1 with that of 2. 

For further reference, we note that 


I(t) + I*(t) =20,=7 (12.7.39) 


as follows from (12.7.34). 


12.7.2 Exponential Law 


Consider the two-level system in (8.1.67) with interaction Hamiltonian as 
given in (8.1.68)—(8.1.70), 


Bo+ Fy | Ey— Bo 


A = 
01 2 2 


03. (12.7.40) 


Suppose the system is initially in the state (1 0)’, ice., 


(0) = p(0) = Gi 7 (12.7.41) 
and let 
n(t) = (: ’) (12.7.42) 
Then from (12.7.26), (8.1.69), (8.1.70) 
ab a(I + I*) bl 
(: ‘) i ( ae etre 2) ae) 
which from (12.7.41) leads to 
a(t) = exp (- ih dd’ (I(t’) + ri) (12.7.44) 
b(t) =0 = c(t) (12.7.45) 
d(t) =1— exp (- pow (1(t') + rey) . (12.7.46) 
0 
From (12.7.7), (12.7.40), we get 
p(t) = i i.) (12.7.47) 


This gives for the survival probability 
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=a(t)=e” (12.7.48) 


which coincides with (8.1.94) as expected. It is valid for t >> w~! but for ¢ 
short in comparison to y~!, where ¥y is the decay constant. 
The integration in 


[ dt’ (I(t) + I*(t')) = 4t (12.7.49) 
0 


requires justification. To this end, note that from (12.7.33) that 


I(t) + I*(t) = i/ da. (12.7.50) 
—wt 


x 


Let wt = z. Then 


~~ sing sing 
/ dx =T7- 7 dx 
-_ x 7 x 


_ 7 — 082 sinz 2 fe gy ne) (12.7.51) 
z z 2 fy x 
useful for z > 1 where in the last step we have integrated by parts twice. 
Similarly, 
/ dpe = Ee i da ==* (12.7.52) 
tes x 2 0 x 


and for 0 < z <7, (12.7.52) implies that 


or 

sin &v 

[fies 
= x 


From (12.7.50)—(12.7.53), we then have 


[wow +P) = (1 a Ga), li 


< jt (12.7.53) 


wt 


justifying the expression given in (12.7.49) for t >> w7!. 


12.7.3 Coupling to the Environment 


Consider a spin 1/2 object interacting with an apparatus described by a 
harmonic oscillator as given in (8.7.24), with the initial state of the system 
of spin 1/2 and the apparatus as given in (8.7.25) 
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|Bo) = ‘2 |-iao) (12.7.55) 


where |—iag) is a coherent state describing the initial state of the apparatus, 
Qo is real and positive. Here Ho; is taken to be the expression in (8.7.24). 
In the notation, (12.7.8), we have from (8.7.31) 


i(0) = les? (G6) lad(al + e-P (99) ania 


iapee ( i) la}(—al + ce (; s) aavlal (12.7.56) 


where aw = ao exp(—iwT), T = hia/2A (see (8.7.29)). 

For the interaction of the apparatus with the environment (see §8.7), we 
choose the one given in (8.7.44), where a’, a are the creation, annihilation 
operators associated with the harmonic oscillator describing the apparatus. 

To solve (12.7.26), with At, A now identified with at, a, we introduce?? 
the generating function 


FIK*, K; ]=Tr jews n(t) ea! (12.7.57) 


where the Tr is taken over the apparatus variables, and kK, K* are parame- 
ters. 
Using the elementary properties 


a ek! — Kal (g 4 K) (12.7.58) 
ak "eat= (at +K*\e* * (12.7.59) 
we obtain 


Tr [ex ala nekel] =K* Tr le “an eka") 


+Tr fe “ana eel) (12.7.60) 


Tr le “naka eke") =Tr le *nat ek") K 


+Tr le “anal eke!) (12.7.61) 
Therefore, from (12.7.26), 


Tr lem" en eal) = —K*I Tr le “aneX] ~KI* Tr [eX “natea" 
(12.7.62) 
2 We use a method described in: Walls and Milburn (1985). 
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In terms of the generating function in (12.7.57), this leads to 


qilk’, K l=- E eerie te F[K*, K; ¢]. (12.7.63) 


This is readily integrated over t to give 


Tr [eX an(teX@"] =Tr [exp (i ae Jo ) n(0) exp (Kal ew Jot]. 
(12:7.64) 
Using the properties 
e ja) = e Ja) (12.7.65) 
(see (6.6.26)), (a|a) = 1, (a|—a) = exp (|a|?/2) = exp(aZ/2) (see (6.6.38), 
it is not difficult to see from (12.7.56), and (12.7.64), that 


n(t) = |e4|? ¢ ) la elt Na efor] 


+ c4.c% G :) la e~ Jo \ (0 e So ‘ exp(—2a¢(1 — e~7)) 


.. (00 aft x - 
+chc ‘ i) |-ae fT \ (ae 81) exp(—2a(1 —e 1)).  (12.7.66) 


Now it is not necessary to carry out the — oper- 
ation in exp (—i#421) n(t) exp (i), defining p(t) in (12.7.7), to note the 
following. Although yt is small, i.e., exp[—2a2(1 — e~7)] ~ exp[—2a2 yt], 
then for a macroscopic value a% > 1/2yt, the non-diagonal part of (12.7.66) 
will be washed away relative to the diagonal one, with decoherence setting in 
exponentially on a decoherence time scale ~ 1/a2 as discussed at the end of 
§8.7. The operation in exp(—itHoi/h)n(t) exp(itHoit/h) just multiplies the 
a’s in |ta) by phase factors without obviously destroying the Hermiticity of 
p(t) (see Problem 12.24). 


Problems 


12.1. For the Hamiltonian of the anharmonic oscillator potential in (12.4.8), 
show that the Rayleigh-Schrédinger series for the ground-state energy 
up to third order in X is given by 


E(X) = 1+ (8/4)A — (21/16)? + (333/64)? +... 


Also find the ground-state to first order. 


12.2. 


12.3. 


12.4. 


12.5. 


12.6. 


12.7. 


12.8. 


12.9. 


12.10. 


12.11. 


12.12. 
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Treating the electron-electron interaction term in the helium atom 
as a perturbation, find approximately, to the leading order in this 
interaction, the ground-state energy of the atom. 

For a two-level system, let |1) ,|2) be two orthonormal vectors. Con- 
sider linear combinations [a |1) + a2|2)] as eigenstates of a given 
Hamiltonian H to find the exact energy levels and eigenvectors of the 
system in terms of the matrix elements (i|H|j). If |1) ,|2) are taken 
as eigenstates of a Hamiltonian H° with eigenvalues E,, E2, compare 
the expressions obtained with those of perturbation theory discussed 
in §12.1, §12.2 for the cases E, 4 Ez, E, = E> by treating H — H® as 
a perturbation to H®. 

If the degeneracy of a given energy level E? is not removed to fist order, 
then show how energy corrections may be determined to second order. 
Compute the splitting of the ground-state energy of the 3D harmonic 
oscillator to the leading order if a perturbation —F'z is added to the 
Hamiltonian by working in Cartesian coordinates. 

Investigate the splitting of the energy level Ey — 7, in (12.2.10), cor- 
responding to the states specified by (¢ = 1,m = 0,m, = —1/2),(€= 
0,m =0,ms, = —1/2) due to the perturbation in (12.2.9). 

To the Hamiltonian of a plane rotator H® = —(h?/2I)(0?/0¢7), where 
I is a moment of inertia, a perturbation —uwE cos ¢@ is added, where pu 
is an electric dipole moment of the rotator and EF is the magnitude of 
an applied electric field. Study the leading order perturbations contri- 
butions to this coupled system. [Ref. Johnston and Sposito (1976).| 
Scale the variable z in the Hamiltonian in (12.3.4) to generate one 
with variable couplings and re-express the bounds in (12.3.9) in terms 
of the coupling parameters. 

Use the variational method with trial wavefunction given in (12.3.7) 
to obtain an upper-bound-estimate for the ground-state energy of the 
anharmonic potential with Hamiltonian given in (12.3.14). Study care- 
fully the usefulness and limitations of the inequality in (12.3.14) in 
providing a lower bound to this positive Hamiltonian in question. Can 
you improve the lower bound to obtain a bound closer to the upper 
one? 

Derive the expressions for the matrix elements, (1 |H|¢1) , (b2 |H|¢1) 
in (12.3.26), (12.3.27) to finally obtain the bound in (12.3.29). 
Derive the behavior in (12.4.21) of the expansion coefficients Ex of 
the ground-state energy of the anharmonic oscillator potential with 
Hamiltonian in (12.4.8), from a direct perturbative analysis of the path 
integral expression (12.4.23) without using the dispersion relation ex- 
pression leading to the integral in (12.4.11). 

The numerical factor multiplying the expression for Ex in (12.4.21), 
independent of kK, for K — oo, may be obtained by investigation of the 
determinant of the matrix M = —(1/4)d?/dt? + 1 —6 cosh ?[2(t— to)] 
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12.13. 


12.14. 


12.15. 


12.16. 


12.17. 


12.18. 


12.19. 


12.20. 


12.21. 


12.22. 


12.23. 
12.24. 


12 Approximating Quantum Systems 


in (12.4.30), in comparison to the matrix My = —(1/4)d?/dt? + 1, 

since the action in the path integral in (12.4.35) is quadratic. Using 

this fact, together with the \-independent numerical factors such as in 

(12.4.28) omitted in obtaining (12.4.21), evaluate the overall numerical 

factor in question. [Ref. Zinn-Justin (1981).| 

Determine the Padé approximant P?:4](X) consistent with the expan- 

sion for the ground-state energy to third order in » given in Prob- 

lem 12.1 for the anharmonic oscillator potential with Hamiltonian in 

(12.4.8). 

How can one modify the WKB approximation if V’(a) is zero in 

(12.5.9)? 

Reformulate the WKB approximation in the momentum description. 

(i) In reference to part (A) of Figure 12.5, study the nature of the 
eigenvalues of the harmonic oscillator as following from (12.5.59). 

(ii) For a Hamiltonian with a linear potential —F'x defined for x > 
0, study the nature of the eigenvalues according to (12.5.61) in 
reference to part (B) of Figure 12.5 with a = 0. 

Justify the use of (12.5.71) for the WKB approximation as applied to 

the attractive Coulomb potential and find the corresponding eigenval- 

ues according to this formula. 

Attempt to extend the WKB approximation to 3D deriving in the 

process a Hamilton-Jacobi-like equation encountered in classical dy- 

namics. 

Derive the expression corresponding to (12.6.16) in the case when 

No ¢ Ie). 

For the sudden approximation concerning the Hamiltonian H,, H2 in 

(12.6.36), (12.6.37), find the transition probability from the ground- 

state energy to any excited state. 

Find the transition probability for the transition from the ground- 

state to an arbitrary excited state when the direction of the electric 

field in the Hamiltonian Hj = Ho, where Hz is defined in (12.6.37), is 

suddenly changed. 

If the equilibrium point of a harmonic oscillator potential is changed 

adiabatically (i.e., slowly) with a uniform speed, find the transition 

probability from the ground-state to the fist excited state. 

Evaluated the trace Tr in (12.7.18) to show that it is given by (12.7.26). 


Carry out the action of the operator exp(—itHoi/h) in 
[exp (—itHo1/h)n(t) exp (itHo1 /h)| = p(t), where Ho; is taken to be 
the expression in (8.7.24), and 7(t) is given in (12.7.66), to show that 
this operation simply multiplies the a’s in |+qa), in (12.7.66) by phase 
factors (without obviously destroying the Hermiticity of p(t)). 
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Multi-Electron Atoms: Beyond the 
Thomas-Fermi Atom 


The purpose of this chapter is to determine, as an estimate, an explicit 
expression for the ground-state energy E(Z) for multi-electron neutral atoms 
as a function of the atomic number Z. 

The Hamiltonian of a neutral atom consisting of Z electrons and a heavy 
nucleus of charge Z|e| is taken to be 


Z 2 2 Z 2 

D; Ze e 
A= = wa 13.1 
» (F Th + Vets ( ) 


where r; = |x,|, and m is the mass of an electron. Since the last term, 
responsible for the electron-electron interaction, is positive a lower bound 
to the ground-state energy of multi-electron atoms is readily obtained from 
(13.1) by using the bound 


Z 2 2 
Pp; Ze 
He > & ees ) (13.2) 


with the right-hand side consisting of the sum of the Hamiltonians of Z “non- 
interacting” hydrogenic atoms each of ground-state energy —mZ?e*/2h?. 
This leads (83.4) to the following conservative lower bound for the ground- 
state energy of atoms 


mZ%e4 
2h? 
which is sufficient theoretically in establishing the stability of atoms, and 
improvements to the lower bound in (13.3) may be certainly made (see Prob- 
lem 13.14). 

Our starting point for obtaining an explicit expression for E(Z) is the so- 
called Thomas-Fermi (TF) model for the atom. The TF atom has captivated 
the hearts of physicists since its birth over a three quarter of a century ago 
in 1927, when quantum physics was still in its infancy, and will continue to 


E(Z)> (13.3) 
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do so due to its extreme simplicity. In simplest terms, the interaction that 
an electron in the TF atom at x experiences is described by an effective 
spherically symmetric potential V(r), |x| = 7, and uses the TF semi-classical 
approximation (see (9.8.1)—(9.8.5)). The key point is that complicated expres- 
sions for the interactions and the multi-particle kinetic energy are replaced by 
simple functions of the electron density determined in the TF semi-classical 
approximation (see (9.8.4), (9.8.5), and also (9.7.31), (4.5.15) for the kinetic 
energy). In Appendix A to §13.1, on the TF atom, it is shown formally that 
the ground-state energy E;-r(Z) computed according to this model actually 
gives the leading contribution to the exact one E(Z) for large Z. Therefore, 
this model provides the correct starting point for the determination of an 
estimate for E(Z). Corrections to E;r(Z) are then systematically investi- 
gated. The corrections turn out to be, however, not just mere perturbation 
expansion terms in power of 1/Z and the analysis is more involved than that. 
These corrections, based on physical grounds, are worked out in the subse- 
quent sections (§13.2-§13.4) and the resulting final expression, as an estimate, 
for E(Z) is given in §13.5 by adding up these contributions. 

We urge the reader to review the contents of §9.7, 89.8, on Green func- 
tions, before reading this chapter. Here one will witness the power and the 
relative simplicity of using Green functions in dealing with endless situations 
encountered in quantum physics. 


13.1 The Thomas-Fermi Atom 


In the Thomas-Fermi atom, the interaction that an electron at x expe- 
riences is described by an effective spherically symmetric potential V(x) = 
V(r), |x| =r, also that the TF semi-classical (see (9.8.1)—-(9.8.5)) approxi- 
mation is valid and that the complicated expressions for the multi-particle 
kinetic energy and the interactions may be self consistently replaced by simple 
functions of the electron density determined in the TF semi-classical approxi- 
mation (see (9.8.4), (9.8.5), (9.7.31)), as will be now described. This is indeed 
correct for large Z values as is formally shown in Appendix A to this section. 
Deviations from large Z values will be then investigated in the next three 
sections. 

From (9.8.5), we may infer that the electron density n(r), allowing for 
spin degeneracy, is given by 


m(€ —V(r)) \°? 
n(x) = : (7 Calg ) = n(r) (13.1.1) 


he 
where 


Jos n(x) = Z. (13.1.2) 


Here, according to (9.7.22), (9.8.4), the parameter € provides an upper bound 
to the maximum energy of a bound electron. The parameter € also determines 
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the boundary of the TF atom defined by r = rg. Since the electron density 
n(r) = 0 for r > rg, we have 


V(rs) =€ (13.1.3) 


and due to the spherical symmetry of V(r) (and n(r)) and the assumption 
of neutrality of the atom, with neutrality arising by taking the charge of the 
nucleus into account as well, we may infer that V(rg) = 0 (see Problem 13.1), 
ie., € = 0. [Although they are related, a vanishing property of both n(x), 
V(x) for |x| — oo, would imply that € = 0.] 

The Green function part Goo’(xt;x’/0) (see (9.7.15), see also (9.7.34)), 
with spin indices o, o’, satisfies the Schrédinger equation with potential V(r), 
and in the TF semi-classical approximation, for coincident space points x = 
x’, is given by (see (9.8.3)), (7 = t/h) 


3 


Goa (xt; x0) = lean exp | (z= + vi) 7 (13.1.4) 


For the sum of the kinetic energies of the electrons we obtain from (9.7.31), 
(13.1.4) (see also below (4.5.15)) the expression 


1 dr 0 
3 . . 
: Jc x | ee i. v| Goo(xt; x0) 


2) 5/349 
Cry fox (n(x))°? = TIn] (13.1.5) 


as followed by an elementary integration, where we have used (13.1.1), € = 0, 
and allowed for spin. 

From electrostatics, one may define the interaction of the electron-nucleus 
system in terms of the electron density, and add to it the kinetic energy term 
(13.1.5). We then obtain the energy functional F'[n], dependent on the density 
n, defined by 


a2) °/ He ee 
Fi[nj = Soa: je (n(x))°/? — Ze? Tents) 
+ 5 fe?xa?x! n60) + n(x’) (13.1.6) 


with the ground-state energy Evr(Z) of the TF atom obtained by minimiz- 
ing Fn] over n. The second and third terms in (13.1.6) take into account, 
respectively, of the interactions of the electrons with the nucleus and the 
electron-electron interactions in this model. 

By setting the variational (functional) derivative of (13.1.6), with respect 
to n(x), equal to zero, as one proceeds in Lagrangian mechanics, gives 
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317? waar. 5 2/3 Ze? d?x’ 


[| |x —x’| 


or by using (13.1.1) this leads to! 


3y/ 
V(r) = ref we (13.1.8) 


with € = 0, for the effective potential felt by an electron at x in the TF atom. 
In Appendix B to this section, we show that the solution n(x) = nrp(x) 
of (13.1.7) actually gives the smallest value for the energy density functional 


(13.1.6). 
Let ; 
g 
V(r) = -——F(r) (13.1.9) 
Tr 
and set 
r=azt, F(r)= f(z) (13.1.10) 
where a 
30 h2 1 1 
_ ( 4 ) Ime2 Ziss — 99858007775 (13.1.11) 


and ag is the Bohr radius h?/me?. 
A straightforward integration over the angles in (13.1.8) gives 


feta] ae! ve [F(@’))°? (13.1.12) 


) t> 
satisfying the boundary condition (B.C.) 
f(0) =1. (13.1.13) 


The latter B.C. corresponds to V(r) ~ —Ze?/r for r — 0, i-e., the Coulomb 
potential of the nucleus being dominant in this limit. Here 2, = max(z, 2’). 

The integral equation in (13.1.12) may be reduced to a differential equa- 
tion by differentiating twice with respect to x giving 


2 7 3/2 
ce fO)=1. (13.1.14) 


The TF electron density and the effective potential may be written in 
terms of the dimensionless function f(a) as follows 


' Note that the right-hand side of (13.1.8) may be formally rewritten as 


+ n(x’) | , r’ = |x’. 


a d?x' Z6(r’) 


|x — x’| Arr! 
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n(r) = = (me) Ze é25 = ntr(r) (13.1.15) 
V(r) = -24/3 ane aa ey (13.1.16) 


The normalization condition (13.1.2), together with (13.1.14), give 
cB 
Z= z f dz xf" (x) = Z {rp f' (ap) — f(ep) + J (13.1.17) 
0 


where tp = rp/a (see (13.1.10)) with rg specifying the boundary of the 
TF atom. Thus from (13.1.3), with € = 0, and (13.1.9), (13.1.10), (13.1.16), 
(13.1.17), we obtain 

apf’ (ep) =0. (13.1.18) 


To find the asymptotic behavior for 7 — oo, we write f(x) = cx~7 and 
substitute the latter in (13.1.14) to obtain c = 144, y = 3, ie., 


F(z) ~ <* (13.1.19) 


for « — oo. [Although (13.1.19) exactly satisfies the differential equation in 
(13.1.14), it is not a solution for all x since it does not satisfy the normaliza- 
tion condition at x = 0.] This gives the asymptotic limits 


Ze? 
V(r) 144a° a (13.1.20) 
1 (mz \*7 1 
n(r) sa ( a 440°) " (13.1.21) 


for r — oo, with the latter giving a far slow decrease for n(r) in comparison 
to an exponential decrease typical of bound states. 

Equation (13.1.20) shows that f(a) vanishes at infinity and that xp = oo. 
Equation (13.1.18) then also implies that f’(x) — 0 for x — oo. Accordingly, 
upon integrating (13.1.14) from 0 to oo, we may infer that for the slope of 
f(x) at the origin we have 


f'(0) = ar da aaa <0 (13.1.22) 


ie., it is strictly negative. 

Actually the function f(x) vanishes only at infinity. This is easily seen 
by integrating the differential equation (13.1.14), over x, between two points 
Z1 < 2%, to obtain 
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x2 * 3/2 
f(s) - fea) = | ade PO) (13.1.23) 


and hence conclude that with f’(0) < 0, that 
FO) << f(a) << fi(a2) < ++ <0 (13.1.24) 


for 0 < +++ <a <+++ <a <-+-++ < oo. That is, f(x), starting at f(0) = 1, 
is monotonically non-increasing, approaching zero for 7 — oo. The function 
f(x) cannot vanish for finite 2 and then increase again as this will be in 
contradiction with (13.1.24). Also note that the differential equation (13.1.14) 
implies that 

f" (x2) ———— 0. (13.1.25) 


zr—0o 


The function f(a) and its derivative f’(x) may be then determined numer- 
ically from the differential equation in (13.1.14) with the boundary conditions 
f(0) = 1, f(#) — 0 for  — ow. In particular, f’(0) & — 1.58807. For the 
integral in (13.1.22), we have numerically 


bes 3/2 
i an a = —f"(0) & 1.58807. (13.1.26) 
0 


The numerical value of the following integral? 


[ow (f(a))? = 0.6154 (13.1.27) 
0 


will be also needed later on. 
The explicit analytical expressions for the following integrals are easily 
established: 


ae 3/2 
i: dy Go) = —f'(x) (13.1.28) 


[ou ap ang = -5 [ox (f'(a)) (13.1.29) 


2 FO = : cate "(x i 
[a ae = f’(0) [a (f’(«)) (13.1.30) 


SO) Ae opti 
[a ae [fe (F'(@))’. (13.131) 


From (13.1.29)—(13.1.31), we also have 


? Note that, in particular, this means that the TF potential V(r), as given in 
(13.1.16), is square-integrable with respect to the Euclidean measure d?x. 
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38 ayr/? 
| dx oy = -27'0) (13.1.32) 
[we (f(a)? = -27'(). (13.1.33) 
0 


The behavior of f(x) near x = 0, may be inferred by writing 
f(x) =1+a2f'(0)+ar°+... (13.1.34) 
and by substituting the latter in the differential equation in (13.1.14) to 
obtain a = 4/3, b = 3/2, ie., 
4 
f(z) =14+ 2f'(0)+ aoe Hf (13.1.35) 


for « — 0. [Note that (13.1.14) does not represent a Taylor expansion about 
x = 0, since, from (13.1.14), f”(x), for « — 0, does not exist.] 
Equation (13.1.16) leads to 


Vir) = c .f “| (13.1.36) 
and (13.1.15) to 
mZe2 3/2 ’ 
n(r) = (7 < ) = c ;7,| (13.1.37) 


for r — 0. 


Computation of Eyr(Z) 


For the kinetic energy term T[n] in (13.1.5), with n = np, given in 
(13.1.15), we have 


/2 
3e7?Z?2 f° (f(a) ‘ 
and from (13.1.32), this gives 
272 
T[n] = -5 g —f'(0) (13.1.39) 


where f’(0) < 0 is given in (13.1.26). 
For the electrons-nucleus interaction part 


—Ze? — n(r) = eee [x uo 
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Z@e2 
a 


f'(0) (13.1.40) 


where we have used (13.1.26). 
Finally, for the electron-electron interaction part 


1 
r= cial 


=F Paw [eet = eye)” 


9”? 
oy fo en oy) a 
Vise 


=-=-f'(0) (13.1.41) 


where we have used (13.1.32). 
Adding the contributions (13.1.39)—(13.1.41), for the minimizing density 

(13.1.15) of the energy functional (13.1.6), we obtain for the ground-state 

energy E;p(Z) of the TF atom: 

3 6? Z? 

Tt @ 


2/3 
= ; (=) f'(0) (=) gis (13.1.42) 


which upon using the numerical value in (13.1.26) gives 


a 
o d°x d°x’ n(r) 


ee 


Epp (Z) = f"(0) 


Eqp(Z) & —1.5375 (35 af =) Via (13.1.43) 


The power law Z7/® is to be noted. 
For future reference, we rewrite (13.1.42) as 


Err(Z) = Epr(1)Z"? (13.1.44) 


Epp(1) = ; (A) f’(0) (=) (13.1.45) 


where 


Appendix A To 813.1: The TF Energy Gives the 
Leading Contribution to E(Z) for Large Z 


Let E(Z) denote the exact ground-state energy of the multi-electron atom 
Hamiltonian with atomic number Z: 
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z p> Ze? Z e? 
w= (Re - = o> ee (A-13.1.1) 
We show rather formally that? 
lim Z-"E(Z) = Epp(1) (A-13.1.2) 


Z- 00 


where Epp(1) is the coefficient of 7’/? in the TF energy Epp(Z) in (13.1.44). 
Due to the complexity of the demonstration, the details provided in this 
appendix may be omitted at a first reading. The main result (A-13.1.2) is, 
however, of central importance for the multi-electron atom problem. 

To establish (A-13.1.2), we derive formally upper and lower bounds to the 
left-hand side of (A-13.1.2) and show that the limits of both bounds coincide 
with Eyp(1) thus obtaining the result in question. 


The Upper Bound 


We consider first the seemingly unrelated problem of a one-body potential 


with Hamiltonian* , 


ee s 
h=5_+V(r) (A-13.1.3) 


where V(r) is the TF potential (13.1.8), (13.1.9). We set x = R/Z'/3, and 
define 

n(r) = Z* prr(R) (A-13.1.4) 
where n(r) (= npr(r)) is the TF density in (13.1.7), (13.1.15) satisfying 
(13.1.2). Thus from (13.1.16) we may write 


V(r) = Z4/30(R) 


2 
= _— (3n2)?/? 24/3 (opp (R))2/. (A-13.1.5) 


The Green function G(xt; x’0; V), for the potential V, may be written as 
(see (9.7.56)) 


ak j h?k? 
G(xt;x/0;V) = | ay cle) exp |-i ( 5 r+UGe719)| 


(27)3 m 
(A-13.1.6) 
where T = t/h, with U satisfying the equation (see (9.7.59)), 


° The content of this result is a Theorem due to Lieb and Simon (1973). The 
derivation given here is based on: Manoukian and Osaklung (2000). 

4 Note that this Hamiltonian with the given potential V(r) in (13.1.9), (13.1.16) 
satisfies the conditions of Theorem 4.1.1, hence its spectrum is bounded from 
below and its negative part of the spectrum is discrete. 
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a hk re >. ih, 


with the boundary condition U po = 0. 
We are particularly interested in the integral 


| d°x G(xt; x0; V) (A-13.1.8) 


for coincident space points. The exp|ik - (x — x’)] in (A-13.1.6) then becomes 
simply replaced by one. Since the x in (A-13.1.8) and the k in (A-13.1.6) are 
merely integration variables, we may make any convenient change of these 
variables of integrations. In particular, we consider the change of variables 
of integrations: x — R, k — K, where (see (A-13.1.4)) x = R/Z'/3,k = 
Z?/3K and also carry out the following scaling substitutions: V = Z*/3v, 
Tt = T/Z*/3, We note that with the 7, k scalings, just defined, the product 
k?7 = K?T in the exponential in (A-13.1.6) remains invariant. With these 
new variables, (A-13.1.7) reduces to 


a h?K-VRU hh? (WRU)? — ih? VgU 
ime T 2m F2/3 T Im 72/3 = 0. (A-13.1.9) 


Let jim U =U... Then (A-13.1.9) collapses to -OU./0T +v = 0, where we 


note that uv is independent of Z (see (A-13.1.5)). Thus we obtain U, = vT. 
Accordingly, we have the following limits for large Z, as is readily verified 
upon substitution of vT for U, Z — co 


[ex a / ua eer eee z fa’ R pre(R)=Z_ (A-13.1.10) 


TM J_ogT —1€ 


and 


Zs [ox = | ue i w G(xt; x0; V) 


Ti Jig T—i€ OF 


—2 for [* BS +n] o( =e - i) 


= (377) a8 


3 5/3 2 
saat [PR (ere())* ~ 


1 
e? [orer’ prr(R)——— ppp(R’) — (A-13.1.11) 
IR—R| 
where the factor 2 multiplying the r-integral is to account for spin. The 
T-integrals project out the negative spectrum of h in (A-13.1.3). 
Equation (A-13.1.10), in particular, is of fundamental importance. It 


states (see (9.7.24)) that for large Z, the Hamiltonian h, allowing for spin, 
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has Z (orthonormal) eigenvectors corresponding to its negative spectrum. 
Let gi(x,0),.-.,9z(x,@) denote these eigenvectors for large Z. Define the 
determinantal (anti-symmetric) function 


1 
®z(x101,...,Xzoz) = Vai det [ga (xs, 7,)] - (A-13.1.12) 
Since such an anti-symmetric function does not necessarily coincide with 
the ground-state function of the Hamiltonian H in (A-13.1.1), the expectation 
value (®7|H|®z), with respect to ®z in (A-13.1.12) can only over estimate 
the exact ground-state energy F(Z) of H or at best be equal to it. 
We rewrite the Hamiltonian H in (A-13.1.1) equivalently as 


He ns 2 ee ayy [pan (A-13.1.13) 


V<J 


where h,; is defined in (A-13.1.3) with corresponding variables x;, p;. 
Accordingly, 


jim FORD) < jim Z—"!3 (@z |H|®z) 


Zz 
Si —7/3 lhethas : —7/3 7 
jim Z D_ (oilhilge) + jim, 2 Fz (A-13.1.14) 
where 
d3x d°x’ 
oe == Zi nz(xo,xo) n(r’) 
2 dx d3x’ 
| 5 pay = = [nz (xo, x0) nz(x'o',x'0") - [nz (xo,x'o'))”| 
(A-13.1.15) 
(xo, xo’) = Soa x, 0) g;(x', 0’). (A-13.1.16) 


Now using the fact that the second term in the square brackets in the second 
term in (A-13.1.15) is negative and 


2 [°° dr 
lim Z-?5 = lim Z-?— t;x0;V 
Cae nz (xo, xo) aes Qri J_ooT — ie CaM) 
= prr(R) (A-13.1.17) 


we obtain 
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Ix — x’| 


=5 [Snzlana0)) (Someta) 


d?R.d?R’ 
~ 2 | |R—R’ prr(R) pre(R’). (A-13.1.18) 


By 35! 
jim GUS Pic me" jim Ys es a (= netsrxn) 


Finally, we have 


Z 
oS genta Msi ne ee -7/3 
jim 2-7 $1 (gilhalge) = Jim (z 252 , 


i=l A<O 


De fProd 
= jim 2-118 fax 5 ‘| E ‘2 G(xt; x0; V) 


Ti JiggT—i€ OF 
(A-13.1.19) 


where Se A in 2 S- \ is a sum over all the negative eigenvalues of h in (A- 
A<0 A<0 
13.1.3) (see (9.7.32)) allowing for multiplicity but not spin degeneracy. The 
factor 2 takes the latter into account. 
From (A-13.1.14)—-(A-13.1.19) and (A-13.1.11) we obtain 


28/8 9 
; af (30?) "hi 5/3 aR 
jim 27882) < STE [ate (ore(R))°8 — & [FE ore) 


RGR! 
[IR — R’ 


prr(R) pre(R’) (A-13.1.20) 


where we recognize, from (13.1.6), (13.1.15), (13.1.16) and (13.1.44), that the 
right-hand side of this inequality coincides with the coefficient Epp(1) of 27/3 
in Epp(Z), as given in (13.1.44), (13.1.45). That is, 


lim Z-"E(Z) < Epr(1). (A-13.1.21) 


Z—00 


The Lower Bound 


We use a special case of the lower bound of the repulsive electron-electron 
potential to be established in (14.1.3) given by 


e? 1 
da? da? / ! 
esp > ye 2 [Pe nt = | UD errr UB. 
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(3m ig h2 


ps Le ae Lae 5/3 , ; 
60n2mZ Jc x (n(r))"" +62? Erp(1) (A-13.1.22) 


where n(r) is taken to be the TF density in (A-13.1.4), (13.1.7), (13.1.15) 
and we have chosen 3 = 6Z, with N = Z. 

Let W be a normalized anti-symmetric function in (x,01,...,xzoz). Then 
from (A-13.1.4), (A-13.1.13) and (A-13.1.22), 


1 
(W|H|W) > HD hi |v) — C27 [onar ptr(R Rony ®) 
302)?! pe 
a ca jor (pre(R))°?/? + 62? Erp (1). 
(A-13.1.23) 


Consider the lowest energy EF of the Hamiltonian S- h;. The latter Hamil- 


i 
tonian describes Z “non-interacting” electrons, but each interacting with the 
external potential V. According to Pauli’s exclusion principle, these Z elec- 
trons can be put in the lowest energy levels of S- h, (allowing for spin de- 


generacy) if Z < the number of such available levels, or else if Z is larger, 
then the remaining free electrons should have arbitrary small (— 0) kinetic 
energies to define the lowest energy of S- h;. In either cases, E > 2 = r, 

a A<O 
where S- d is defined as before below (A-13.1.19). 


rA<O 
Accordingly, from (A-13.1.23) and the last equality in (A-13.1.19), we 


have 


9 co 
jim Z-7/3 (UAW) > jim Baers ex i a ge G(xt; x0; V) 


277i J_ggT —i€ OT 


prr(R’) 
(A-13.1.24) 


e2 1 
—— /d?Rd?R’ R)——_— 
5 i pre (Rear 


involving reals, and noted that the last two terms on the right-hand side of 
(A-13.1.23) go to zero, when multiplied by 7~7/? in the limit Z — 00. From 
(A-13.1.11), (A-13.1.24) we then obtain 


5/3825 
i = (37 ) h 5/3 dR 
jim 2 "28 (U|H|W) > an [vr (prr(R)) e [ R prr(R) 


2 ¢d3Rd3R’ 
: / prr(R) pre(R’) 


IR—R'| 
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= Epp (1). (A-13.1.25) 


Since |W) was arbitrary and can only overestimate E(Z) or at best would 
lead to a value equal to it, (A-13.1.25) is also true for the (unknown) ground- 
state wavefunction, that is 


lim Z-"E(Z) > Epp(1). (A-13.1.26) 


Z—00 


From (A-13.1.21), (A-13.1.26), the statement in (A-13.1.2) then follows. 


Appendix B to §13.1: The TF Density Actually Gives 
the Smallest Value for the Energy Density Functional 
in (13.1.6) 


In this appendix, we show that the TF density npr(x) satisfying (13.1.7) 
actually gives the smallest possible value for the energy functional (13.1.6) 
defined a priori for an arbitrary density p(x) > 0 by 


3 
Fil = A fax (oa) - ze [ pt 
© aeeaty : B-13.1.1 
aaa aes PU) ghee (B-13.1.1) 
where ae 
(377) h? 
Re. B-13.1.2 
1072m Pee) 


Let p(x) = to: (x) + (1 — t)po(x) = tor + (1 —t)p2, p(x’) = ter(x’) + (1 
t)p2(x’) = tp, + (1—-t)ps,0 <t <1, and where p,, p2 > 0. From Appendix II 
on convexity, we have the elementary inequality 


(tor + (1 t)p2)"? < t(p1)°9 + (1 = 8) (p2)°. (B-13.1.3) 
Also 
[tp + (1 — t)pe] [to + (1 — t)p5] 

= tpip + (1 — t)p2p) — (1 — t)(p1 — p2)(p, — P2). (B-13.1.4) 


A Fourier transform, for example, shows that 


Jo dx’ [pi (x) — p2(x)] [pi(x’) — po(x’)] > 0. (B-13.1.5) 


Ix — x’| 


to be used in conjunction with the last term in (B-13.1.4). Hence we may 
conclude form (B-13.1.1)—(B-13.1.5) that 
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F[tp. + (1—t)p2] < tF [pi] + (1 — t) Fla. (B-13.1.6) 


Also 
d 5 3 3/2 
ar ltrs +(1—t)p»] = aA d°x [tp1 + (1 — t)p2]""" (p1 — pe) 


d3x 
oe zer (p1 — p2) 


> [dex dex’ 


|x — x’| 


[to + (1 — t)p5] (p1 — 2) 


and 


d 
at lta + (1—t)po| 


t=0 


5 Ze? d3x’ 
= fats (p1 »)| Apa!® + € i). (B-13.1.8) 


3 bd |x — x’| 


By choosing p2 = np, and p; = o = 0 arbitrary, we conclude from 
(13.1.7) that the expression within the square brackets in (B-13.1.8) is zero, 
thus 


o Blio + (1 —t)nrr] 


7 7 (B-13.1.9) 
Also (B-13.1.6) leads to the bound 
F[o] — F[nre] > HE OS inele Ee (B-13.1.10) 


t 


Since the left-hand side of (B-13.1.10) is independent of t, we may take the 
limit t — 0 and use (B-13.1.9) to conclude that 


Flo] > F[nre| (B-13.1.11) 


with the TF density npp providing the smallest possible value for the energy 
functional in (B-13.1.1). 


13.2 Correction due to Electrons Bound Near the 
Nucleus 


According to (13.1.36), an electron near the nucleus, i.e., for r — 0, feels 
the potential 
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Vo(r) = -— - 0 13.2.1 
o(r) =- == - = p00) (13.2.1) 
where a x Z~1/3 is defined in (13.1.11), and f’(0) is given in (13.1.26). The 
first term —Ze?/r is the familiar potential due to the nucleus, while 
es Ze? d?x 
— — /'(0) = —]|f’(0)| =e? | — 13.2.2 

£0) == [FO] =e | nlx) (13.2.2) 
(see (13.1.7)—(13.1.10), (13.1.22), (13.1.1)) is a background constant potential 
due to the electrons felt at the origin. 

The potential Vo(r) may be treated explicitly without recourse to a semi- 
classical approximation. From the theory of the hydrogen atom, we may place 
the electrons bound near the nucleus up to some energy level specified, say, 
by a principal quantum number n’. For a given n’, the maximum energy € 
attained by an electron bound near the nucleus in the potential Vo(r) is then 
given by 


_ Ze* wg), (13.2.3) 


Accordingly, to obtain the correction to the TF semi-classical approxima- 
tion Epr(Z), due to electrons bound near the nucleus, we replace the TF 
semi-classical contribution,’ in the potential Vo(r), up to the maximum en- 
ergy in (13.2.3), by the exact contribution due to this potential with electrons 
placed in energy levels, having at most the energy in (13.2.3), according to 
Pauli’s exclusion principle. 

Hence the correction to Eyr(Z) sought in this section is given by® 


Bee = > (2n?) |- ee 70] 


2h?n? a 
n=1 
2 oo Sd 
— wa |x | . rs ee) (13.2.4) 
Qri _xoT —1€ OT 
where 
dk of irk 
Gg-(xt; x0; Vo) - fom exp | i( aes vatr)) | (13.2.5) 


with the second term in (13.2.4) giving the sum of the eigenvalues, of a 
Hamiltonian with potential Vo, less than € (see (9.7.32)). Accordingly, we 
may introduce the parameter 


° In reference to this contribution, as given in the second term in (13.2.4), see 
(9.7.32), (9.8.3) by finally incorporating spin. 

® Sc in Es. refers to Scott (1952) who first gave the correction in (13.2.14) — see 
also Englert and Schwinger (1984). 
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aie (< 4 oe #"(0)) (13.2.6) 


mZ2e4 a 


and take (1/—n)!/? (see (13.2.3)) as the average of the principle quantum 
numbers n’ and n’+1, with the latter corresponding to the next energy level 
greater than €. That is, 


1\/? tnt) , 1 
i, aes gear 13:2: 
(=) ; nm + 5 (13.2.7) 


We recall that the 2n? factor multiplying the summand in the first term 
in (13.2.4) denotes the number of electrons that may be put in the energy 
level specified by the principal quantum number n. 

Upon summation over n, the first term in (13.2.4) is given by 


7? 4 Z 2 
— n' 4 7 n(n! +1)(2n' + 1)f'(0) (13.2.8) 


and by keeping track of the power of Z, we note that the second term in 
(13.2.8) is x 74/3 in contrast to Z? as appearing in the first term. 
The second term in (13.2.4) is explicitly given by 


d?k hk? 2 m 2 
-2 fetx fo ( a ae Jo al (¢ a ) ) 
_- Ane 2m(—€') 5/2 ipge ye —é'r ote 
get) Ss (eee): Lae) 


(r< Ze? /—€') 


z 1 Te 3/2 ~t'p 3/2 
2\—€r Ze 
(13.2.9) 
where 3 
Ze 
f= E+ — FO). (13.2.10) 
Upon setting 
at 
a Fe (13.2.11) 
and using the integral 
; ie). TASaye 37 
are : = 13.2.12 
i no E ye 2 ysl? 32 ara 


we obtain for (13.2.9) the expression 
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7 4 1 1/2 7. 4 1 
mas (=) =m (w'+5) (13.2.13) 


where 7 = 2h?é'/mZ?e* was introduced in (13.2.6), and we have used 
(13.2.7), (13.2.10). 

All told, we obtain from (13.2.8), (13.1.13) and (13.2.4), to the leading 
order in Z 


mZ7e4 

2h? 
where the first term in (13.2.8) cancels out, and the principal quantum num- 
ber n’ now appears only in the coefficient of Z*/? in the second term in 
(13.2.8). This term, however, may not be retained as a further correction to 
the Z? one in (13.2.14) since the corrections considered in the next sections 
are proportional to Z°/3. Thus we learn, in particular, that for the correction 
due to electrons bound near the nucleus, the contribution of the background 
constant potential due to all of the electrons in (13.2.2) may be neglected 
and we may effectively take Vo to be just the Coulomb potential 


Ege = (13.2.14) 


Vo(r) = ve (13.2.15) 


r 


due to the nucleus. 


13.3 The Exchange Term 


In considering the electron-electron interaction term 
—— n(x") (13.3.1) 


in the TF theory in (13.1.6), we have not taken Pauli’s exclusion principle 
into account that no two electrons may occupy the same state. In the present 
section we remedy this situation. 

To the above end, we consider, the non-local density given in (9.7.26) with 


a) 


1 co 
Neo! (X,x/) = af ge Goo: (xt; x’0). (13.3.2) 
271 J_oo5T — 1 
Since we are considering non-spin interactions, 
Goo(xt; x'0) = b¢0'G(xt; x'0) (13.3.3) 
Noo! (X, xX’) = boo N(x, x’) (13.3.4) 


S/ Neo(x, x’) = 2N(x,x’) = n(x,x’) (13.3.5) 


Oo 
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where N(x,x’) does not take spin into account. We also note that 
n(x, X) = n(x) (13.3.6) 


in the earlier notation in (13.3.1), and we may rewrite the integrand in 
(13.3.1), not involving the 1/|x — x’| factor, rather trivially as 


n(x, x) n(x’,x’) = S- N(x, x) N(x’, x’) 


o,o' 


= S>N(x,x) N(x’,x’) + 50 N(x,x) N(x’,x’). (13.3.7) 
oO oo’ 
To satisfy Pauli’s exclusion principle, we replace the first term in (13.3.7) 
by 
S> [N(x, x) N(x’, x’) — N(x,x’) N(x’, x)] (13.3.8) 


which when combined with the second term in (13.3.7) provides the substi- 
tution rule 


= n(x, x) n(x’,x’) — 5x, x’) n(x’,x) (13.3.9) 


as is easily checked by trivially summing over o and o’. We note the important 
complex conjugation property in (9.7.27): 


n(x’,x) = (n(x,x’))’. (13.3.10) 


Hence to obtain the “exchange correction” to Epp, in the TF semi-classical 
approximation, we have, from (13.3.1), (13.3.9), (13.3.10), to compute the 
integral 


1 


2|x — x’| 


2 
Eexe = _ d°x dx’ n(x, x’) (n(x,x’)) (13.3.11) 


where we note the additional 1/2 factor multiplying 1/|x — x’| in (13.3.11). 
To the above end, by setting 


= =kp(r), -“«-x=G, ([l=¢ (13.3.12) 


where V(r) is the TF potential (13.1.8), (13.1.16), we have (see (13.3.2), 
(9.8.3), (9.7.26)) 
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2 [% dr ak ; hk? 
a ik-(x—x’) 3 
ae a — i : exp | i( 2m +V¥(r)) 7 


_4n ke(") hdk (ciks enite) 


iC Jo (2) 
Ckr(r) 
= a 5 al dz cos(x) (13.3.13) 
as 0 
or ; 
n(x!) = 5 [sin (Ck (r)) — Cke(r) cos (cke(r)) | (13.3.14) 
Let 
Che(r)=y, y=ly| (13.3.15) 
to obtain 13 
n(x,x’) = no [siny — ycosy]. (13.3.16) 


For x’ — x, it is easily checked that n(x, x’) goes over to n(x) in (13.1.1), 
with € = 0, (see Problem 13.9). 
Accordingly, for the exchange effect Eex< in (13.3.11) we obtain 


e? {3 dx’ k&(r) (= - pos 


E — 
ai 2 142|x — x’| yp 


(13.3.17) 


Upon making a change of the variable of integration x’ to y, this leads to 


2 a3 ga eS +9) \2 
Eexe => | d°x kat) | (ee). (13.3.18) 
Ar4 y y? 


Finally using the integral 
~d 1 
/ <% (siny — ycosy)” = = (13.3.19) 
o Y 4 


and the definition of kp(r) in (13.3.12), rewritten in terms of the density n(r), 
we obtain 


2 
Boxe = ——q (3n?)*? Jovx (n(r))*”. (13.3.20) 
Ar 
This may be simply rewritten in terms of the TF function f(a) in (13.1.15) 
as 
9 V/s met 2 
Eexe = — (=) 27° (4) | da (f(x))”. (13.3.21) 
4 ne) fae (40) 


The numerical value of the integral in (13.1.27) gives 


4 
Pig = OA416 (=) Zs, (13.3.22) 
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13.4 Quantum Correction 


The deviation of the (space coincident) Green function 6G(xt; x0; V) part, 
for a given potential, from the Thomas-Fermi semi-classical approximation 
in (9.8.3) has been worked out in §9.8, (9.8.23) and is given by (7 = t/h) 


h27r2 


5G(xhit;x0;V) = Ta 


(v7) S =(vv)"| 


d°k hk? 
x i V 13.4.1 
Frans |+(Gm *¥) 7] oan 
using the mass m for 4, and emphasizing its dependence on V. 
The (quantum) correction due to electrons near the nucleus has been 
already taken into account in §13.2 where we found that the effective potential 


felt by such an electron may be taken to be simply the Coulomb potential of 
the nucleus alone 


Vo(r) = -—. (13.4.2) 


Accordingly, from (13.4.1) and (9.7.32), we may introduce a further quan- 
tum correction to the ground-state energy by” 


2 [°° dr 0 O 
— 3 j . . — 7 . Py 
Equa = Jc X57 a ae f ar 5G(xhr; x0; Vor) ine 5G(xhr; x0; Vc) 


(13.4.3) 


where Vrp is the TF potential (13.1.9), (13.1.10), (13.1.16) and Vo is the 
Coulomb potential in (13.4.2) due to the nucleus, and the 7-integral projects 
out the respective negative spectra (see (9.7.32)). 

Integrating by parts over 7, we have for the 7-integral in the following 


Ve ie oF? seen x OV) 


Qmi J_gT—ie OF 


TT 


1 [dr 
- | ze 6G(xhr;x0;V) 


” This section is based on: Manoukian and Bantitadawit (1999). 
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Fos 
x i “ar exp -i (== - v) 7 (13.4.4) 
(YY a) fei o(ME even). as.48 


2 dV 
Upon integration over k, and taking the derivative with respect to V, we 
obtain 


_ [oa , (WU)? a 
~ 127m lv Ve 2 dV 


or 


VV + 


V2m 2 1 (vv)° 
I = 13.4. 
WO soar oS a (13.4.6) 
Now we use the identity 
2 
3 3 (Vg) 
. 8/2). 2 2,9 
Vv Mo SV9V"9 += 7 (13.4.7) 
or , 
1 (VV) tos 3/2, 1 2 
tar 3) +5V-VVV (13.4.8) 
to get 
V2M | ze? 2 2 3/2 
The expression for Equa in (13.4.3) then becomes 
Bes [ex (1 [Ver (s)] = 1[Vo(x)]). (13.4.10) 


Consider the contribution of the second term in (13.4.9) to the x-integral 
in (13.4.10). To this end, using (13.1.9), (13.1.10), (13.4.2), the integral 


In = [ox Vv? [(-Ver)*” - (-Vo)*”) 


_ (2e2)""" (an) [ore ® -|l Oa | (13.4.11) 


qi/2 dx? 3/2 3/2 


—oco 


integrates to 
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e2 3/2 7 1/2 (f(2))°” —1 +. 
p= FOO 3 ryan)! P@ | an | 
"G3.4.12) 


From (13.1.19), (13.1.35), it is readily checked that the expression within the 
brackets in (13.4.12) goes to zero for x > co and x — 0. That is, the second 
term in (13.4.9) gives no contribution to (13.4.10). 

Using the Poisson equations (see (13.1.8), (13.1.1), with € = 0) 


V° Vor = 40 Ze753(x) — 4re? npr (r) (13.4.13) 
V?Vo = 40 Ze753(x). (13.4.14) 


We then obtain for (13.4.10) 


Equa = eas [e°x{(-vre) pete (-Vc) 11 59x) 


2 


e€ 
~ a |a°x (-2mVer)/?nee(r). (13.4.15) 


The first integral is proportional to 
~~ d(x) 1/2 
| de 7 |(F(@))'? -1] (13.4.16) 


which, from (13.1.35), integrates to zero. 
All told, we have from (13.4.15), (13.1.1), 


2 
Equa = — a (302) 4/* [ox (n(x))*”” (13.4.17) 


where we have used the notation n(x) for npr(r). 
Upon comparison of (13.4.17) with Eexc in (13.3.20) we conclude that® 


=— (aa) aol (5) [x (f(x))”. (13.4.18) 


From the numerical value of the integral in (13.1.27) we then obtain 
Egon % — 0.09814 (2) gars 13.4.19 
qa = —0. RP (13.4.19) 


8 The total contribution Eexc + Equa = (11/9)Eexc was evaluated by Schwinger 
(1981) by modelling his analysis after the harmonic oscillator potential. 
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13.5 Adding Up the Various Contributions: Estimation 
of E(Z) 


The various contributions to the estimation of the ground-state energy 
E(Z) of neutral atoms evaluated in the previous sections are as follows: 


e the TF energy Epp, as the leading contribution, ((13.1.42)—-(13.1.45)) 


given by 
6/4Y j me? 7 
= — ae /3 
Erp 7 (=) f’(0) ( 72 ) Z (13.5.1) 


with f’(0) given in (13.1.26), 
e correction due to electrons bound near the nucleus (see (13.2.14)) given 


Dy 4 2 
Eg. = (5 ) a (13.5.2) 


e exchange effect (see (13.3.21)) given by 


Fae ey (=) 23 [ae (f(x))” (13.5.3) 


0 


e a quantum correction (see (13.4.18)) given by 


1/3 4 oe) 
n= (ats) (2 Fre 


2 
= 5 Bex (13.5.4) 


where the numerical value of the integral in (13.5.3), (13.5.4) is given in 
(13.1:27), 


Adding up the various contributions (13.5.1)—(13.5.4), we obtain, as an 
estimate for E(Z), the following functional dependence on Z: 


my 7/3 2 5/3 met 
B(Z) © |-1.53752 + Z? —0.5397Z — (13.5.5) 


This expression turns out to be remarkably reliable and we refer the reader 


to the general survey: Morgan III (1996), Chapter 20, p. 233, for the assess- 
ment of its accuracy, for further developments and for other methods. 


Problems 


13.1. For a potential 


13.2. 


13.3. 


13.4. 


13.5. 
13.6. 


13.7. 
13.8. 
13.9. 
13.10. 


13.11. 


13.12. 


13.13. 


13.14. 
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V(x) = farx’ PRE, fetx p(x) =0 


|x 


corresponding to a neutral system, where p(x) = p(|x]|) is a spherically 
symmetric density such that p(|x|) = 0 for |x| > rp, show that V(x) = 
0 for |x| > rg. The radius rg may be finite or infinite. [This is a special 
case of Newton’s classic result.| 

Derive the expression for the kinetic energy in (13.1.5) by using 
(13.1.4) and (13.1.1) with € =0. 

Show that the variational (functional) derivative of (13.1.6) gives 
(13.1.7), (13.1.8). 

Starting from the expression of V(r) in (13.1.8), (13.1.9) derive the 
equations (13.1.12), (13.1.14)-(13.1.16). 

Establish the asymptotic limits in (13.1.20), (13.1.21) for r — oo. 
Prove the explicit analytical expressions for the integrals in (13.1.28)— 
(13.1.33). 

Establish the limits of the integrals in (A-13.1.10), (A-13.1.11). 
Verify in detail the steps leading from (13.2.9) to (13.2.13). 

Show that n(x,x’) in (13.3.16) goes over to the local density n(r) in 
(13.1.1), with € = 0, for x’ > x. 

Show that the expression within the square brackets in (13.4.12) goes 
to zero for x — oo and x — 0. 

Investigate the nature of the TF theory described in §13.1 for an ion 
with N electrons, N < Z, and, in particular, the properties of the 
corresponding TF function f(x). 

Show that the probability Prob [r < ro] of finding an electron within 
a sphere of radius rp about the nucleus for the TF density (13.1.15) is 
given by 


Prob [r < ro] =1 f() ay ee, 


where a x Z~1!/3ag is defined in (13.1.11). Make a careful study of this 
probability for various ro by noting, in the process, its dependence on 
the atomic number Z. 

Reconsider the TF theory in §13.1 by investigating the nature of the 
next to the leading term p?/2m in the relativistic expression for the 
kinetic energy \/p2c? + m2c4 — mc? for an electron, where c is the 
speed of light. When would such a term be important in your analysis? 
In obtaining the lower bound in (13.3) from the lower bound Hamil- 
tonian in (13.2), no use was made of Pauli’s exclusion principle. Make 
use of the latter to obtain an improvement to the bound in (13.3) at 
least for sufficiently large Z. 


14 


Quantum Physics and the Stability of Matter 


If one is asked to prepare a short list of most significant problems in 
physics of theoretical nature and are critical for our existence and that of the 
universe, the subject matter treated in this chapter would undoubtedly be 
on such a list. 

God forbid the Pauli exclusion principle becomes abolished making the 
electron in matter to behave as a boson, and converts matter to a “bosonic 
one”, then such matter would collapse and our world will cease to exist. This 
is what quantum physics predicts. Here we see this monumental theory at its 
best. The Pauli exclusion principle is not only sufficient for the stability of our 
matter but is also necessary. This result alone promotes the Pauli exclusion 
principle, or more generally the spin and statistics connection, as probably 
one of the most important results in physics, and in the sciences, in general. 

In regard to such “bosonic matter”, F. J. Dyson writes: “/Bosonic] mat- 
ter in bulk would collapse into a condensed high-density phase. The assem- 
bly of any two macroscopic objects would release energy comparable to that 
of an atomic bomb.... Matter without the exclusion principle is unstable.” 
E. H. Lieb writes:? “Such “matter” would be very unpleasant stuff to have 
lying around the house.” 

The drastic difference between matter (with the exclusion principle) and 
“bosonic matter’, for systems considered with Coulombic interactions, with 
N negative and N positive charges, is that the ground-state energy has the 
power law N°, where a = 1 for matter,? while a > 1 for “bosonic matter”. 
Such a power law behavior with a > 1, for the ground-state energy, implies 


' Dyson (1967). 

2 B. H. Lieb, in: Thirring (1991), p. 23. This volume and subsequent editions 
consisting of many of his publications with various collaborators, such as 
W. E. Thirring and others, contain a wealth of information on the subject. 

3 Lenard and Dyson (1968); Lieb and Thirring (1975). 
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the collapse of “bosonic matter”,**> since the formation of a single system 
consisting of (2N+2N) particles is favored over two separate systems brought 
into contact, each consisting of (N + N) particles, and the energy released 
upon collapse of two separate systems into a single system, being proportional 
to [(2N)* — 2(N)*], will be overwhelmingly large for realistic large N, e.g., 
N ~ 1073. 

The Hamiltonian under consideration for the stability matter is taken to 
be the N-electron one 


N _9 N Py Nk 2 k 2 
Pp; € Z5e Zi, Lj € 
A= —————__ — 14.1 
a eel a) Re 
w=1 <j J i=1 j=1 J V<J J 


where m denotes the mass of the electron and the x;, Rj correspond, re- 
spectively, to positions of the electrons and nuclei. Also we consider neutral 
matter, i.e., 


S04 = N. (14.2) 


The Hamiltonian in (14.1) is a typical one in that it corresponds to mo- 
tionless (i.e., infinitely massive and hence with arbitrary large rest mass en- 
ergies) fixed point-like nuclei. This is non-academic. By doing so, one does 
not dwell on the fate and the dynamics of the positive background, and one 
is looking at, and monitoring the fate of, the electrons through the “eye” of 
the former system. 

The key result in the problem of the stability of matter, with the exclu- 
sion principle, is the single power law behavior Ey ~ —N of the ground-state 
energy, and the physically expected result that the ground-state energy per 
electron |Ey /N| remains bounded for all N unlike “bosonic matter” for which 
the latter becomes larger and larger as N increases. What we will actually 
learn in §14.3 is that, for a non-vanishing probability of having the electrons 
within a sphere of radius R, the volume vz in which the electrons are con- 
fined grows not any slower than the first power of N for N — oo. That is, 
necessarily, the radius R of spatial extension of matter grows not any slower 
than N'/3 for N — oo. No wonder why matter occupies so large a volume! 
Here it is worth recalling the words addressed by Paul Ehrenfest to Wolf- 
gang Pauli in 1931 on the occasion of the Lorentz medal® to this effect: “We 
take a piece of metal, or a stone. When we think about it, we are astonished 
that this quantity of matter should occupy so large a volume”. He went on 


4 Dyson and Lenard (1967); Lieb (1979, 1976); Manoukian and Muthaporn 
(2003b). The corresponding law here is N°/* with motionless fixed (i.e., infi- 
nitely massive) positive charges. 

° Dyson (1967); Manoukian and Muthaporn (2002); Conlon et al. (1988). The cor- 
responding law here is N“/*, where the positive charges are treated dynamically 
as well with finite masses restricted to Coulombic interactions. 

° See Ehrenfest (1959), p. 617, as quoted in Dyson (1967). 
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by stating that the Pauli exclusion principle is the reason: “Answer: only the 
Pauli principle, no two electrons in the same state”. 

It is important to emphasize that the collapse of “bosonic matter” occurs 
even if the positive charges are treated dynamically with finite masses with 
Coulomb interactions.” Also that the collapse of “bosonic matter” is not a 
characteristic of the dimensionality of space,® and that such matter does not 
change, for example, from an “implosive” to a “stable” or to an “explosive” 
phase with change of dimensionality. 

In §14.1, we obtain a lower bound to the electron-electron repulsive 
Coulomb potential energy, which when combined with the lower bound in 
(4.6.24) derived for the expectation value of the kinetic energy of the elec- 
trons provides a lower bound to the ground-state energy E’y in §14.2. In the 
latter section, an upper bound for Ey is also derived consistent with the 
single power of N obtained for Ey in the lower bound. The high density 
limit of matter is investigated in $14.3. The final section §14.4, deals with the 
collapse of “bosonic matter”. 

To make this work accessible to a wider audience we have relegated rather 
some technical aspects of the analyses to the appendices of the relevant sec- 
tions. 


14.1 Lower Bound to the Multi-Particle Repulsive 


Coulomb Potential Energy 


In the appendix to this section, we derive the following inequality which 
follows from (A-14.1.28) and reads in detail, 


25/3 ‘Wi? Je 5/3 > | ae POO 
Aa eee (TT aa 
(37?) iortme fT *P y aS pe RG 
e 1 Bs Gwe 
hae? uo = 
te = fax ER) oP ) RR 
k 
i=l 


where 3 > 0 is an arbitrary dimensionless parameter, p(x) is an arbitrary 
positive function, and Eyp(1) Y —1.5375 (me*/2h?) is the coefficient of the 
TF ground-state energy defined in (13.1.43), (13.1.45). 

The energy density functional, expressed in terms of the density p(x) 
on the left-hand side of (14.1.1) is in the spirit of the TF energy functional 


” Dyson (1967); Manoukian and Muthaporn (2002). 
8 Manoukian and Muthaporn (2003a); Muthaporn and Manoukian (2004a,b). 
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considered earlier in (13.1.6) in the TF theory, with the mass m of the electron 
replaced by mG, and with the further generalization of including k nuclei, with 
the last term, involving ‘Z;Z; e?’, describing their interactions. 

The inequality in (14.1.1) gives rise to a lower bound to the (repulsive) 


Coulomb potential energy of k particles of charges Z;|e|,..., Z,|e|, or charges 
—Z,\e|,..., —Zple|, ie., for charges of the same signs as follows: 
Zi Ly € ‘ p(x) 
7 > Z 2 fe 
Pi rear pa a Ix — R,| 
i<j j=l 


k 
- (3n2)°/ ‘5 p°!3(x) + BEpp(1) > Zi”. 
i=1 


(14.1.2) 


1072m 


In particular for the interaction of N electrons we have, with substitutions 
kN, Z; - 1, R; — x; for j =1,...,N: 


a e? = p(x) 
oer? be fe 
i |x; x,;| = Ix — x5| 
UJ j=l 
e? 3 3.7 ! 
a d°x d’x’ p(x) r xP? 
(3n2)°/? h? d3x p°/3(x) + @NEpp(1). (14.1.3) 
1072m 


The two inequalities (14.1.1), (14.1.3) combined with the lower bound of 
the expectation of the kinetic energy given in (4.6.24) for s = 1/2, will be 
used in the next section to derive a lower bound for the exact ground-state 
energy of matter with Coulomb interaction by appropriately choosing p(x) 
n (14.1.1), (14.1.3) to coincide with the particle number density defined in 
(4.6.16).° 


° Note that the positive function p(x) in (14.1.1)-(14.1.3) being arbitrary may be 
chosen to be the same in all of these three inequalities. 


14.1 Lower Bound to the Multi-Particle Repulsive Coulomb Potential ... 769 


Appendix to §14.1: A Thomas-Fermi-Like Energy 
Functional and No Binding 


Due to the technical nature of this appendix, its content may be omitted 
at a first reading. The main result obtained (inequality (A-14.1.28)), however, 
is important in obtaining a lower bound to the repulsive Coulomb potential 
energy for the e~-e~ interaction as given in (14.1.3). 

We introduce the functional of a positive function p(x) defined by 


F[p;Z1,..-, Ze, Ri,..., Re] 


2\5/3 fi? 3 5/3 (x ® fax p(x) 
= —z— _|d Z; ——_ 

(om) 1072mp ma - 2d g€ x — R,| 

e une ye 

3 fedex dx! A-14.1.1 

y [axa 90) +more 
(compare with (13.1.6)), depending on positive parameters Z,,..., Z, and 
vectors Ri, ..., Rx. Here @ > 0 is an arbitrary dimensionless parameter. [In 


particular, for k = 1, the last term in (A-14.1.1) is absent, and by setting 
R; = 0, @ = 1, we obtain the energy functional in (13.1.6), (B-13.1.1).| 

The main result (A-14.1.24)/(A-14.1.28) established in this appendix was 
used in this section to obtain a lower bound for the (repulsive) Coulomb 
potential for many particles having charges of the same signs as given in 
(14.1.2), (14.1.3). 

Let po(x;k) satisfy the equation (see also (13.1.7)) 


2/3 he? 1 

(317) / amp po? (xk) = - 25- e€ ® fats apeolee) 
(A-14.1.2) 
as obtained by functional differentiation of (A-14.1.1) with respect to p(x) 

and by setting the result equal to zero as done in Lagrangian mechanics. 
Following the proof given in Appendix B to §13.1, which shows that the 
TF density satisfying (13.1.7) actually provides the smallest value (see (B- 
13.1.11)), for the energy density functional (B-13.1.1), we conclude (see Prob- 
lem 14.8) that po(x;k) satisfying (A-14.1.2) provides the smallest value for 
the functional F[p; Z1,---,Zk, Ry,--- Ry] in (A-14.1.1), with the normal- 


ization condition 
k 


Jo’ po(x;k) = SoZ; (A-14.1.3) 


i=1 
satisfied. That is 


F[p; Z\,- .., 2k, Ri,. .., Ry] = F'| po Zay 3-0) Ze; Rays ,R,]. (A-14.1.4) 
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We introduce the functionals 
F[p; AQ, sey AL, Zi41) sey Zk; Ri, sey Rx (A-14.1.5) 


and 
F[p;\Z,-.., AZ, Ra,..., Ri] (A-14.1.6) 


where | < k, and \ > 0 is an arbitrary parameter. 
Let 1 (x), 2(x) be the corresponding solutions to (A-14.1.2) for the func- 
tionals in (A-14.1.5), (A-14.1.6), respectively: 


l k 
2/3 he 2/3 AZ; €? Z; €? 
(37”) Imp?! (x) = iy Ix “Ry a Dy xB] 
j i= 
2 3./ J / 


2/3 A? ‘AZ; e? 1 
(3n2)?/ a Ae) = Ae [atx qn (A-14.1.8) 


For simplicity of the notation only, we have suppressed the dependence of /, 


p2 on A, k, l. 
By setting 


2) 2/3 he? 2/3 tr ie 
(30°) mp ?3 (x) =Q;(x), j=1,2 (A-14.1.9) 
we obtain from (A-14.1.7), (A-14.1.8), upon subtraction, 


Quloe) ~ Qats) = YO ES — 08 fats [ou e) — pale’) 


1 (2mp ae 2 [43 1 3/2 3/2 
32 ( h2 ) fa ral 1 (@)—Q (%’)]. 
(A-14.1.10) 
Since the sum over j in (A-14.1.10) is non-negative, [Qi(x) — Q2(x)] 
cannot be strictly negative for all x otherwise this will be in contradiction 


with the equation (A-14.1.10) itself. 
We introduce the set 


c= { x | Qi(x) — Qa(x) <0 \ (A-14.1.11) 


14.1 Lower Bound to the Multi-Particle Repulsive Coulomb Potential ... 771 


and we will show that this set is empty, thus concluding that Q1(x)—Qo2(x) > 
0. 

We assume that S is non-empty and then run into a contradiction. As 
we move away from the boundary 2 of S$, [Qi(x) — Q2(x)] changes sign or 
vanishes, by definition of S', and we then have 


A- V[Qi(x) — Qo(x)] > 0 (A-14.1.12) 


otherwise, we would run into a region beyond S where [Qi(x) — Q2(x)] 
is still strictly negative. [If S is of infinite extension the non-negativity of 
fi - V[Qi(x) — Q2(x)] on the boundary still holds.| 

The application of the Laplacian to (A-14.1.10) gives 


k 
V?[Qi(x) — Qo(x)] = —40 S- Z; e°5°(x — Rj) 


j=l41 


2 2mB = 3/2 3/2 
+ Ate h2 (3n2)2/8 1 (x) —Q5 (x) 


(A-14.1.13) 


and for x in the set S, the expression on the right-hand side of this equation 
is strictly negative since lai’ 3 (x) — - *(x)| <0 for such x by hypothesis. 
Accordingly, 


0> Jo V?[Qi(x) — Q2(x)] = [o2a- Vas — Q2(x)] (A-14.1.14) 
S Q 


in contradiction with (A-14.1.12), hence S' is empty and 
Q1(x) — Qo(x) 20 (A-14.1.15) 


as a function of x. 
In reference to the functional 


F[p; Zi41,---, Zk, Rigi,---, Re] (A-14.1.16) 
let p3(x) satisfy 
k 2 
22/3 fi? 9/3 Ze ® fax! u ' 
pein = » reed eee d3x/ 
Or) 2m pa (x) — |x—R,| < : ea 


(A-14.1.17) 
in analogy to (A-14.1.7), (A-14.1.8). 
We define 


g() = F[pij;AZ1,..-,AZt; Zsa, +--+) Ze, Ri,-.., Re] 
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= F[p2;AZ1,...,AZi,Ri,..., Ri] 
— F[p3; Zi41,---) Ze, Rigi,---, Re] (A-14.1.18) 


with | < k and the p; non-negative. Since for \ = 0, 91 and p3 denote the 
same density, and p2, in (A-14.1.8) is obviously equal to zero for A = 0, as 
the left-hand side of (A-14.1.8) is non-negative while its right-hand side is 
non-positive for \ = 0, we may infer that 


g(0) = 0. (A-14.1.19) 


We will show that 
g(1) > 0. (A-14.1.20) 


From (A-14.1.19), we may write 


gl) = fo g'(A) (A-14.1.21) 


and hence to establish (A-14.1.20) it is sufficient to show that g/(A) > 0 for 
O<AK<L1. 

To the above end, we note from (A-14.1.1) with Z, > AZ, ..., Z) > AZ, 
p— py that 


O 
Dre Pui rZiy s+ AZ Zit +++ Zi Ra, ++ Re] 


I 
me 22/3 A ays 25 XZ; 
z [e =) (80) Imp”! ye < |x — R,| 


Z () 
22 j 2 [43 rn | 2 
e€ S- | =RA fax: =i) py P(x) 
j=l41 
1 
ae fax Ry) 


LZ, Ue 
ise 20>) > RR] + 2% DS me a 
(A-14.1.22) 


On account of (A-14.1.7), the expression within the square brackets of the 
x-integral in the first term on the right-hand side of (A-14.1.22) is zero. An 
expression similar to the one in (A-14.1.22) for 
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0 
Or F[p2;AZ1,...,AZi,Ri,..., Ri] 
may be also readily derived. Hence from (A-14.1.18) 


l 


0 = [p1(x [pr(x) = p2(x)] 
we 2 ea 2 fe ee 


i=1 jal+1 
1 
= x Z{Qi(Ri) — Q2(Ri)] > 0 (A-14.1.23) 
7a 


where we have used (A-14.1.10) and (A-14.1.22), thus establishing (A- 
14.1.20). Here we note that the summation over j in the first term in (A- 
14.1.10) is from (1+ 1) to k, while the one on the extreme right-hand side of 
(A-14.1.23) is over i from 1 to I, and there are no ambiguities in the expression 
n (A-14.1.23). 

Accordingly, from (A-14.1.18), (A-14.1.20) we obtain 


F[pi;Z1,.--,Z,Ri,..-,Re| > Flpo;Z1,...,Z1,Ri,...,Ri] 


+ F[p3; Zi4i,---, Ze, Riz1,.--,Re] 
(A-14.1.24) 


for any 1 < 1 < k, where p1, 2, p3 are the densities which provide the 
smallest values for the corresponding functionals, respectively. 

Since |, k (with | < k) are arbitrary natural numbers, (A-14.1.24) implies 
that 


k 
Fl po; Zijes 03 Dey Riga Re] S S> F [ees Zi, Ri] (A-14.1.25) 


i=1 


where each F’ [P's Zi, Ri| is the TF functional (13.1.7), evaluated with the 
TF density p',, with nuclear charge Z;\e|, situated at R;, and the mass m of 
each negatively charged particle simply scaled by 7. That is, 


h? 2/3 Ze? 1 
3q2 2/3 i = i ee je / i nN 
(377) Imp (pre (x)) x—R,| € x r= xy PTR ) 
(A-14.1.26) 
Upon replacing x by x + R, and setting 
pre +Ri)=nrr(x)| (A-14.1.27) 
ZZ; 


where npr(x) is the TF density (13.1.7) of §13.1, we obtain from (A-14.1.25) 
and (13.1.42)—(13.1.45), (A-14.1.4), 
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F[p; Z1,--.,Ze,Ra,...,Ra] > GErr(1 ee (A-14.1.28) 


for arbitrary positive p(x), where Epp(1) = —1.5375 (me*/2h?) correspond- 
ing to particles of masses m. 

The basic inequality in (A-14.1.24), shows that a system identified by 
the parameters [Z1, ...,Zk, Ri,--- ,R;| cannot have an (optimized) energy 
functional (A-14.1.1) less than the sum of the (optimized) energy functional 
of any two subsystems identified by parameters [Z1, ...,41,Ry,... ,Ri], 
[Zi+1; ..-, 2k, Ryyi,--- Ry], |< k. Because of this last property, the Theo- 
rem embodied in the inequalities (A-14.1.24), (A-14.1.25) is referred to as a 
“No Binding Theorem”.!° 


14.2 Lower and Upper Bounds for the Ground-State 
Energy and the Stability of Matter 


14.2.1 A Lower Bound 


For anti-symmetric normalized functions U(x,01,...,xnon) of N elec- 
trons, we have for the expectation value of the Hamiltonian H in (14.1) 


N p? N 1 
wim) = 2a) - Dae ecw) 
i=1 i=1 j=1 2 J 
N k 
1 AVA 
iy. (v —— uv) a ee 14.2.1 
» [xi — x5 » |R; — R,| ( ) 


t<j 


To derive a lower bound to this expectation value, we recall the definition 
of electron density 


p(x) = N SS [ax ... xy |U(xo1,x202,...,Xwon)|? (14.2.2) 
O15--5O0N 
normalized to 
[ex p(x) =N (14.2.3) 


which we will use in (A-14.1.28), and the lower bound (4.6.24) to the expec- 
tation value of the kinetic energy for s = 1/2: 


'° This important result was discovered by Teller (1962) and was established rigor- 
ously by Lieb and Simon (1973); Lieb (1976) where the existence of a consistent 
positive solution of (A-14.1.2) for the density is also studied. 
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N 2 2/3 49 
Pp; 3 30 h Je 5/3 
> 3 wlts 
S (w|Pilw) > 2 (=) x, [exe (x) (14.2.4) 


i=l 


For the second term on the right-hand side of (14.2.1), we have the explicit 
equality 


Sze (alls 


i=l j=l xi 


¥)= 320 fas RoR (x). (14.2.5) 


We derive a lower bound to the third term in (14.2.1) by using, in the 
process, the lower bound in (14.1.3) for the Coulomb potential energy of 
repulsion of the electrons. To this end, we first note that 


$e Jona ( 


1 
v) =e" [etx dx! 000) Er) 


|x — x;| — x;| 
(14.2.6) 
and hence from (14.1.3) 
N 
1 2 1 
Sie (v Ix: — xj] uv) S > [ eax’) 0’) 
i<j ‘ 2 
~ (392)9/8§ _ _ fga.¢ 96/8 NEre(1 14.2.7 
( Tv ) 1072mB x p (x) + 6 TE ( ye ( afte ) 


From (14.2.4)—(14.2.7), we then obtain the following lower bound for 
(14.2.1) 


h? p(x) 
(U|H|) > (an2\5/8 Te fat p!3(x) 4, e * fax 


e A ZZ, 
— | d’xd?x’ wd NEpp(1 
+ =f x x! poe) a eri vr (1) 
(14.2.8) 
where we have set 
3 (3n\73 — (3n?)5/3 1 
5\ 4 Bre 6B f1\7% 1 1 moe 
(3n2)5/3 Vag B > Br ie 
52 


For a positive 3’ we must choose 3 > (47)?/3. 
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The sum of the first four terms on the right-hand side of the inequality in 
(14.2.8) coincide with the expression on the left-hand side of the inequality 
in (14.1.1) with ( in the latter replaced by (’. Hence 


k 
(U|H|W) > B'Epp(l) S> 2]? + BN Epp (1) (14.2.10) 
i=1 
or 
ar 
(W|H|W) > Brp(1) | BN + —= (14.2.11) 
hy Fi 
G3 
Optimizing over 3, we obtain 
1/2 
B= (4n)?/ |14 aa (14.2.12) 
giving finally the Lieb-Thirring bound?! 
1/27? 
Ve 
2/3 a 
(W|H|W) > Epp(1)(40)?3N + (30 | (14.2.13) 
where ‘i 
me 
Eyp(1) = —1. ea 14.2.14 
se(l) = -15375 (| (14.214) 


If Z corresponds to the nucleus with the maximum charge, in units of |e], 
then 


k k 
SoZ eo Bena (14.2.15) 
i=1 i=l 
giving for the ground-state energy Ey the lower bound 
me? 2 
Ey > —8.3104 (Fe) N [1 i Z8) (14.2.16) 


where we have used the fact that W is arbitrary and hence (14.2.13) is true 
for the ground-state as well. The numerical coefficient 8.3104 may be further 
reduced but we will not attempt to do so here. 


4 Lieb and Thirring (1975). 
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14.2.2 Upper Bounds 


A quick and rather conservative upper bound for Ey may be derived by 
considering the following trial determinantal function 


ion. anew = aie eee (14.2.17) 
(j,k =1,...,N), where 
o;(x,0) = ¢ (x = L) xi(0) (14.2.18) 


with normalized spin functions x;(o), which for simplicity make be taken to 
be all the same, and 


ox) =] (+ cos (=) , ail <L (14.2.19) 


i = 1,2,3, and is zero otherwise, x = (x1,22,23). We choose the vectors 
LS, ee Le as follows 


LO SJDG Ue: 2h (14.2.20) 


and we may choose 
4L < D. (14.2.21) 


It is easy to see that the intervals: {jD-—L < a; < jD+1}, for j = 
1,...,N, are disjoint, for each ¢ = 1,2,3, and the functions ¢(x — L)) are 
then non-overlapping, and orthogonal with respect to each of the components 
x, of x. 

We choose 

R;=L®, j=1,...,k. (14.2.22) 


The above construction consists of conveniently placing the k nuclei at 
Lh. ee Pe. and one electron in each one of the k boxes with centers at 
LY, L“). One electron is also placed in each of the remaining (N — k) 
ficient boxes with centers at Tey. ae Le ). The Coulomb potential 
being of long range, interactions occur between particles in the different boxes 
as well. 

Due to the localizations of the functions ¢;(x, a), as described above, the 
electrons are well separated, and we may write 


|x; -—x,| 2 D/V2, iFj (14.2.23) 


and bound the repulsive e—e interaction term as 


N 2 N 
DS ees < v2 25%), (14.2.24) 
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From (14.2.22), (14.2.20), |R; — R;| > D for i # j, and we have the 
inequality 


k 
ZL; e? 
LZiZj- 14.2.2 
XR, R,_ Ry“ pe (14.2.25) 
Finally, we use the conservative bound 
N ok kok 
Zje Zje? 
= < = 
2 mR SPR 
k 
Ze? 
<- 
dR 
k 
Ze? 
=-yo (14.2.26) 
a lite L®| 
to obtain 
k 3 
us a d x 2 
W|A|v) - Zie ——— (x 
cra < (afb Sele) Dae f Same 
N k 
V2S (+>) 2:25] . (14.2.27) 


i<j i<j 


The kinetic energy part is explicitly given by 


(oly: )- nee (S) (14.2.28) 


d?x Or eee A. 
| aw t- eae aay) (14.2.29) 


since |x| < /3L in the latter integral. 
All told, we obtain 


Ls 


and 


3hen? N e? e? = : 
(UES an 5 V2S°(1)+ 5° Z;Z;|. (14.2.3) 
V<j <j 


Optimization over L gives 


L= 3V3 2 (4) (14.2.31) 
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and leads to the bound 


4 me* e? eet E 
(U|HIY) <-55 (Fa ) Nt V2S°(1)+ 552,25). (14.2.32) 
<j i<j 


We may choose D large enough to make the second term as small 
as we please in comparison to the first one (e.g., make it equal to 
0.00031(me?/2h?)N) to obtain 


me? 
(UW |H|V) < —0.0450 (Sa) N. (14.2.33) 

Since ® does not necessarily coincide with the ground-state wavefunction, 
and the configuration positions of the nuclei does not necessarily correspond 
to the lowest possible energy, (14.2.33) leads to an upper bound for Ey: 


me? 
< —0. == : Lis 
En < 0.0450 (SE) (14.2.34) 


with the upper bound having the same power of N as the lower bound. 

As an estimation, the coefficient 0.0450 in (14.2.34) may be significantly 
increased. For example, and quite formally, however, we may consider the 
following infinitely separated N clusters: k ions (atoms), each in the ground- 
state, of nuclear charges Z;|e|,..., Z,|e| having each one electron, and (N—k) 
free electrons with vanishingly small kinetic energies. Formally, the ground- 

k 
state of such a system is — )> Z?me*/2h?, and since Z? > Z;, we obtain 
i=1 
met 
ves (3) N. (14.2.35) 
thus increasing the above coefficient to one. 

Note that the lower bound in (14.2.13), and the upper bound 
— yo", Z?me*/2h? given above (14.2.35) imply the following interesting con- 
clusion. Suppose for some q,2 <q <N, 2, =...= 2 9,Zq41 =-.- = Ze = 9, 
then the ground-state energy will grow not slower than (—1)N?. Stability 
then implies that as more and more matter is put together, thus increasing 
the number N of electrons, the number & of nuclei in such matter, as sepa- 
rate clusters, would necessarily increase and not arbitrarily fuse together and 
their individual charges remain bounded. That is, technically, as N — oo, 
then stability implies that k — oo as well, and no nuclei may be found in 
matter that would carry arbitrary large portions of the total positive charge 
available. With such bounded positive charges, (14.2.16), (14.2.35) provide 
bounds linear in N for large N. [Of course Z is bounded in nature.| 
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14.3 Investigation of the High-Density Limit for Matter 
and Its Stability 


This section addresses the important physical problem of investigating 
rigorously the high-density limit of matter and of its spatial extension, as 
a function of N, as the amount of matter is increased in the light of our 
analysis in §3.1 (see (3.1.27)—(3.1.37)). In reference to this analysis, we recall 
the words of Paul Ehrenfest to Wolfgang Pauli, quoted in the introduction 
to this chapter, regarding as to why matter occupies such a large volume of 
space. Our strategy of attack is the following. For a non-vanishing probability 
of having the electrons within a sphere of radius R, we prove rigorously that 
the radius of this spatial extension grows necessarily not slower than N!/3 
for large N. (Manoukian and Sirininlakul (2005)). An explicit quantitative 
statement will be also made regarding this inflation of matter. In passing, the 
result embodied in this investigation immediately provides also alower bound 
to the average spatial extension of matter as a function of N. 


14.3.1 Upper Bound of the Average Kinetic Energy of Electrons 


in Matter 


Let |W(m)) denote a strictly negative-energy state of matter, not necess- 
arily the ground-state, 


—en[m] < (Y(m)|H|¥(m)) <0 (14.3.1) 


where —ey[m] = Ey < 0 is the ground-state energy, and we have emphasized 
its dependence on the mass m of the electron. 

By definition of the ground-state energy, the state |W(m/2)) cannot lead 
for (U(m/2)|H|U(m/2)) a numerical value lower than —ey|[m]. That is, 


—en[m] < (U(m/2)|H|(m/2)) (14.3.2) 


where we note that the interaction part V of the Hamiltonian H in (14.1) is 
not explicitly dependent on m: 


N ok N k 
Ze? e? Z,Zie" 
V=- i seed 14.3.3 
Seen Seoer mee, 
i=1 j=1 J i<j J i<j J 


Accordingly (14.3.2) implies that 


N92 
—éen|[2m] < (v6 | (>. Pe + v) 


Upon writing, trivially, 


v(m) (14.3.4) 


14.3 Investigation of the High-Density Limit for Matter and Its Stability 781 


N 2 
‘i 


Pi “\p “\ p? 
a t= > at (>: rare v) (14.3.5) 


the extreme right-hand of the inequality (14.3.1) then leads to 


N N _9 
3 vio) <= (won|(S PE sv) 


(v6 


which upon multiplying by two, (14.3.4) gives 


(a6 None 


a p 
> 3m 


i=l 


Pp; 
4m 


v(m) (14.3.6) 


v(m) < 2€y [2m] (14.3.7) 


for all states |W(m)) such that (14.3.1) is true including the ground-state. 

Thus from (14.3.7), (14.2.15), (14.2.4), we have the following bounds for 
the expectation value T of the total kinetic energy of all the electrons in such 
states 


3 (3n\7/? FP met 2 
= (2) = fax p(x) <7 < 16.6208 (Sw [14 27) 
2 (=) | x p?!?(x) < 16.6208 72 + 
(14.3.8) 
Now we are ready for the main investigation of this section. 


14.3.2 Inflation of Matter 


Let x denote the position of an electron relative, for example, to the center 
of mass of the nuclei. We define the set function 


_ J 1, if x lies within a sphere of radius R 
xR(x) = ‘i otherwise. ae) 
We are interested in the expression 
N 
2 i (11 Bx; a) [W(x101,....xvon)? 
O14-+50N i=1 
= Prob [|x:| < R,...,|xw| < RB] (14.3.10) 


which gives the probability of finding all the electrons within the sphere of 
radius R. 
Clearly, 


Prob [|xi| < R,...,|xw| < R] < Prob [|xi] < R,...,|x;| < R| 


1 
<...< Prob [|xi| < R] = W [ex XR(x) p(x) (14.3.11) 
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for 7 < N, with p(x) given in (14.2.2). 
By Holder’s inequality in Appendix II, 


[28x xnls060 < ( as 0) ( fax xno) (14.3.12) 


where x?! (x) = vr(x), 


[ox XR(x) = UR (14.3.13) 


denotes the volume in which the electrons are confined. 
Hence, in particular, (14.3.11) gives 


Prob [|x1| < R, ee) xn < R] -< Prob [|x| < R] 
2/5 3/5 
< = ( fox 6°! (x) (14.3.14) 


and (14.3.8) finally leads to the simple bound 


N 2/5 1 2/5 6/5 
Prob [|x1| < R,...,|/xw| < R] (=) < (=) 10 [i+ 27/9) 


UR ao 
(14.3.15) 
where ag = h?/me? is the Bohr radius and Z is the maximum of the nuclear 
charges. 

We immediately infer from (14.3.15) the inescapable fact that necessarily, 
for a non-vanishing probability of having the electrons within a sphere of 
radius R, the corresponding volume vr grows not any slower than the first 
power of N for N — o, since otherwise the left-hand side of (14.3.15) would 
go to infinity in this limit and would be in contradiction with the finite upper 
bound in (14.3.15). That is, necessarily, the radius R of spatial extension of 
matter grows not any slower than N'/3 for N — oo. (Manoukian and Sirin- 
inlakul (2005)). No wonder why matter occupies so large a volume! 

In turn, one may infer from (14.3.15), that the infinite density limit 
N/vpR > ©, ie., of the system collapsing onto itself, does not occur, as 
the probability on the left-hand side of (14.3.15) goes to zero in such a limit 
upon multiplying the latter equation first by (vr /N es >, 

From (14.3.14), (14.3.8), we may also write 


1 HW TVA 6/5 
¥ d°x xn (x) p(x) = Prob [[x| < R] < (=) (=) 10 [1 + 2/3] 


N ag 
(14.3.16) 
This immediately leads a lower bound to the expectation value 


 bx| |x: 
(s x ) = S- J aixr dy os i! SiGe. in NOK) 


i=l 
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= y d?x |x| p(x) (14.3.17) 


for a measure of the extension of matter. Using the facts that 


1 3 3 + | ax of 
x [ax blood > =f ax boo > ff ax pt 


|x|>R |x|>R 
= R Prob ||x| > R] (14.3.18) 
Prob [|x| > R] = 1 — Prob [|x| < R] (14.3.19) 


UR = 47 R3/3, and (14.3.16) we obtain 


N 2/5 2/5 
[x Re 4 2/3 oe 
sad ee en 2) F462 . (14.3.2 
o DR Na : of1+ 27/9) (14.3.20) 


Upon optimizing the right-hand side of the above inequality over R, this 


gives 
1\3/6 73 \ 1/8 1 
R=(=— — N1/3_______ 14.3.21 
(z) (=) "pea te 
leading for (14.3.20) the explicit bound 
N 
x3] Nv 
0.02575 ag ———. 14.3.22 
(s iS ie “OT + 2/8) ( ) 


14.4 The Collapse of “Bosonic Matter” 


We derive lower and upper bounds for the ground-state energy ER for 
“bosonic matter” consisting of N negatively charged (spin 0) bosons and N 
positively charged motionless bosons of charges —|e|, +|e| respectively, with 
Coulombic interactions. That is, we consider the Hamiltonian 


p? 
Pp; 


aren (x1,---,Xv;Ri,...,Rw) (14.4.1) 


where 
V(x1,...,Xn;Ri,...,Ry) 


ig a eal le eae Mere 


V<J i=1 j=l 
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the masses m1,...,my of the negatively charged particles are not necessar- 
ily taken to be equal. Here the x,;, R; respectively, refer to the negatively, 
positively charged particles. 

N 


Since )> p?/2m, is a positive operator, we may consider the bounds 
i=1 


te te < iG 14.4.3 
y 2m i=l 2m; y 2m 


where m = max(m,;), m = min(m,), and it is sufficient to consider, respec- 
uv a 


tively, the Hamiltonians 


N2 
Hi) Pt + V(x1,...,x0)Ra,...,Rw) (< H) (14.4.4) 
i=l 2m 
and 
Np? 
Hus), Piet V(x1,....%0)Ra,...,Rw) (2 H) (14.4.5) 
i=1 2m 


in determining the lower and upper bounds for the ground-state energy EX. 
The analyses will give rise to an N®°/? law!? for such systems. 


14.4.1 A Lower Bound 


We first reconsider the TF theory in §13.1 when the electrons are replaced 
by (spin 0) bosons. 
In this case, the density in (13.1.1) becomes replaced by (see also (9.8.5)) 


—~2mV(r) \ 3”? 
np(x) = a (=) (14.4.6) 


with € = 0. The TF energy functional (13.1.6) becomes then simply replaced 
by 


FP [np] = (3n2)9/9 (=) [ax 200) - 2e? [ £% nao 


1072 |x| 


e? 1 


That is, in the TF theory for bosons we simply have to replace m by 
m/2?/ in the TF theory for electrons. Hence, in particular, EB,.(Z) for boson, 
we have from (13.1.43): 


2 Dyson and Lenard (1967); Lenard and Dyson (1968); Lieb (1976, 1979); 
Manoukian and Muthaporn (2003b). 
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~ x Err(Z). (14.4.8) 


In the TF-like energy functional in (14.1.1), likewise, we replace m by 
m/2?/8, 

For the expectation value T of the total kinetic energy for bosons, we may 
use, for example, the bound in (4.6.26) 


Sf Bae \ Ae Pox 
as 2 < /3 < 
; (5 ) oa Jc x p/3(x) <T (14.4.9) 
where 
x) = N [ ax: ... Pxy |W (x, x2,...,Xy)]?. (14.4.10) 


All told, we then have from (14.4.10), (14.4.8), for the basic inequality 
corresponding to the present case, 


3 (30 \7/3 Fe 
(wim) > 2 (sr) a fx 0°69 


h? ots (x) 
_ 2\5/3 3,. 5/3 = 2 [43 p 
(37°) i0n2a & ) [a x p?/?(x) ) Ze fox Sa 


j=l 
e i * Z,Z;e2 BN 
==: fd d? x’ = a(x at Epr(1 
+5 x © A) i) + Rt IE TR(1) 
(14.4.11) 


where now we have to make the replacements Z; — 1, k — N, but we will 
do that later for greater generality. 
Upon setting 


3 (3n\73 — (3n2)5/3 92/8 
5 \QN 5B 
2)5/3 
(377) / 92/3 
= 
we obtain instead of (14.2.10), (14.2.11) in the present case 


1 
= 14.4.12 
B" ( ) 


(U | Hy |v) > f Bre(1 So al + Be Bret) (14.4.13) 
or 
ae 
(U|AL|v) > sya Bre) BN 4 =! . (14.4.14) 
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Optimization over (3, this gives 


fe 1/2 
» 2" 
B= (4nNY?/3 J 1+ aa (14.4.15) 
leading to the bound 
k 7/3 1/2 
U|H|W) > Epp(1)(20)?2/8 NP J 14 Z 14.4.16 
N 
i=l 
or from (13.1.43), (13.1.45) 
7 1/2] 2 
4 2" 
me he 
W| AL |W) > —5.2352 ( —— ) n*/3 | 1 ———— 14.4.1 
(0 fn, |) > ~5.2352 (TE) ws | | Sh (144.17) 


[The numerical factor 5.2352 may be decreased! further but we will not 
attempt to do so here.] 
Accordingly, for the bosonic system at hand we have (Z; — 1, k > N) 


4 
ER = —20.941 (Fe) ners. (14.4.18) 


14.4.2 An Upper Bound 


We consider arbitrary N > 8. Since (N/8)1/° is some real number, it may 
be written as 


(N/8)13 =n+e (14.4.19) 


where n is a strictly positive integer and 0 < « < 1. Let 


8n? = k (14.4.20) 
then we have the useful bounds 
ate (1+ =) <(V-»)< ze (1+ =) (14.4.21) 
nr nm n nr 
1< (i+ =) 29. (14.4.22) 


13 See Manoukian and Sirininlakul (2004). 
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We introduce the trial wavefunction U(x,,...,xy) by considering the 
following construction. We put!* « of the negatively charged particles and 
k of the positively charged particles in a box of sides 2L, 2L, 2L centered 
at the origin of the coordinate system. The remaining particles are placed 
as follows. We consider (NV — «) non-overlapping boxes, each of sides 2D, 
2D, 2L9 which also do not overlap with the box at the origin. The centers 
of these (N — &) boxes are defined by the tip of the vectors te, 
(see Figure 14.1), where 


LY = jD(1,1,1), j=1,...,N—k. (14.4.23) 


To ensure that all the (N—«)+1 boxes are non-overlapping, it is sufficient, 
but not necessary, to choose 


6L< b1g< D: (14.4.24) 


Fig. 14.1. The figure shows the regions (non-overlapping boxes) where parti- 
cles are localized. The centers of the boxes are situated at 0, L™,...,L~") 
with the latter (N —«) vectors being along the vector (1, 1, 1). The sides of the 
box at the origin are equal to 2L,2L,2L, while the ones of the other (N — &) 
boxes are 219,220, 2L0, where Lo > L. The Coulomb potential being of long 
range, there are non-trivial interactions between particles in different boxes 
as well. 


The numerical factor 6 is chosen for convenience to simplify the algebra. 


' The following analysis is based on: Manoukian and Muthaporn (2003b) deriving 
the upper bound for all N > 8 and is an extension of the classic work of Lieb 
(1979) for N restricted to N = 8,64, 216,.... 
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Now we put (N — «) of the positively charged particles at the centers of 
the (N — «) boxes introduced above by choosing 


Raa Lc Ryo il! (14.4.25) 


and we put one negatively charged particle in each of the (N — &) boxes. 
R,,...,R, are chosen to lie in the first box, with the latter centered at the 
origin, as will be discussed later. 

To localize the negatively charged particles in the (N — «) + 1 boxes as 
discussed above we introduce the following normalized trial wavefunction: 


a 3 obe(a1))--- 61%) 
x W1(X(M:,41)) --- Pw—n (X(t) (14.4.26) 
where the sum is over all permutations {71,...,7y} of {1,...,N} such that 


[2x ¥200¥j00 = 6. fox 6" Goi) =0, [0° JOGO? = 
(14.4.27) 


Since W does not necessarily coincide with the ground-state wavefunction, 
we have for the ground-state energy EX 


Ex < (¥|Hu|¥). (14.4.28) 


We choose the following localized single-particle trial wavefunctions con- 
sistent with the above construction by placing the negatively charged particles 
in the (N — «) + 1 boxes: 


(x) = II (Jes (5z)) = x(x), lail < L (14.4.29) 


i =1,2,3 and is zero otherwise, and for 7 = 1,...,N —k, 
x) 
1 ee Ui) G) 
uj(x) = [] preg Lo = o1,(x LY V5 |x; —L; |< Lo 


(14.4.30) 
and are zero otherwise, 7 = 1, 2,3. 

Since the intervals {-L < 2; < L}, {jD-—LIo < a < JD + Lo} for 
j=l,...,N—k, (¢=1,2,3) are all disjoint, the wavefunctions ¢(x), ~;(x) 
are non-overlapping and automatically satisfy (14.4.27), with orthogonality 
relations holding with respect to each component x; of x. 

The single-particle average kinetic energies are given by 


=P f exivooe) 2 = i). (14.4.31) 


14.4 The Collapse of “Bosonic Matter” 789 


he : Pee cay Cae cies 
Ten aa x|Vu;(x)/? = Oi (=) =T (14.4.32) 


and for the multi-particle state 


N 
h2 
Se ae f ex ... dbxy IV j;U(x1,...xn)|? = [KT? + (N — aaa 


ie 

(14.4.33) 
as is easily checked since the functions in each of the products in (14.4.26) 
are orthogonal with respect to each of the components of x. 


A detailed calculation gives 


N 2 
e 
(U|Hy|W) = [xT° + (N — «)T] + (Vi) + (Vo) 4 pay By (14.4.34) 
i<g sd 
where 
N ie 1 
(Vi) = -e? Joo ——__ 2 (x) + | ——___— 
y Ix — Ry” kk +L -R,| 
1 
oo fale ae 7, (X) (14.4.35) 
e+ -R,| 
(V2) = ce = 1) [ox d?x’ $3 ge (x’) 
ss 2 Hi |x — x’| ea 
N-k 1 
+2n fax atx d(x) pi dio®) 
jal [x - x’ — LY | 
N-k 1 
462 je d2x! 62 2 (x’). (14.4.36 
° pe ac oN) eee ( ) 


By noting the overall negative sign of (Vi), we may bound the latter as 


(i) < “2S foxy ay e?(N — &) [expt (14.4.37) 


Ix| 


where we have conveniently chosen an upper bound with the summation 
going up to «, and in writing the second term, we have chosen only the terms 
with 7 =«+1,...,j = N, respectively, for the (N — «) terms multiplying 
7, and used, in the process, (14.4.25). 
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Since |x| < V3Lo in the last integral in (14.4.37), we obtain 


(Vi) < “2S fexy Ry i a (14.4.38) 


To derives an upper bound for (V2), we note that 
| > V3D, iL us L®| > V3D (14.4.39) 
for i # j, and because of the vanishing properties of $7(x), ¢7,(x) outside 


the corresponding intervals in questions, we obtain in reference to the second 
and third set of integrals 


1/2 
4/3L 
x—x!— L®| > | _ AVL (14.4.40) 
ra 
where we have used (14.4.24), and similarly 
[x — x + 2 - 1] > D. (14.4.41) 


Accordingly, the second and third set of integrals on the right-hand of 
(14.4.36), combined, may be bounded above by 


(14.4.42) 


xc k) + NaN K- ») 


thus obtaining 


e? 1 e?(N —K)(N+«-1) 
ig f 3 Dal ghd 2 / . 
(Va) < Fale 1) fax ax’ $2 (0) ob!) + - 
(14.4.43) 
Finally we use the bounds 
IR; -R;|>D (14.4.44) 


for 7 = K+1,...,N and all i such that 1 < i < 7. This follows since for 
i=n«+1,...,N (such that i < 7) we may use the equalities in (14.4.25), 
while for i=1,...,%, |R;| < V3L and (14.4.44) follows. The decomposition 


N j-l1 
1 

ee —R,| 25 R > ik, -R)| aH (14.4.45) 

oe ao] : zs j=kK+1 t=1 

then leads to 
2(N +«—1) e2 

WU |Hu|¥) < xT°+(Hi)+(N— ris 14.4.46 
(UW |Hy|W) < 6T°+(A1)+(N-k) : aE | | 
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+ Sh(n —1) [ox d?x’ 62, (x) —— 62 (x’) +e? S- 


|x — x’| — 
VJ 


1 
IR; —R,| 
(14.4.47) 


In the appendix to this section, the following upper bound for (H;) is 
obtained 
(H,) < antl (14.4.48) 
by appropriately fixing the positions of the « positive charges. Again this 
latter configuration does not necessarily correspond to the lowest possible 
energy, and hence we obtain an upper bound to E% given by 


3h? 1 e? 4/3 


pee eee 
NS Qn 2" 6L" 


(14.4.49) 


+(N ofS ON +K=1) a9 


Bm 12 aly J3Lo 
where we have set D = xLo, with x > 6, which will be conveniently and 


consistently chosen. 
Optimization over L and Lo, gives 


2A? 4 272 1 
= ia og Lo ne Weed (14.4.50) 
me? K, me 2 2/3 kK 
x 
with 
Nike 
0< 2 Rye ; < vB t/a (14.4.51) 
Wi 


and with «!/3 > 2, we may choose « = 2\/3(N + « — 1), which is obviously 
larger than 6, giving 
_ 3/307 h? 


Lo = (14.4.52) 


me? 
The last term on the right-hand side of (14.4.49), involving the (N —x) factor, 
then leads to a strict negative contribution proportional to N, that is with a 
power of N less than that of the sum of the first two terms with LD as given in 
(14.4.50). Hence we may further bound the right-hand side of (14.4.49) from 
above by the sum of the first two terms only. 

Accordingly, we obtain the strict upper bound 
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me* N®/3 1 


E8 
NS ~ ORF O7n2 (1 +e/n) 


(14.4.53) 
for all N > 8, where we have used the fact that (see (14.4.19), (14.4.20)) 
6,78 3 
K=N[1+=] = 8n’. (14.4.54) 
n 


From (14.4.22), a conservative bound is [1 + ¢/n]° < 2° for this factor in 
(14.4.53). For the cases where ¢ = 0, 


1 me? 
B — 5/3 
Ey (35 ) Nee: (14.4.55) 


More generally, for all 0 < e < 1, and for large bosonic systems, e.g., with 
n2A4,ie., with N > 512, 


1 me* 
B pane 5/3 
EN <-335 Gs ) N°! (14.4.56) 


and more interestingly for larger systems, e.g., with n > 50, ie., for N > 10°, 


1 me* 
Bee ao (35) NP. (14.4.57) 


A larger numerical coefficient than 1/ (307?) of (me*/2h?) N°/3, may be 
obtained but this will not be attempted here. 

We note the presence of the same power N°/® for the lower (14.4.18) and 
upper bounds (14.4.53)—(14.4.57) for EB. 

We have thus established the main objective of this section by showing the 
power law behavior N°/? for bosons, implying the collapse of such systems. 
It is interesting to point out that the collapse is not a characteristic of the 
dimensionality of (Euclidean) space and persists in arbitrary dimensions!° 
(see also Problem 14.7) with 1/r interactions. The N°/? law for bosons just 
established corresponds to fixed positively charged particles (i.e., infinitely 
massive motionless point particles). As mentioned in the introductory section 
to this chapter this is non-academic. The problem with the positively charged 
particles treated dynamically with finite masses as well restricted, however, 
to the Coulombic interaction, gives rise to a N7/° power law implies again 
the instability of such systems and we refer the reader to the literature!® for 
the derivation of this law. 


'® Muthaporn and Manoukian (2004a). 
6 Dyson (1967); Manoukian and Muthaporn (2002); Conlon et al. (1988). For the 
corresponding law in other dimensions see, Manoukian and Muthaporn (2003a). 
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Appendix to §14.4: Upper Bounds for (H;) in (14.4.47) 


Consider the following integrals written in terms of dimensionless vari- 
ables 


K d3 
BiGeXice hay = © fog’ (A-14.4.1) 
j=l d 
KP fas agp 92 1 27. 
In(k) = = fa x d’x’ f (x); af) (A-14.4.2) 
x — x’ | 
: 1 
I3(K; X1,..., Xe) = A-14.4.3 
s(6 Xie) = oe reree) 


where f(x) is a real function defined by 


f(x) = g(#1)9(#2)9(#3) (A-14.4.4) 


g(a;) 4 0 for —1 < a; < 1, and g(a;) = 0 otherwise, g(x) = g(—2), 


i g(x) da = 1. (A-14.4.5) 


-1 


By choosing, in particular, 


g(x) = cos S (A-14.4.6) 
setting X; = R;/L, and noting that «(« — 1) < «?, for the coefficient of the 
second term in (14.4.47), we conclude that 


I(«;X1,..., Xx) (A-14.4.7) 
where 


T(«;X1,...,X,) = Ly («; Xy,...,X,) + Lo(K) + Ig(w; K1,..., KX, ). 
(A-14.4.8) 
Therefore, to derive an upper bound for (H;) we may work directly with the 
expressions in I(K;X4,..., Xx). 

We partition!” the unit interval [0,1] into n subintervals: 0 = ap < a1 < 
...@n = 1 such that ‘ 

* de @(0) = > 

2n 


aj-1 


(A-14.4.9) 


for 7 = 1,...,«. By doing so, we divide the box of sides 2,2,2 into (2n)? = 
8n? = « smaller boxes. 


1” We follow the construction of Lieb (1979). 
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Let a; = a; — aj;—1, then we note that 
pee (A-14.4.10) 
j=l 


A box of sides a;, aj, a; may be denoted by Bj;;. We label the smaller boxes 
thus generated arbitrarily by Bi,...,B,, and note from (A-14.4.4), (14.4.5) 
that 


1 1 
| eo dad 3 (A-14.4.11) 
B; 8n3 ok 
for? =1,...,k. 

The ineotale in (A-14.4.1), (A-14.4.2) may be rewritten as sums of inte- 


grals over such boxes as follows: 


I(«;X,...,X«) = cine x Roxy (x) (A-14.4.12) 
3 3 i 1 2(./ 
ems d?x ie d°x eof). 

(A-14.4.13) 


Now we place X, in box By, X2 in box Bo,...,X, in box B, and then 
average the expression of I(K;X1,...,X,), over X1,...,X, by multiplying 
it by the normalized density 


P(%) sas =I (Kf?(X (A-14.4.14) 
= PX f(x) re 7 
and integrating over Xj,...,X,, respectively, over the boxes B,,..., By, to 


obtain for this average the expression 


(I(m;X1,...,X eee fh dbx a Px! f? (x — xf re) 


i=1 j=1 
on d?x [ dx! f a) 
ef d?x im d?x’ f? (x le af f(x’) 


= K “ / 1 / 
=a. ax d?x POR al): 


(A-14.4.15) 
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By the definition of an average, there must exist at least one set of 
{X,,...,X,,} with X, in box B,,...,X, in box B, such that 


I(«;Xy,...,Xn) < («3 Xa,...,X%,)) (A-14.4.16) 
That is, with such a set {X4,...,Xx}, 


ae 1 
I(«K;Xq,...,Xe)<-— a3 d?x! f? 2(x"), 
(GX Xe) SF Df ae fae Poy LO) 


(A-14.4.17) 
A box Biz, of sides aj, aj, a; may be inserted in a sphere of radius 


5/07 + a4 + a7 and from the normalization condition (A-14.4.11) we have 
(see Problem 14.6) 


3 3y/ af 2 I 2 1 F 
dx [| dx! f2(x) P(x’) > 
Biju Biju re x’ | ,/ 02 + a4 +a? K 
(A-14.4.18) 
Accordingly, we may bound I(«;X ,...,X,) as follows 


n 
a a2 = 
i,j,l=1 14/a? + 0% +a? 


where, by symmetry, the factor 8 takes into account all of the boxes, since 
the summation over i, 7,1, accounts only for n? boxes corresponding to 0 < 
vm <1,0<%<1,0< 273 <1. 

By noting that for a; > 0, 


Te i X (A-14.4.19) 


kK 
1 
S° Va— =K (A-14.4.20) 
i=l VG 


an elementary application of the Cauchy-Schwarz inequality then yields 


K K i 
(>: «) (>: 1) > K? (A-14,4.21) 


i=1 t=1 
or oe 

1 K? 

(>: | >. (A-14.4.22) 
ay 

a8)? TE.) 
1=1 
Applying this elementary inequality to (A-14.4.19) gives 
1 
T3662 X83) (A-14.4.23) 


~ Joy tait+az 
i,j,l=1 
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On the other hand, 


So fetatto?< SS (tay + ay) = 3n? (A-14.4.24) 


ij l=1 ijl=1 


where we have used (A-14.4.10), which from (A-14.4.23) leads to 


I(«;Xq,...,X%) < -sn* = ——— (A-14.4.25) 


since « = 8n3. The improvement of the bound in (A-14.4.25) is not ruled out. 
From (A-14.4.7), the inequality in (14.4.48) then follows. 


Problems 


14.1. Establish the equality in (14.2.6). 

14.2. Verify the details leading from (14.4.11) to (14.4.17). 

14.3. Derive the expressions for the average kinetic energies given in 
(14.4.31)-(14.4.33). 

14.4. Show that the expectation value (V|Hy|W) is as given in (14.4.34)— 
(14.4.36). 

14.5. Repeat the derivation of the upper bound for E% given after (14.4.19) 
by considering the following construction: Rewrite (14.4.19) (if e1 4 0) 


113 
N 

(=) =n + €1, 0<e, <1 

where the natural number n; > 1. Let ky = 8n}, and continuing in 
this manner, we have (if e2 4 0) 


(wa) 


) 113 
8 ) =na+é2, 0O<e2<1 


and so on, by defining in turn kp = 8n3,...,k» = 8n3, you will reach 
a natural number 6b such that 


ae 


= €b4+1; O < €p44 <i. 


For example, for N = 58410, b = 5. For 6 > 1, place k,,...,k», pairs 
of negatively and positively charged particles in b non-overlapping 
boxes, and the remaining 2[N — (ki +...+ky)] particles in other 
[N — (ki +...+ kp)] non-overlapping boxes with each containing one 
positive and one negative charge. By such a construction, can 
you increase the numerical factor multiplying —N°/? (me*/2h?) in 
(14.4.53)? 


14.6. 


14.7. 


14.8. 


14.9. 


14.10. 
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Let p(x) be a charge density vanishing outside, a region R correspond- 
ing to a total charge Q, i.e., 


[exo =@ 


R 


such that |x| < R for x in R for some R. Show that 


bis fa fix oon Pra ie 


Repeat the derivation of the upper bound for ER in $14.4 in ar- 
bitrary dimensions v [with 1/r potentials] to show that ER < 


he 
a positive constant cp(v) depending on v. [Ref: Muthaporn and 
Manoukian (2004a).| 
Follow the proof given in Appendix B to §13.1, which shows that the 
TF density satisfying (13.1.7) actually gives the smallest value for the 
functional (B-13.1.1), to prove simply that po(x;*) in (A-14.1.2) pro- 
vides the smallest value for the functional (A-14.1.1), ive., (A-14.1.4) 
holds true. 
Derive the equality in (A-14.1.22), and the corresponding expression 
for 


- (35 ) en(v)N (2+¥)/v and obtain an explicit expression for such 


Ar [p23 AZ}, tee ,AZI3 Ry, tee , Ri] 

to finally obtain the result given in (A-14.1.23). 

Derive the counterpart of the expression for the probability in (14.3.15) 
and the counterpart of the lower bound in (14.3.22) for “bosonic mat- 
ter”. Do these bounds provide useful information on the nature of such 
“matter” as they do for matter with the exclusion principle? Explain. 


15 


Quantum Scattering 


Significant progress has been made over the years in physics through 
scattering experiments. The discovery of the atomic nucleus, the visible tracks 
of particles observed in cloud chambers, the determination of the structure 
of matter, the emergence of the endless variety of particles by accelerators 
and the extraction of information on their interactions, are just a few of the 
examples concerned with the analyses of scattering processes. 

In the present chapter, dealing with the theoretical development of quan- 
tum scattering, we approach the basic problem of scattering as a time evo- 
lution process involving, in general, three stages: the preparatory stage oc- 
curring in the remote past to the interacting stage, followed finally by the 
detection stage in the distant future. As a time evolution process, asymptotic 
boundary conditions arise, dictated by the physical situation under study, 
linking the three stages. These aspects are investigated in §15.1, §15.2. Special 
emphasis is put in the latter section on the connection between the momen- 
tum a particle has acquired in a collision and its emergence spatially within a 
cone on its way to a detector. Differential cross sections are studied in $15.3. 
In terms of number of particles, we here recall the definition of a differential 
cross section as the number of particles scattered per unit time into a solid 
angle, about the scattering angle, divided by the flux, with the latter being 
the incident number of particles per unit area per unit time. In a scattering 
process, the detection of a scattered particle is carried out away from the for- 
ward direction to avoid or minimize interference effects between incident and 
scattering components of the interacting state. The analysis of the scattering 
process in the forward direction leads to the so-called optical theorem which 
together its physical interpretation is the subject matter of §15.4. In this sec- 
tion, a phase shift analysis is also carried out dealing with the expansion of 
scattering amplitudes in terms of angular momentum states. §15.5 provides 
a detailed treatment of Coulomb scattering which requires special attention 
due to the long range nature of the underlying interaction. In §15.6, §15.7, we 
will see how the elegant functional formulation in Chapter 11 may be used 
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in scattering theory, and an application is then carried out to scattering at 
small deflection angles at high energies. §15.1 through §15.7 deal with elastic 
scattering theory where the initial and final particles in the process are the 
same and their is no change in the internal energies of the particles involved 
so that the total kinetic energy of the system remains conserved. Inelastic 
processes and the underlying theory are treated in §15.8 which deal with 
cluster of particles as introduced in §2.5. Here we also elaborate on some 
subtleties of systems involving three particles. In the final section (§15.9), 
we consider the energy loss of a charged particle moving through a medium 
(hydrogen). In the same section, this is followed by a treatment of neutron in- 
terferometry dealing with the splitting and recombination of a neutron beam 
with an investigation of the interference effect, resulting upon recombination, 
in the Earth gravitational field. 


15.1 Interacting States and Asymptotic Boundary 
Conditions 


In a scattering process, one would initially prepare a particle in some 
state, say, |®in(¢)), let it eventually interact with another system, assumed 
widely separated in the beginning of the experiment, and then finally study 
the outcome of the process. In the preparatory stage, in the remote past, 
before a particle participates in the scattering process, its interaction with 
the other system, if it is of short range,' is negligible and the state |®;,(t)), 
for t + —oo, develops in time via the free Hamiltonian Hy = —h?V? /2m, 
where m is the mass of the particle. In time, as the particle approaches the 
other system in question and its interaction with the latter becomes non- 
negligible, the particle would be described by a state |q_(t)), satisfying the 
time-dependent Schrédinger equation 


(ins, - i) |b_ (t)) =0 (15.1.1) 
ot 

where H # Hp is the total Hamiltonian, and |®;,(t)) would provide the 
asymptotic boundary condition for |y_(t)) in the limit t > —oo. 

Similarly, a sufficiently long time after scattering, as a particle emerges 
from such a process, and if its interaction, assumed of short range, with 
the remaining system becomes negligible, one, in a statistical sense, may 
investigate its localizability in a detection region and enquire as well of the 
momentum it has acquired that has led it to such a region. In this detection 
stage, one may then assign a state, say, |®out(t)) for such a statistical study 


| By an interaction of short range, it is meant an interaction which vanishes faster 
than the Coulomb potential at large distances of separations of the particle in 
question from the rest of the system, such as (distance)? for distance — oo, or 
faster. 
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which, for t — +00, develops in time via a free Hamiltonian. Before the 
interaction becomes negligible, however, the particle would be described by 
some state |7)+(t)), satisfying the time-dependent Schrédinger equation 


(ny 2 i) Ib. (t)) =0 (15.1.2) 


with |®out(t)) taken as the asymptotic time limit t — +00 of |w+(t)). 

The purpose of this section is to study the nature of the fully inter- 
acting states |~+(t)) and their asymptotic boundary conditions: |%)+(t)) 
| Pout jin(t)) for t + too. The scattering of a particle off a Coulomb potential, 
as of a long range interaction, will be treated in detail in 815.5. 

Consider the retarted/advanced free Green functions G§ (zx, 2’), intro- 
duced in §9.1 (see (9.1.12), (9.1.35)), where we have used the convenient 
notation? (t,x) = 2, given by 


en ery Cm gta | 
Gi (a2) =+ nf (anh)! G ~ e Fi i) ; é— +0 (15.1.3) 
(2 —2')p = (x—x’)-p—(t-#)p® (15.1.4) 
(dp) = dp°d*®p (15.1.5) 


with boundary conditions (see (9.1.9), (9.1.33)) 


Go (z,2')=0 for t-t $0 (15.1.6) 


satisfying the differential equations 


(ing - Ho) Go (2, 2") = +ihd* (a — 2’) (15.1.7) 


where Hy = —h?V?/2m. 
We introduce the integral equation 


AREA ak * / (dx") G®. (a, 2”) V (a) Ag (2,0") (15.1.8) 


where V(x’) is a given potential which may be formally considered to be 
time-dependent, in general, and 


(da) = dt’ d?x”. (15.1.9) 
It is then easily verified from (15.1.7), (15.1.8), that 


? Needless to say, this is just a convenient notation having nothing to do with a 
relativistic notation. 
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ba (0) = / (de’) Ax (2,2) Bout/in (2°), (15.1.10) 


upon applying the operator [ih 0/0t — Ho] to the latter, that they satisfy the 

interacting Schrédinger equations (15.1.1), (15.1.2), where H = Hj + V, and 

use has been made of the facts that [ih 0/0t — Ho|®out/in(w) = 0. That is, 

the compact expressions in (15.1.10) provide solutions of the fully interacting 

systems. We will eventually consider only time-independent potentials. 
From (15.1.8), (15.1.10), we note that w(x) may be rewritten as® 


We (x) = Bout /in (x) x / (da’) (da) Ge. (x, a’) V (x"") 


x Ax (2, x’) Bout /in (x’) (15.1.11) 


We will see that the solutions q+ (a) formally satisfy the asymptotic bound- 
ary conditions* 74 (x) 4 ®out/in (7) for t + too. 
To the above end, we introduce the Fourier transforms 


V (2) ={[ eiPt/h VY (p) (15.1.12) 


1 / 
Ax (2", 2’) = | SP as (np pyeP Oe Pein (15.1.13) 


® oat jn (2) = i a z [20h 5 (p° — p?/2m)] e?*/" Goutyin (Pp) (15.1.14) 


and we may rewrite the second term on the right-hand side of (15.1.11) as 


- (p) (15.1.15) 


—i)p°— 
/ (dp) e [p°-E(p)]t/h eilp-x-E(p)i]/h pe, 
(2rh)* [p° — E (p) + ie] : 


_ (dp’’) (dp’) " "oy : \ on 10 / 
F.(o)= | Dany PB AL WP) Gouna (P)2ARS (P oe 
(15.1.17) 


eee 


sere 
In reference to the p°-integral in (15.1.15) we note that 
3 Equations such as (15.1.11), (15.1.21), (15.1.33) are usually referred to as Lipp- 


mann-Schwinger equations. 
+ The nature of the t > --co limits will be further discussed below. 
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lim 


eo i[p°-E(p)]t/h 0 
cA eet See 15.1.18 
t++00 [p° — E (p) + ie] { ( ) 


—2rid(p° — E(p)) 


as obtained by closing the contour of integration, in the complex p°-plane, 
from below with Im (p°) < 0 on the infinite semi-circle part of the contour. 
For the —ie term the contour does not enclose the pole p? = E (p)+ie, which 
lies in the upper half p°-plane, thus giving the value 0. For the +ie term, the 
pole p® = E(p) — ie is in the lower half p°-plane giving —27id (p° — E (p)) 
by the residue theorem with the minus sign arising since the direction of the 
contour is in the clockwise direction. Similarly, by closing the p°-contour from 
above, we have 


(15.1.19) 


ee Pe = jeune - E(p)) 
t+—oo [p? — E(p) Fie] | 0. 


That is, in particular, 


eo i[p?-E(p)]t/h 
lim —W¥W—W¥__—_ = 0 15.1.20 
t—00 [p® —E (p) +- ic] ( ) 


which from (15.1.15), (15.1.11) formally establish the asymptotic boundary 
conditions w+ (@) > ®out/in (@) for t + too. 

The following representation for 4 (x) is easily obtained from (15.1.11), 
(15.1.15) 


(dp) eo i[p?-E(p)]t/h 
2h) [p° — E (p) F ie] 


eiP—E(p)tl/h pr (p) . 


vn () = Bourn (2) + f 


(15.1.21) 
Also we note that by using the integral representation 
dp) (dp’ ; ied 
54 (2 —2') = / (dp) (dp’) os ) 54 (p — p’) elPt/Fe ipa’ /h (15.1.2) 
(27h) 


one obtains from (15.1.8), (15.1.13) the following integral equations for 
Ax (p,p'): 


(eecs ! 1 (dp”) " "oy 
Ase) =P) + rae cig | Gaytl OPAL OP). 
(15.1.23) 


For a time-independent potential V (x), 
Vp) = | (az)er” * V(x) 


= (2h) 8 (p) V (p) (15.1.24) 
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and we may write 
Ax (p,p') = 6 (p° — p”) Ax (p,p’;p°) . (15.1.25) 


Upon substitution of (15.1.24), (15.1.25) in (15.1.23), and integrating over 
p’, the equations in (15.1.23) reduce to 


1 
[p° — E (p) Fie] 


d3p” 
ae e V (p—p”) As (p", psp). 


Ax (p,p';p°) = & (p—p’) 4 


(2mh)* 
(15.1.26) 
As we will see in §15.3, the following object defined by 
f (p,p’) = —407h m ([p° — E (p)] AX (p, p's p*)) (15.1.27) 
p°=E(p)=E(p') 


turns out to be important in scattering theory, and for reasons discussed 
there is referred to as the scattering amplitude. The restriction with a bar, 
on the right-hand side of (15.1.27) indicating to set p? = E(p) = E(p’), is 
referred to as the energy shell restriction. 

For a time-independent potential, w+ (a) in (15.1.21) becomes 


a3 dp’ ei[E(p’)-E@)]t/n 
Wt (x) _ Deut /in (x) i ES E 3 7 i 
(2rh)” (20h)? [E (p’) — E (p) Fie] 
x elk P-E(p)t]/h Fx (p, p’; E(p’)) D sut/in (p’) (15.1.28) 
where 


F, (p,p'; E(p')) = / @p" V(p—p") Ax (p",p'sE(p"))._(15.1.29) 


Ax (p,p’) are related to the full Green functions G: 
which satisfy the equations 
O 


lin - a| Gx (a, 0’) = +ihd* (2 — 2’). (15.1.30) 


(x, x’) in the theory 


It is readily verified, that the solutions of (15.1.30) are given by 


Gx (a, 2’) = GY (a, 2’) = = | (ae") Go. (a, 2”) V(x") Gz (2, 2’) (15.1.31) 


by the application, in the process, of the operator [ih 0/0t — Ho] to the latter 
and using (15.1.7). 
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Upon carrying out the Fourier transforms 


d 7 “ err 
Gt (x, x’) — sin [ oe eipa/h e 'P = en (p, p') (15.1.32) 


we note from (15.1.31), that the G' (p, p’) satisfy the integral equations 


4 ' 
Gx (00) = Bee Bp) ET 
” ” 
+por@en/ om’ CF ) Gx". (15.1.33) 
By comparing (15.1.33) with (15.1.23), we may infer that 
Ax (p,p') = Gz (p,p') [p — E(p’)] . (15.1.34) 
For a time-independent potential, 
Gz (p,p') = Gz (p, p’;p”) 5 (p® — pv”) (15.1.35) 


and we have 


Az (p,p'sp") = Gz (p, psp”) [p” — E(p’)| (15.1.36) 


which will have an important application in determining transition ampli- 
tudes in scattering processes in §15.3. 

We close this section, by elaborating rigorously on the nature of the limits 
\ws (t)) > | out /in (t)) for t + too, under some sufficiency conditions. To 
this end, consider a time-independent square-integrable potential V (x). We 
treat the time t — +00 limit only. The t — —oo limit may be treated in the 
same manner. 

Suppose that we are given a normalized state |®out), which in the x- 
description satisfies the condition 


ex [Bout (x)| < C < cv, (15.1.37) 


and develops in time as |®out (t)) = exp (—itHo/h) |Pout). 
The solution of (15.1.2) is given by 


I+ (t)) = exp (—itH/h) |b+) (15.1.38) 


and the asymptotic boundary condition |7)+ (t)) + |®out (t)) may be defined® 
by 


> Such a limit is referred to a strong one which in turn implies the weak limit 
(x | (+ (6) — Bout (t))) — 0, for t — +00, as follows by an elementary appli- 
cation of the Cauchy-Schwarz inequality in conjunction with (15.1.39) for any 
normalizable state |x). 
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lim || (t) — Bout (£) |] = 0. (15.1.39) 


t—+0o0 


The unitarity of the operator exp (—itH/h) implies from (15.1.39) that 


Tim |f4 — 2(8) Pouell = 0 (15.140) 
where 
Q(t) = exp (itH/h) exp (—itH/h) . (15.1.41) 


That the limit t — +00 in (15.1.40) exists follows by noting that 


(2 (41) = 9(¢2)) ou) = fdr (L007) [Bou 


te 

i eee , 

; | dr eb H/h y e-itHo/h G4) (15.142) 
te 

leads to 


| (ts) — 22) doutll < ;| [ dr |V Gaus (1 MF (15.143) 


On the other hand from (9.1.4), (9.1.8), 


- fe Eee im|x — x’? ; 
Bout (x, 7) = (==) fa x! exp (eet Bout (x!) (15.1.44) 


which from (15.1.37) gives 


3/2 
[Pout (7) < (5) c fire? (15.1.45) 
and ih 
m 
IMG I< (Se) Civil [InP (15.1.46) 


From this inequality, we may conclude that the left-hand of (15.1.43) 
vanishes for |ti], |t2| > oo. That is, {0 (t) |®out)} forms a Cauchy sequence® 
whose limit (in the strong sense), denoted by |W), exists for t + +00, thus 
establishing (15.1.39)/(15.1.40).” 

A similar analysis may be carried out for |¢_(t)) corresponding to a state 
|®;,(t)) in the limit t > —oo. 

The operators? 0., 01, defined as the (strong) limits of 9 (t), M1 (#): 


® See property (iv) in the definition of a Hilbert space in §1.7. 

” The condition in (15.1.37) may be relaxed but we will not go into these details 
here. 

8 Some authors interchange the + signs in 04 with + corresponding to the limits 
t — +00 in their notation. 
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Q4 = lim SUaTE pole (15.1.47) 
Ql = Jim eee wie (15.1.48) 


are referred to as Moller wave operators. In terms of Q+, we may write 


| ) =. | ®out sin) : (15.1.49) 


15.2 Particle Detection and Connection between 
Configuration and Momentum Spaces in Scattering 


Consider the scattering of a particle off a given potential of short range.? 
In the remote past, before any interaction occurs, the particle is prepared in 
some state |®;, (¢)), for t + —oo, and develops in time via the free Hamil- 
tonian Ho. Eventually in time, the particle would feel the presence of the 
potential and would be described by an interacting state |7_ (t)), satisfying 
the Schrédinger equation in (15.1.1), consistent with the boundary condition 
(815.1): |b_ (t)) > |®;, (¢)) for t + —oo. 

Typically, one would then enquire about the probability of finding the 
emerging particle from the scattering process beyond an arbitrary large radial 
distance within a cone,!° with apex at the scattering center (the origin of the 
coordinate system), on its way to the detection region (see Figure 15.1). This 
probability is given by 


Prob[x € Cp] ah dx |b_ (x, t)|? (15.2.1) 
Cp 


for t positive and large, w_ (x,t) = w_ (2) is given in (15.1.11), and 
Cp = CoN {|x| > D} (15.2.2) 
where D will be taken to be large, and for a given unit vector N, 
Co = {x: |x| > x-N 2 a|x|, for some a € (0, L]}. (15.2.3) 


In the preparatory stage, a wavepacket is prepared which propagates with 
some average momentum, say, p’ whose direction may be taken to define the 


° The scattering off a Coulomb potential is given in §15.5, where it will be seen, in 
particular, that the time development of the preparatory state does not develop 
via the free Hamiltonian Ho due to the slow decrease of the Coulomb potential 
at large distances. 

10 Such a point is emphasized to a large extent in: Dollard (1969); Amrein et al. 
(1970); Manoukian and Prugovetki (1971); Prugovetki (1971); Amrein (1981), 
and others. 
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To the detector 


O 


Fig. 15.1. After a particle emerges from a scattering process, a non vanishing 
probability of finding the particle within a cone, beyond an arbitrary large 
radial distance D, implies that the momentum it has acquired is directed 
within the same cone in conformity with one’s perception of scattering. 


z-axis of the coordinate system chosen. The packet will be assumed to have 
a given lateral width, i.e., in a direction perpendicular to p’, denoted by 2a. 


We first set the detector in (15.2.1), at a large distance |x| = , [xj + 22> 


D from the scattering center such that |x,)| > a, so that no appreciable 
interference occurs between the incident part ®;, (2) and the scattered part, 
represented by the integral on the right-hand side of the equation for w_ (x) 
in (15.1.11). We will consider the scattering process for |x| <a, ie., in the 
forward direction, later in §15.4. 

So what we need is a wavepacket which at time t > 0, corresponding to 
the detection time, of the form?! 


®i, (x,t) = [ex Go (x12) x (x\,)  (z,t) (15.2.4) 


11 A particular choice of a wavepacket is not necessary, but the form given in (15.2.4) 
clarifies many aspects of the theory. 
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where x (xi,) vanishes for 


xi > a. For a uniform lateral distribution this 


may be taken simply to be a step function!!>!? 


x (xi) = (15.2.5) 


On the other hand, €(z) is so chosen that its Fourier transform € (q) = 
*(q —p’) has a pronounced peak at q = p’ and the only appreciable value of 
\#(q —p’)|° occurs in the neighborhood of this point. With E (q) = q2/2m, 


e dq i[qz-— iat 
get) = fe elle POH 5 (q— py). (15.2.6) 


By a change of the integration variable from q to Q, 


Q=q-p (15.2.7) 
and 
¢ ~p?+2'Q (15.2.8) 
since only a small Q? is important, we may rewrite 
PA bee d j gob ~ 
E(t) alleen PSE le) 5(Q) (15.2.8) 
oo 2TH 
or ; 
€(z,t) = ellP’=-2(0') 4/4 (: = Zt) (15.2.10) 
m 


The integral multiplying € (z,¢) in (15.2.4) will be denoted by x(x), ¢) 


1 
x (pt) = Ts dx, GY. (x1, x4)5¢) (15.2.1) 


which from (9.1.38) is given by 


m m Pate 
x (X54) = 535 S d?xq) exp a (x — xi) _ (15.2.12) 


This integral has many interesting properties.'* It may be expressed as 


12 More precisely, y (x})) may be taken to vanish smoothly at |x| = a, in a con- 
tinuous manner, starting at 1/W7a? for |x’’| = 0, to avoid technical problems 
arising near the boundaries of the wavepacket. This, however, complicates the 
analysis to some extent. 

'3 For the relevant details of x (x)),t) in (15.2.12), (15.2.13) see the appendix to 
this section. 
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+, ,2 
x (x4) = 55 Ta el” 2" F (u,v) (15.2.13) 
with 
ma? [| 
es ey teal 15.2.14 
wa v re (15 ) 
? . 2 
F (u,v) = an | p dp Jo (pv) eu/?)e (15.2.15) 
0 


where Jo (pu) is the zeroth order Bessel function, and we have the normal- 
ization condition 


1 oe 2 
=a f vu du |F(u,v)|" =1 (15.2.16) 
that is, Il (2) || = 1. 
Att=0, x (x), 0) vanishes for |x| > a. On the other hand for t 4 0, 
and |x| > a, corresponding to the position of the detector set, mentioned 
earlier, 


x (xj,4) = O ((a/ |xi1)°”) (15.2.17) 
In the sequel, we consider the limits!* 
[x| > a > h/p' (15.2.18) 
and 
a’p’ /h|x| >> 2r. (15.2.19) 


In (A-15.2.11), we will see by noting that m |x| /t ~ p’, and this implies 
from (15.2.19), that 


i 2 Ax) |x (xy), ¢)[? 1 (15.2.20) 
x|| ga 


and the lateral width of the wavepacket does not spread significantly. 
To compute the probability in (15.2.1), we use the representation 


0 mn 1 isi dp® —ip®(t—t")/h 
Gee) = Qrih |x — x" | is Inh © 


x exp (, vam" |x — x'!) (15.2.21) 


in the integral equation for ~_ (x) in (15.1.11). For |x| > D, D large 


‘ For example, for a ~ 107? meters, |x| ~ 1 meter, a’p'/h|x| ~ 
p'1078 /kg.meter.s~', where h/p’ denotes the wavelength of the packet. For 
h/p' ~ 10~1° meters, a?p’/h|x| ~ 104. 
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m aie | ame Oe i 
Gee) an i (jv 2m H) 


x exp (- 5.20" n- x") (15.2.22) 


where 


n = x/ |x|. (15.2.23) 


For , |x| > D, D large, we see from (15.2.17), (15.2.18), (15.2.22), 
that ®;, (x,t) may be neglected on the right-hand side of (15.1.11). That is, 
no interference!> between the incident and scattered components of w_ (x) 
needs to be considered. 

Using the Fourier transforms of V (x), A_ (a, a’), in (15.1.12) /(15.1.24), 
(15.1.13)/(15.1.25), respectively, we obtain from (15.1.11), (15.2.22), 
(15.2.17), the asymptotic equality 


eillallx|-E(a)e]/h 


w- (x, t) = 


aah [| les Qnh)* 
F_(|q/n,q; E (q)) Pin (a) (15.2.24) 


for |x) ce > a‘/? |x|, in particular, and hence for |x)|| >> a. Here 


in (q) = / d3x e7'4*/? &.., (x, 0) (15.2.25) 


or 
~ 1 ~iq,) x! - 
Pin (Q) = Te dx} eum? (qu — p’). (15.2.26) 
Upon writing ~_ (x,t) = v_ (|x| ,n;t), we already see from (15.2.24), that 
a particle initially prepared with momentum q, with distribution provided by 
2 


le, (a)| , the particle emerges from the scattering process with momentum 


|q|n in the direction n. In particular, for a particle emerging, in configuration 
space, within a cone Co in (15.2.3), its momentum would be directed within 
the same cone. This is in conformity with one’s perception of a scattering 
process. 

In (15.2.24), (15.2.26) we make a change of the variables of integrations 


yj); x! to Q, p 
| vp [h|x| 1 


'® We will look into the forward direction, that is in the effective direction of prop- 
agation of the incident packet, in §15.4. 
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and note the invariance of the product measures 
d?qy d°x}) = d’Q d’9’. (15.2.28) 


From the limit in (15.2.19), the integral over p’ corresponding to the x’- 
integral in (15.2.26) becomes effectively replaced by (27h)? 63 (Q). On the 
other hand since 7 (q, — p’) has a pronounced peak at q, = p’ and is appre- 
ciably non-vanishing in the neighborhood of this point, we can carry out the 
expansion gq? ~ p’? + 2p’ (qi — p’) as in (15.2.8) for expi[qi |x| — E (qi) t] /h 
in (15.2.24), and we may effectively evaluate F_ (qin,qi;E(q1)) at q, = 
(0, p’). 

Accordingly, we obtain asymptotically for w_ (x,t) in (15.2.24), 


i diaceeea 2 Ie O 
oe, A NY al ". B(p! i. ||P’ |lx|-£(e )t] XS TRS: 


wy (x, t) =e 


where we have used the fact that 


aed g p ont pl 
I. oan oS (f G mt) ) ad) =a ixl— ot (15.2.30) 


and qd’ =qi —p’. 
For D large, we then have from (15.2.1), (15.2.30), 


3 2 m \? / he 1\\|2 
[axe coor= f a0(sh2) IF mp Pe) 


pene 
gy hel Fe) 
xf dx] (15.2.31) 


where 
Co (n) = {n:1 >n-N 2a, for some a € (0, 1}} (15.2.32) 


and the unit vector N, and a are chosen such that |x| >a. 
In the next section we will use (15.2.31) to obtain an expression for the 
differential cross section of the process. 


Appendix to §15.2: Some Properties of F (u, v) 


The function’® F (u,v) defined in (15.2.15), of the variables u, and v 
in (15.2.14), may be expressed in terms of so-called Lommel functions. For 
u/v <1, 


‘6 For more related details, see: Manoukian (1989); Wolf (1951); Born and Wolf 
(1975). 
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2m iu/2 : 
F (u,v) = an [Uy (u,v) —i U2 (u,v)] (A-15.2.1) 
and for u/v > 1, 
2m f. —iv?/2u + iu/2 iu/2 
F (u,v) = a ie —ie'’/* VY (u,v) —e'/* Vy (u,v) (A-15.2.2) 


where U,, (u,v), Vn (u,v) are Lommel functions defined by 


Un (u,v) = S>(-1) ie Joj4n (0) (A-15.2.3) 
j=0 

Vn (1,0) = 2 (1! (2) dayne (0) (A-15.24) 
j=0 


and Jm (uv) is a Bessel function of the first kind of order m. 
In particular 


(A-15.2.5) 


(A-15.2.6) 


and 
pe v dv [UZ (u,v) + UZ (u,v)] = . [Jo (u) cosu+ Jy (u)sinul (A-15.2.7) 


F' (u,v) satisfies the normalization condition in (15.2.16). 
For u/v <1, 


an 1 
Fue) _ 27 Qiv/2 7, (y)| = - O (=) (A-15.2.8) 
U U U 
giving rise to the behavior in (15.2.17). 
For u/v > 1, 
2 ; 2Q2mi 1 
Fae) — ere 4 2A emg (w= Zo(2).  (ats.2.) 
U U U U 


Finally we note from (A-15.2.7), (15.2.13) that 
| d?xy |x (x,t) |? =1— (Jo (u) cosut Ji (u)sinu) — (A-15.2.10) 
x|| <a 


and for u > 1, 
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i : x) [x (x)),) |? = 1. (A-15.2.11) 
x|||<a 


For u = 50, the right-hand side of (A-15.2.10) attains the value 0.92. The 


condition (15.2.19) implies that u >> te and for p’ ~ m |x| /t, this leads 
to the condition u >> 27. That is, under this limit the lateral width of the 
wavepacket does not spread significantly. 
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15.3.1 Expression for the Differential Cross Section 


In reference to (15.2.4)—(15.2.6), 


Saye 
x dfx| [n |x| — Zt | é3 2 
i: ( ~ | dx, [ az Pm Owe) 5.3.1 
|x)||<a as 


D Ta? D a? 


denotes the probability, per unit area, that the incident particle flows through 
the plane z = D at time t, for both large, in the absence of a potential.” 
On the other hand, the left-hand side of (15.2.31) represents the probability 
that an incident particle is scattered within a cone, and is found beyond a 
radial distance D at time t. Accordingly for a sufficiently narrow cone, i.e., for 
which a in (15.2.32) is close to one, n — N, we obtain, by dividing (15.2.31) 
by the probability per unit area in (15.3.1), the expression for the differential 
cross section ays Sete 
aa = (so) IF-('n, ps E(P'))| 
where the conservation of energy is evident. 
One may define the scattering amplitude 


; (15.3.2) 


m 
f(p.p)=-545 [ ap" Vp-p") Ap" PLE) (15.33) 
with 
p=|p’|n (15.3.4) 
to rewrite simply 
do 112 
qq 7 If (PP) (15.3.5) 


where we have used the definition of F_ (p, p’; E (p’)) in (15.1.29). The reason 
for choosing the multiplicative (—1) factor in (15.3.3) will be seen in the next 
section when studying the so-called optical theorem. 


1” In the presence of a potential, the probability of observing the particle in the 
forward direction is altered due to the interference of the incident wavepacket 
and the scattered amplitude in that direction as will be investigated in the next 
section. 
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From (15.1.27), (15.1.36), one may also write the scattering amplitude as 
f (p,p!) = —47°h m( [p° - E (p)] 


x G(p,p';p°) [p° — E(p’)] ) 


(15.3.6) 
p°=E(p)=E(p’) 
involving the full Green function G, (p, p’; p°) (see (15.1.33), (15.1.35)) with 
a restriction set in (15.3.6) on the energy shell as indicated. 
The cross section is defined by 


o= f aa\f(p.p)) (15.3.7) 


and a further important relation between o and the scattering amplitude will 
be derived in the next section referred to as the optical theorem. 

For a systematic study of the scattering amplitude f (p, p’) we derive an 
important integral equation which follows from (15.1.26). For arbitrary p°, 


A_ (q,p’;p") = 6 (q—p’)4 


3,/ 
ef dviq—q)A_(q,psp°). (15.3.8) 


Upon multiplying this equation by V(p — q), integrating over q, and setting, 
in general!® 


[aa V(q—q’) A- (q’, p's p°) = T (a, sp”) (15.3.9) 


off the energy shell, gives 


déq V(p—q) 
— T (q,p'sp° 

Qnh)* [p° — E(q) + ie] (a,P52") 
(15.3.10) 

referred to as the T-matrix. In particular, from (15.1.26), (15.1.27) and 


(15.3.10), the scattering amplitude may be written as 


T (p,p';p”) =Vip-p)+ f 


, m 1,0 
=-~—, T ; ; 15.3.11 
f(p.p’) = 555 T (pp sp") are ( ) 
To first order in V, i.e., for a weak potential, 
T (p,p';p”) = V(p— p’) (15.3.12) 


'8 Note that for p? = E(p’), T(q,p’; E(p’)) coincides with F_ (q, p’; E(p’)) — 
see (15.1.29). 
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referred to as the first Born approximation, the differential cross section takes 


the simple form 
do m? 


dQ 42h 
with p = |p’|n. For some applications of the Born approximation see Prob- 
lems 15.1, 15.2. 


\V(p—p’)|’ (15.3.13) 


15.3.2 Sufficiency Conditions for the Validity of the Born 
Expansion 


We carry out a rigorous study of sufficiency conditions for the validity of 
the Born approximation in (15.3.12) and to arbitrary orders in the potential 
energy V(x) starting from the integral equation for the T-matrix given in 
(15.3.10). To this end, with m replaced by the reduced mass yu of a two- 
particle system and for V(x) = V(r), r = |x| as a function of the distance 
between the two particles, suppose that the following sufficiency conditions 
are satisfied: 


i r?dr |V(r)| < 00 and | r dr|V(r)| < o. (15.3.14) 
0 0 


The first condition refers to the absolute integrability of V(x). 
We set p® = E (p’) in the integral equation (15.3.10) and denote 


T(p,p'; E(p’)) =T(p,p’). (15.3.15) 
To first order in the potential, 
T (p,p’) = [ax ei(P-P’)-*/h V(x) (15.3.16) 
and more generally, we carry out an expansion 


T(p,p’) = 5) T™ (p,p’) (15.3.17) 


n>1 


where T”) (p, p’) is of n*® order in the potential. Using the Fourier transform 
in (15.3.16), and the integral (see (9.1.25), (9.1.31)) (E (q) = q?/2u) 


/ d°q eia-x/h _ ” ely 2mp® |x| (15 : 18) 
(2rh)° [p09 —E(q)+ie] 2mh? |x| as 
one obtains the explicit expression (n > 2) 
n-1 ; ip|x1—x2|/h 
7) (pp) = (—gg)” f abas...d8en oP Vox) 
T [x1 — X9| 
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eip|x2—xs|/h eiP|Xn—1—Xn|/h . ; 
se 9 red 6 ee ns (G0 Ma cas as 
a) |X2 — X3| ( 1) IXn—-1 — Xn| rn) 
(15.3.19) 
We may then bound 7”) (p, p’) in absolute value as 
(n) ! eet as (n=1) 
7 (pp)| < (saa) [axa IV een)l |Z? Gen)|—15.3.20) 
where 

IO Gg) = [ax aed ace eee 

V (x1) eip|x1—x2|/h v( iP |Xn-1—Xn|/h (15.3.21) 
x V(x.) ————_... V(xp_1) ——————__. 3. 
‘i = xa] aoe 
Using the inequality 

dQ 4 
Iles a (15.3.22) 

x1 -_ X2| TL 

shown later, we have the following x2-independent bound 
V CO 
[fen < anf ry dr, |V(r,)| (15.3.23) 
|X1 — X2| 0 


which by hypothesis (see (15.3.14)) exists. 
From (15.3.23), we then have 


ae (x,)| < Ca drivel) (15.3.24) 


and, for all n > 1, 


ba (p, p’) 


< ( | «*xiveo!) & Le avo!) (15.3.25) 


where both factors on the right-hand side exist by hypothesis. 
The series in (15.3.17) is then absolutely convergent for 
Qu fr 
7 J, rdr|V(r)| <1 (15.3.26) 


giving rise to a sufficiency condition for the validity of the Born series in 
(15.3.17). 
For a particle of mass m in a bounded potential |V(r)| < Vo of finite range 
V(r) =0 for r > R, the condition in (15.3.26) is satisfied for 
m 


=5 Vo RE Ty (15.3.27) 
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To establish the inequality in (15.3.22), we use the expansion 


foe) L L 
=407 aE (=) ~ ¥¢ m (81) Yi'm (2) (15.3.28) 


rs 


where re and rs denote, respectively, the smaller and the larger of r1, ro. 
Hence 


dQ, An E (r1 — T2) a O (re - a 


x1 — x2| 


(15.3.29) 


ry T2 


and since 9 (rg — 11) /T2 < O(r2 — 11) /r1, and O (r; — r2) +O (rg — 171) = 1, 
the inequality in (15.3.22) follows. Note that an equality in (15.3.22) holds if 
TL > TQ. 


15.3.3 Two-Particle Scattering 


Needless to say, the scattering theory developed above and the expression 
for the differential cross section obtained in (15.3.5) hold for the interaction 
of a particle of mass m = m, with a much heavier (approximately motionless) 
particle of mass m2. If m2 is not large, and the particles are not identical 
and of spin 0, all of the above formulae are still valid in the center of mass 
of the two particles if one simply replaces m = m, in them by the reduced 
mass = m1M2/ (m1 + m2). The transformation to the laboratory system is 
the same as in classical dynamics.'® For definiteness, consider the particle of 
mass mg (the target particle) at rest, and a spherically symmetric potential so 
that the scattering, in the center of mass, is specified by the angle 6 only. The 
scattering angles J; and V2 of the particles of masses m, and mg, respectively, 
in the laboratory system are given by the well know formulae 


tan, = sino /( + cos 0) (15.3.30) 
m2 


Bo = (mr — 0) /2 (15.3.31) 


with the latter giving the recoil angle of the target particle. The differential 
cross section in the laboratory system is given by 


do 


do ,,. d(cos 0) 
qa, (v1) 


= =, (4¢)—— 15.3.32 
ai aa! d(cos V1) ( ) 


'8 Cf. Marion and Thornton (1988), p. 310, p. 326. 
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where do (0) /dQ is the one obtained in the center of mass system. 
For identical (i.e., indistinguishable) particles of mass m, = m2 =m, 


0,=- (15.3.33) 


and we have to use the proper symmetrization of the scattering amplitudes, 
based on the statistics of the particles, since a detector cannot distinguish 
between them. In the center of mass system two possible routes of scattering 
are shown in Figure 15.2. 


(a) (b) 


Fig. 15.2. For identical particles, the detector (set at angle 6-shown in the 
center of mass frame) cannot distinguish between the two possible routes (a) 
and (b) of scattering. The necessary symmetrization, based on the statistics of 
the particles, of the scattering amplitudes leads to interference effects which 
are observed experimentally. 


Consider two indistinguishable particles of spin 0. In this case we have 
to add to f (0) the amplitude f (a — 6) corresponding to the route of scat- 
tering shown in Figure 15.2 (b) before defining the differential cross section. 
The amplitude f (@) is referred to as the direct scattering amplitude, while 
f (a — @) as the exchange one. The differential cross section (in the center of 
mass) is then given by 


do 2 
= IF (8) +f (m9) 
=|f (OP + if (w- AP +2 Re(f* )f(m—98)). (5.3.34) 


For a detector set at 6 = 1/2, do/dQ is given by 4|f ()|? rather than 
2|f (0)|? as the latter would be expected by a naive classical argument. Such 
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a doubling is observed experimentally and is due to the interference term in 
(15.3.34) as a result of the correlation between the particles arising from the 
symmetry of the states under the interchange of the two particles. 

The situation with fermions turns out to be quite interesting. For def- 
initeness consider the scattering of two indistinguishable spin 1/2 particles 
interacting via a spin-independent interaction as above. Let |x), |x’) denote 
the normalized spin states of the particles. The effective scattering amplitude 
would then be given by 


f (8) Ix1) Ix2) — f (@ — 8) |x4) Ix) (15.3.35) 


and the differential cross section is 


< =(f (+ |f (r—0)|? —2 Re(f* (0) f (wm —8)) |(x|x’)|?_ (15.3.36) 


Therefore an interference term arises only if the spin states are non- 
orthogonal. In particular if the spins of the particles are in opposite directions 
of a quantization axis no interference term arises. On the other hand, if the 
spins of the particles are along the same direction, then 


do 2 
qo = |f (0) — f(r -4)| (15.3.37) 
This expression is interesting in the sense that at 6 = 7/2, the differential 
cross vanishes, unlike for the bosonic case, and is verified experimentally. 
Now consider the scattering of unpolarized spin 1/2 particles. We know 
from §2.8, §5.5, from the addition of spin 1/2’s, we have the triplet state, 
corresponding to a spin 1, and the singlet state, corresponding to a spin 0. 
The Fermi character of the spin 1/2 particles, requires from (15.3.36) that 


do ,. . 
59 =I —F(r- 9) 
do 
qq =F OV +1F OP 
(15.3.38) 
do ‘i F 
qq UH =IFOM +f @— 8) 
do 3 
<a LL = LF (8) — F (m - 9) 
with weight factors 1/4 for unpolarized particles, giving 
doy 
cee = LF (OI? + LF (w — 8) — Re (F* (6) F (8) (15.3.39) 


where u stands for unpolarized, and unlike (15.3.36) note that there is no 
factor of 2 in the interference term in (15.3.39). 
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The result in (15.3.39) may be also obtained by noting the symmetry 
properties of the triplet and singlet states. To this end, note from (5.6.21) 
that for spin states of spins s1, So, 


(81,713 82, ™M2|8, mM) = (-1).F8-* (s2,™m2; 81,™4|s,™m) (15.3.40) 


where in the above case sj = 52 = 1/2. Therefore the triplet state (s = 1) 
is symmetric in spin space, while the singlet (s = 0) is anti-symmetric. The 
Pauli exclusion principle then requires that 


do, _ 
dQ 


TE@- FOP +ZEO+F AP (15.3.41) 


which coincides with the expression in (15.3.39 ). The 3 to 1 factors above 
correspond to the number of allowed values for m for s = 1, and 0, respec- 
tively. 

For spin-dependent interactions of spin 1/2 particles, we may use the very 
general analysis carried out in §8.6 for the scattering of a spin 1/2 particle 
off a spin 0 target as well as the one for the scattering off a spin 1/2 target. 
Given an M-matrix relating an initial and final state, the differential cross 
section of the scattering process is given by 


do Tr[M pM] 
dQ Tr [pe] 


(15.3.42) 


where p“) is the initial density operator, and may be normalized as Tr[p] = 
1, and M p Mt is the final density operator. General structures of the M 
matrices were determined in §8.6 and polarized as well as unpolarized incident 
beams were considered. For details we refer the reader to that section. 


15.4 The Optical Theorem and Its Interpretation; Phase 
Shifts 


We derive a relation between the cross section and the scattering ampli- 
tude in the forward direction, which is referred to as the optical theorem. The 
physical meaning embodied in this result will be also emphasized. Finally, we 
decompose the scattering amplitude into partial amplitudes specified by an- 
gular momentum states, referred to as a phase shift analysis, by making use, 
in the precess, of the optical theorem. 


15.4.1 The Optical Theorem 


To obtain the above mentioned relation, we multiply the equation for 
A_ (q, p’;p®) in (15.3.8) by T* (q, p;p°) integrate over q, and use the defin- 
ition in (15.3.9) to obtain after re-arrangements of terms 
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T* (p’,p;p") = [a dq’ A* (q’,p;p°) V(q' — q) A_ (a, psp”) 


Pq ae 1 r 
fans PP”) pag TOP) 
(15.4.1) 


where we have used, in the process, the reality of the potential V(x) to inter 
that V*(q) = V(—q). 

Upon taking the complex conjugate of (15.4.1) and interchanging the 
momenta p © p’, we also have 


T(p, p's p”) = [ea d°q’ A* (q’,p;p°) V(q’ — q) A_(q, psp”) 
d3q 1 


_ T* . T 1, 0 

| om (q,P:P") pS Eg) =a) (a,P'sp") 
(15.4.2) 

where in writing the first integral we have used again the reality of the po- 

tential and conveniently relabelled the integration variables q < q’. The first 


integral in (15.4.2) is the same as the first integral in (15.4.1). 
We set p® = E(p’) = E(p), and use the relation 


[E(p’) — E(q) —ie]~* —[E(p’) — E(q) +ie]~* = 2ni 6(E(p’)— E(q)) (15.4.3) 


to obtain by subtracting (15.4.1) from (15.4.2) 
T(p,p'; E(p’)) — T*(p', p; E(p')) 


277i m|p’ | 1 / ” / / / / 
=-—,_ | aa" T :E T -B 
eae! fase" T° (p's BD") Tip’. p's 1) 
(15.4.4) 
where we note the (—) sign multiplying the integral on the right-hand side. 


Finally using the definitions in (15.3.3)/(15.1.29), (15.3.9), (15.4.4) may be 
rewritten in terms of the scattering amplitude, 


‘ i[p’ ‘ 
flp.p)) — F(e',p) = SEF f ao” F*(\p'in",p) F(p'tn",p') (15.45) 
In particular for p’ = p, we have from the expression of the cross section 


in (15.3.7), the following equality referred to as the optical theorem 


= Ath 
P| 


relating the cross section to the imaginary part of the scattering amplitude 
in the forward direction. The choice of the minus sign (—1) factor (a phase) 


o Im f(p,p) (15.4.6) 
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in (15.3.3) adjusts the sign of Im f(p, p) to be positive matching consistently 
the positivity of o in (15.3.7), (15.4.6). 

To understand why o is related to the scattering amplitude in the forward 
direction, we will investigate the scattering process in this direction and see 
what quantum physics has to say about this. 

In the forward direction, i.e., for |x)| Sa, the incident component ®jy (7) 
of ~_(ax) in (15.1.11) cannot be neglected in comparison to the integral on 
the right-hand side of (15.1.11) as we have done in (15.2.24) for the case 
|x)| >> @, using the vanishing property of x(x),t). In the present case for 
|x\\| Sa, |x| > D, D large, (15.2.24) becomes simply replaced by 


m d’q_ is, 
‘: ie sillallx|-2(a)#]/n 
¥-(2) > Pinl2) — Sa | (Qmh)s © 


x F_(|qin,q;B(q)) ®in(q) (15.4.7) 


leading to the asymptotic equality 


m 


th? |x| 


|ab_(x, t)|? = |®in (x, t)|? Re{ M(x, t)} (15.4.8) 


where Re{-} denotes the real part, and 


3 25 
M(x,t) = / 24 Gillalix!-2@)41/" (qin, q; E(q)) bin(a) 8%, (x, 1) 


(27h)3 
(15.4.9) 

The second term on the right-hand side of (15.4.8) provides an interfer- 
ence term between the incident and the scattered components in the forward 
direction. 

Now we look at the intensity in the forward direction, as obtained from 
(15.4.8), on a screen, set parallel to the x-y plane, at a distance z = D, by 
integrating |2_ (x, t)|? over an area < 7a”. 

Accordingly, we are led to consider the integral 


/ . d?x), M(x, t) (15.4.10) 
x|| ma 


and, as in (15.2.18), (15.2.19), we consider the limits (p’ = |p’]) 


z>a>h/p! (15.4.11) 
and simultaneously,7? 
/ 
7 a2 /z > Qn. (15.4.12) 


For |x|\| <a, z= D >a, we have 


20 A similar method was used by van de Hulst (1949). See also Newton (1976). 
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x 
|x] ~ z+ oe (15.4.13) 


and we may effectively set n ~ no = (0,0,1) in F_ since 6 ~ 0 for D >> a. By 
making a change of the variables integrations qj, xj), Xi in (15.4.9), (15.4.10), 


(15.2.26), respectively to Q’, p, p’, with the latter two dimensionless, 


p= xp /zh, p’ = xi V/p//zh (15.4.14) 
Q’= gy) V zh/p! (15.4.15) 


the corresponding product measure changes as follows, 
h 
d?qy, d?x;) d?x)) > a @?Q! dp dp! (15.4.16) 


and the z factor in the latter cancels out the multiplicative 1/|x| factor in 
the second term on the right-hand side of (15.4.8) for z > a Z |x\|. 

From (A-15.2.9) and (15.2.12) (see also (15.2.20)), one may effectively re- 
place x(x), ¢) in (15.4.9) /(15.4.10) by 1/V7ra?. One may now repeat a similar 
analysis as the one carried out through (15.2.25)—(15.2.29). We also note by 
writing in the exponent in (15.4.9) (see also (15.2.10), (15.2.25), (15.2.26)) 


12 
‘bd ama Sree gel 
hh 2hz 


2 
ase +E (15.4.17) 


with |p| < a,/p’/zh, we are led to the evaluation of the simple Gaussian 
integral in (15.4.10) 


/ d’?p exp(ip?/2) ~ 27i (15.4.18) 


|p|<av/p'/zh 


which from (15.4.12), we have extended the limit of integration to infinity. 
All told, we have from (15.2.20), (15.2.10) and (15.4.8) 


> ra’ = Im f(p’, p’) 
(15.4.19) 
where we have used the fact that Re(i...) = —Im(...), the definitions in 
(15.3.3)/(15.1.29), and recall that p’ = (0,0, p’) = p’no. 
The result in (15.4.19) is remarkable. In the absence of a potential the 
expression within the square brackets would be just 7a”. In the presence of an 


/ dx) [Ox 2)? = 
|x|\|<a@ 
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interaction, the interference between the incident and scattered components 
of q_ (x,t), in the forward direction, reduces the intensity observed in that 
direction. The cross sectional area on the screen is thus reduced from a? 
by just the correct amount (see (15.4.6) with p’ = |p’| = |p|) to compensate 
for the probability loss when a particle is scattered into all angles. It is a 
consequence of the conservation of probability. The optical theorem will be 
also studied in the context of inelastic processes in §15.8. 

There are many authors associated with the optical theorem in physics, in 
general, such as Bohr, Peierls, Placzek (in quantum physics), and several oth- 
ers, dating back to Lord Rayleigh in his studies of the color and polarization 
of the sky.?4 

The first Born approximation in (15.3.12), (15.3.16) obviously violates the 
optical theorem since V(p — p’) for p = p’ is real. From (15.4.6),(15.3.7), 


Arh 
Ip| 


where p = |p’|n and thus the result embodied in the optical theorem relates, 
consistently, different orders in the potential (see Problem 15.2). 


a epip y= i: dl f(p.p’)? (15.4.20) 


15.4.2 Phase Shifts Analysis 


Consider the elastic scattering of a particle off a target particle initially 
at rest both of spin 0. Let p’, p denote the initial, final relative momenta, 
respectively, and yz the reduced mass. For the interaction between the particles 
given by a potential energy which depends solely on the magnitude of the 
relative position vectors between the particles, we have a rotational invariant 
theory. 

For spin 0 particles, the scattering matrix (15.3.10)/(15.3.11) may be 
expanded in terms of the relative angular momentum states, in the center of 
mass frame, as follows (|p| = |p’| = p) 


(p|T|p’) = >> 0m, Yom (B) Yeim: (B’) (p,e,m|T |p, @,m') — (15.4.21) 
Lm eam 


where we have used the expansion of the states (p| in (5.10.95) in terms of 
angular momentum states. 

T must be diagonal in @,m and also be independent of m due to the 
independence of the theory on the orientation of the system. Accordingly, we 
write 


[pe(p) — 1] 87h? 


(p, £,m|T |p", £,m’) = pAl 
i Lp 


bypbvies (15.4.2) 


where the numerical coefficient was chosen for convenience. 


21 For the fascinating history of the optical theorem, in general, see Newton (1976). 
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By using the addition theorem of the spherical harmonics in (5.3.67), 
(15.4.22) and (15.4.21) give 


(p|T|p’) = ante S (2041) = P,(cos 6) (15.4.23) 


where 6 is the angle between the momenta, i.e., p’ + p = cos 0. 
From (15.3.11), we then have the following expansion for the scattering 
amplitude f(p,p’) = f(p, 4): 


Co 


p.0 7m 2+ 1) — 1] P;(cos 6) (15.4.24) 
=0 


where the coefficient [p¢(p) — 1] is to be determined. For V = 0, pg — 1. 
By using the orthogonality relation of the Legendre polynomials 


a 2 
| sin 0 dé Pe(cos 0) Py (cos 0) = WaT Over (15.4.25) 


we obtain from (15.3.7)/(15.4.20), i-e., from the optical theorem, 


wh? zi 


mh? (20+ 1) |pe —1]° = 


sues im (4 mt) (15.4.26) 
£=0 l= i 


suppressing the p-dependence of pg for simplicity of the notation. Using 
Im [(p¢ — 1)/i] = 1 — Re pe, we may infer that 


loc — 1]? =2 —2Re py (15.4.27) 
or that |p¢|? = 1. That is, pe is a phase factor which we may write as 
pe = ee (15.4.28) 


with 6, real, and is referred to as the phase shift, which depends on p and &. 
The scattering amplitude in (15.4.24) takes the form of a sum of partial 
scattering amplitudes 


Co 


ae 2¢ + 1) (e7!% — 1) Pe(cos 6) (15.4.29) 
=0 


To obtain information on the phase shift 6, we note from (15.4.27), 
in the process, that with ®(q) = (27h)?d3(q — p’),v_(x) = 
w_(x, p) exp(iE(p)t/h), we have, for |x| — ov, 


eilxlip/|/n 


Gop) — e*P" + F(p,8) —E 


(15.4.30) 
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where we have used the definitions (15.3.3)/(15.3.9) and (15.1.21) (see also 
(15.4.7)). 

Taking the advantage of the expansion of expip+x/h in terms?? of the 
Legendre polynomials P,(cos@) and the spherical Bessel functions j¢(pr/h) 
n (5.10.36), we have from (15.4.29),(15.4.30), with |x| =r, 


he £45 . (pr bn 
w_(x,p) — or 2a (2e+ 1)i*e’* cos d¢ sin (e a =) 


£=0 
£ 
+tan dp cos (F ~ =) P2(cos 0) (15.4.31) 
where we have used the asymptotic behavior 
_ pr h . (pr en 
3 15.4.32 
iE) —— Asin (4 =) (15.4.32) 


for the spherical Bessel function. 
For a spherically symmetric potential V(r), we may write 


Co 


=u (20 +1) i® el cos dy ge(r ") (cos) (15.4.33) 
r 
=0 


and from (15.4.31), ge(r) satisfies the boundary condition, 


gel) sin (4 +) + tan dy cos (e = =) (15.4.34) 


More generally, for all r, it is easy to show that g(r) satisfies the one 
dimensional differential equation 


qd? &(é+1) ap p 
rs 72 Fa V(r) 4 F | ate) = (15.4.35) 


as obtained from the time-independent Schrédinger equation. 
Two real solutions of (15.4.35) for V(r) = 0 are given by 


ae(pr/h) = je (F), be (E) =-2 ne (FE) (15.4.36) 


where ne(x) is the so-closed spherical Neumann function,?* behaving asymp- 
totically for r — oo, as 


ae(pr/h) —— sin & >) (15.4.37) 


2 Note that for ®in(q) = (2rh)?5°(q—p’), x points in the same direction as p (see 
below (15.2.26)). 
3 Cf. Arfken and Weber (1995), p. 678, p. 682. 
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be (pr/h) ——+ cos (F >) (15.4.38) 


Tr 


Upon defining the Green function, associated with (15.4.35), 


Golr,r’) = Jaw (F) be (F) ew! r) tag (=) be (=) O(r ‘| 


(15.4.39) 
satisfying (see Problem 15.4), 
d= &é+1 h? 
oe uo yi | Ge(r,r’) = d(r —1") (15.4.40) 


as verified by using, in the process that, dO(r’ — r)/dr = —d(r — r’), dO(r — 
r’)/dr = 6(r — 7’), and the property 


(aE) 6 CR) (E) (ae (E)) =e Os4an 


the solution of (15.4.35) may be given in the form of an integral equation 


gelr) = ag (=) - : an Gee UGd) ae) (15.4.42) 
where 5 
u(r) = pV ("): (15.4.43) 


Form the boundary condition (15.4.34), we may then use (15.4.42), and 
the asymptotic behavior of Ge(r,r’) as obtained from (15.4.36)—(15.4.38), for 
r — o, to get 


h Co 
tan dg = | dr ae (=) u(r) ge(r). (15.4.44) 
P Jo A 
In the Born approximation, we have from (15.4.42) 
giving 


tan de & 6g & = is r?dr (ic cane u(r) (15.4.45) 


for small 6, assuming, of course, the convergence of the latter integral. 


For small r, 
pr Bey r 
ae (7) — (©) / (2+ 1)!! (15.4.46) 


showing that ge(r) — 0 for r — 0. From (15.4.45) one may also formally infer 
that 
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5p x o((f) (15.4.47) 


for p > 0 under suitable conditions imposed on u(r) as, for example, will be 
the case if u(r) is of finite range. 

A Born series may be formally developed for ge(r) from (15.4.42) given 
by 


ge(r) = ae (=) + S- is dr‘. ef dr}, Ge(r,r,) u(r}) 


n>1 


f 
x Golr’,,r) u(r) ...Gel(r,_1,7%,) ulr’,) ae (2) (15.4.48) 


and is real. 

For a given potential V(r), information may be obtained on the sign 
of d¢ as follows. We scale the potential energy V(r) by two parameters 
V(r) = AiV(r),V(r) — A2V(r). The corresponding solutions ge(r) and 
phase shifts d¢ will be denoted by ge(Ai, 7) and 6¢(A;),7 = 1,2. Upon multi- 
plying the differential equation satisfied by g¢(A1,1r) from the left by ge(A2,r), 
and similarly upon multiplying the differential equation satisfied by ge(A2,r) 
from the left by ge(A1, 1°) and subtracting the latter equation from the former, 
we obtain 


d d 
ge(A2,7) 5 9e( Ars) — ge(Ar,7) 5 9e( A257) 
=(\i- ») [ dr'u(r’)ge(Ar, 7’) ge(A2, 7’). (15.4.49) 
0 
Taking the limit r — oo, and using (15.4.34), this gives 


tan 6¢(A1) — tan dg(A2) = mee - d») [ dr'u(r’) ge(A1, 7) ge(A2, 1") 
Pp 0 


(15.4.50) 
Setting Ay = A+ AA, A2 = A, for AA — 0, we get 

d h fe 2 

— (A) = -= dr u(r) (ge(A, 7) cos 6e(A)) (15.4.51) 
dX PJo 
which upon integrating over \ from 0 to 1, leads to 

h fe i 2 

be = ee dr u(r) dA (ge(A, 7) cos 6e(A)) (15.4.52) 
0 0 


showing a correlation between the sign of dz and the potential u(r). The phase 
shift is positive for u(r) < 0, and negative for u(r) > 0, in general. 
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Intuitively, for a potential energy of finite range, say, a, a particle imping- 
ing on a target with “impact parameter” ~ Al/p, with @ such that hé/p > a 
is expected to be unimportant in scattering and one may formally cut-off the 
sum in (15.4.29) at @ = [pa/h], where [2] is largest positive integer < x. In 
particular, for low energy, one may take 


f(p,9) ~ Ph (ito —1) = fo(p, 8) (15.4.53) 


2ip 
for the scattering amplitude. 

The phase shift analysis carried out at the end of this section will be 
generalized in $15.8 to include inelastic processes and see how the counterpart 
of formula (15.4.29) is modified in such cases. For some applications see the 
problems section. 


15.5 Coulomb Scattering 


This section deals with a systematic treatment of Coulomb scattering. 
We first extend the result of Problem 15.3 to the Coulomb interaction to 
extract the asymptotic “free” propagator corresponding to the propagation 
of a particle at large distances from the scattering center. We then show that 
the asymptotic time development of a charged particle is not given by the 
free Hamiltonian and, in turn, obtain the corresponding modified asymptotic 
time evolution operator. The modifications arise because of the long range 
nature of the Coulomb potential and a particle feels its presence no matter 
how far it is from the scattering center. This is followed by investigating the 
behavior of the full Coulomb Green function G+ near the energy shell. Finally 
the Coulomb scattering amplitude is obtained from a time-dependent setting. 
The Coulomb scattering of two identical particles will be also discussed. 


15.5.1 Asymptotically “Free’ Coulomb Green Functions 


We extend the result of Problem 15.3 to study the nature of the integrals 


/ dp e'P*/" Gi(p,p’;p°) for |x| > 00 (15.5.1) 


for p° near E(p’) = p’*/2y for a two-particle system of reduced mass p, for 
the full Coulomb Green function in (9.9.54). 

By changing the variable of integration in this latter equation from € to 
z, where 


2= GLB) Ew) (15.5.2) 


we have (see (9.9.54)) 
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Ap® p? — E(p’) tie\* 
Ga PON Ss 7 H 
+(p,p';p’) 2 0 +ie Ap 


for p? ~ E(p’), 
Ap /h _ A 
2up? 
with V(x) denoting a Coulomb potential. The denominator in the integrand 
in (15.5.3) in the square brackets may be rewritten as 


y= (15.5.4) 


12 


(z+1) (» 2) ie FT | (15.5.5) 


By changing the integration variable in (15.5.1) from p to Q = p— 
p’z/(z+ 1), and using the integral 


eiQ-x/h ,o(2+1 i|p’||x 
[aa a — ; ada ra ) exp (27) (15.5.6) 
> Gate Tie 


we obtain for the integral in (15.5.1), for p° ~ E(p’), the expression 


1 ain ei -x/h Ap® p _ E(p’) Bae tiy 
p \ 8p° p? — E(p’) tie 4p 


a Peer | Zz . 
dzae" griaeyet)) 15.5.7 
x | ea E +) e ( ) 
where , 
ax = 5 [lp'llx|— +x (15.5.8) 


and we note that we have multiplied and divided the integral in (15.5.1) by 
exp (i p’+ x/h). 
We are interested in the limit a; — oo, where we note the positivity of 


a+, in general. To this end, we change the variable of integration in (15.5.7) 
from z to p 


een (15.5.9) 
p 


Then it readily follows from (15.5.7) that for az — co, 


[ee eip:x/h Gi (p, p’;p’) aay eip’-x/h anit In( 1e'lls}—p") 8° (p’) 
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where the x-independent part 


1 ng 2p” ry /2 eons 

Bp) aigeT &P [rim (AE) err ein 

(15.5.11) 
with (1 iy) denoting the gamma function, defines, for p° ~ E(p), the as- 
ymptotic “free” propagators. We note the change in the nature of the singu- 
larity at p° ~ E(p’) from a simple pole with a non-trivial dependence on the 
Coulomb coupling 4. Also the exponential factor exp (i p’ + x/h) is changed 
from the asymptotically free one, corresponding to a short range potential, to 


one with a distortion given by the additional x-dependent exponential factor 
n (15.5.10). 


GY (p’) = 


15.5.2 Asymptotic Time Development of a Charged Particle State 


Consider the integral 


(oe) —iax v—-1 
/ tha @(-4a) an ; errs (15.5.12) 


for a real, Re 3 > 0, Rev > 0 and O(+a) is the step function. In reference to 
the asymptotic propagator in (15.5.11), let (with p’ — p) 


+i (tie — E(p)) = 8 (15.5.13) 
lFiy=v (15.5.14) 
t/h=a (15.5.15) 


to infer from (15.5.12) that for « — +0, 


co d 0 ’ 
ih) i: CP -iv"t/h GC (p) 


— 
Lu 


2rh * 


2p’ |t 
= O(4t Jew —; |B) )t + (sgn oe In ( e ‘| (15.5.16) 


where we have cancelled (1 = iy) in (15.5.11) with the one in (15.5.12) and 
used the property 


(Fi)F = (eFF) TT = e718? (15.5.17) 


to cancel out the exp(7y/2) factor as well. 

From (15.5.16) we see that the asymptotic “free” time evolution oper- 
ator for t — too, is modified, for the Coulomb case, from the free one 
exp(—iHot/h) to exp(—iHoc(t)/h), where4 


4 This expression was discovered by Dollard (1964) by a completely different ap- 
proach. 
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4Ho|t 
Hoo(t) = Hot + (sgn t) “ in( a ) (15.5.18) 


and reflects the fact that a charged particle always feels the presence of the 
Coulomb force due to its long range effect. 

An intuitive argument?° leading to (15.5.16)/(15.5.18) nay be readily 
given. For t — too, the momentum and the position of a particle are re- 
lated classically by |p| ~ m|x|/|¢|. Accordingly, replacing |x| by |¢||p|/m in 
V(x) and integrating over t, to generate the asymptotic “free” time evolution 
operator we obtain, in addition to tp?/2m, for a particle of mass m, the cor- 
rection (Aji/|p|) In |¢| which has the correct time dependence as in (15.5.18) 
with |p| > /2mHp. Incidentally, for a potential which vanishes faster than 
the Coulomb potential, the above substitution |x| — |t||p|/m in the potential 
gives a vanishing contribution to [ V(|¢||p|/m) dé for |t| > co and hence no 
modification to the term Hot in (15.5.18) arises. 


15.5.3 The Full Green Function G,_ Near the Energy Shell 


We make a change of the variable of the integration variable € in the 
Coulomb Green function G.(p, p’;p°), relevant to the problem at hand, in 
(9.9.54) to z = p4z, where pz is defined in (9.9.55). Near the energy shell 
p? ~ E(p) = E(p’), ie., for py — oo, the expression for G4(p, p’;p°) in 
(9.9.54) takes the simple form 


G.(p p’:p’) = i LM (pay iM dz yin yA 
=e 7 V 8p? ((p—p!)?]? Jo dz (z= 1) 


= Le (p4)—7 = . gtiy 
= 3s a, dep (8.5.19) 


where in the last equality we have integrated by parts. 
We use the integral 


[oe oth 
0 = 


= TY ie) ‘ 2 
ee (Foy) ea +)) (15.5.20) 


and the definitions of p in (9.9.55), 7 in (15.5.4), to obtain near the energy 
shell 


r 1 . AL, [(p—p’)? 
Bx O 0C I 
eres onere a (p—p’)? exp ( ‘hip| »| 4p? 


25 Dollard and Vello (1966). 
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M™(1—iy) | noc 
——$—— 15.5.21 
where G9“ (p) is the asymptotic “free” propagator in (15.5.11). 

Upon comparison of (15.5.21) with (15.3.6), with G§.(p) now replaced 
by GS°(p) for the Coulomb case, one is tempted to identify the Coulomb 
scattering amplitude by 


yn 2A (ie, (p= P)?)]\ Pay) 
fc(p,P’) = pope p(Faein | pe }) ich) (15.5.22) 


where I*(1 — iy)/T'(1 — iy) is a phase factor. That the expression for the 
Coulomb scattering problem is indeed given by (15.5.22) will be now shown 
by a direct time-dependent treatment via the evolution operators. 

For a decomposition of fc(p, p’) = fc(p, 9) into partial scattering ampli- 
tudes see Problem 15.10. 


15.5.4 The Scattering Amplitude via Evolution Operators 


With the asymptotic “free’ time evolution operator given by 
exp (—iHoc(t)/h), with Hoc(t) defined in (15.5.18), the transition amplitude 
for the scattering off a Coulomb potential with initial momentum p’ to a final 
momentum p is given by 


A(p, p’) = (ple iHoa(t)/h e itH/h eiTH/h © —iHoc(T)/h ip’) (15.5.23) 


with T + —oo,t — +00. The operator exp (it H/h) exp (—iHoc(r)/h) for 
T — —oo, for example, replaces the conventional Mgller wave operator Q_ in 
(15.1.47) (see also Problem 15.17). 

Using the representation 


loo) en i(t—7)p°/h : 
dp® ———__-. = —2riei(¢-7) H/h 15.5.24 
/ p Snag 7 ae Tie (15.5.24) 


for t > 7, and the definition (see (9.7.1), (9.9.4)) 


1 


pause (20h)? Gy(p,p';p) (15.5.25) 


(P| 
we may rewrite the amplitude A(p, p’) as 
3 oo 
A(p, p') = — (arh)"h / dp? enilp”-B(P)t/h gilp’—B(p')Ir/n 
os —oo 


iu et) | Fe (a) 0 
x exp | —— In ex In Gi(p,p; 
» pr (7a) few [an (7a) | eto. we 


(15.5.26) 
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for t > +00,T — —oo. 
The integrals in (15.5.12) alow us to use the representation (¢ — 0) 


|t| iy W1+iy) _ 4/2 - F en ig’t/h 
— = ————e” dq. ———— 15.5.2 
Ot) ( h Qn i ee f (q° + ie) +7 ee) 


Replacing this in (15.2.26), thus introducing an integral over q° and p®, 
multiplying and dividing the resulting integrand by [p° — E(p) + ie] cia and 


making a change of the variable of integration p° to Q° = p® + q°, we have 
oo oo o—1(Q°-E(p))t/h 
e 
A(p, p’) = / dQ i dq° aes 
EE fi? fe Gap) i ie 
- 1 
(q° + ie) #9 


K(p,p’,7;Q° — q°) (15.5.28) 


where we have set 


2 : G 2 
(2rh)?h | DA+ iy) (01/2 pile” E(p')Ir/h (exp Dey (=)) 
: On ppl 


id 2 (2), = 
x (exp pen (POD) ye — 0p)+ ic] G-0W.v'") 


= K(p,p',7:0"). (15.5.29) 


For t > +00 (15.5.28) reduces formally to 


A(p.p!) = —27i | * dQ? 6(Q° - E(p)) 


foe} a2 


li d : : K / ‘ O 
eo Ga nae pos  Rne —e) 
(15.5.30) 


by formally making a change of variable of integration from q®° to « = 
q°/(Q°— E(p)).2 In the z-integrand in (15.5.30), as the product of two 
factors, we first replace a by zero in K, and the first factor for a 20, e¢ > +0 
is independent of a. Using the integral 


had 1 20 1 
| .q-arpenGrpT = Thi) Pie) 


for e — 0, we get 


?6 Similar limits appear in the work of Papanicolaou (1976). 
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an K(p,p’,7;Q°) 

2" [3090 (0° _ B »P 7; 15.5.32 

Ap) =F fag? (Q"- Be) pe (15.5.32) 

Repeating the same procedure for the T — —oo limit, by using, in the 
process, the representation (see (15.5.12)) 


ay A): ato fae See 


in the expression for K(p, p’,7; Q°) (with p° — Q°), multiplying and dividing 
the resulting (Q°, qg°)-integrand in (15.5.32) by [Q°— E(p’)+ie]+~7 we obtain 
for T — —oo, 


(27h)? 


A(p, p’) = — fc(p, Pp’) 6(E(p) — E(p’)) (15.5.34) 


where in integrating over Q°, with the energy shell constraints provided by 
5(Q° — E(p))6(Q° — E(p’)), we have used (15.5.21), (15.5.22). 

To see that fc(p, p’) actually represents the scattering amplitude for the 
Coulomb potential, we note that the transition probability for the process is 
given from (15.5.34) to be 


1 4 
P(p,p’) = = |fo(p, p’)|? 


T/2 Q2rh 


T/2 F 
. (/ dt -i{e(p)-E( i) 6(E(p)— E(p’))  (15.5.35) 


where we have used the integral representation of the delta distribution for 
T — oo. Equation (15.5.35) is formally interpreted as giving for the transition 
probability per unit time the expression 


P(p,p’) _ (2ah)* 2 0(E(p) — E(p’)) 


=, / 
T pe |fco(p,p’)| oth 


(15.5.36) 


Upon integrating (15.5.36) over |p|? d|p|/(27h)?, for a fixed solid angle 
about a unit vector n, we obtain 


iQ lp|?d|p| P(p, p’) 
> Qh  T 


/ 
= a fc(Ip’|n, p’)|? (15.5.37) 


Finally we note that for a scattering process with an initial sharp momen- 
tum p’ with normalization of the state carried out within a unit volume of 
space, p’/ represents the probability that an incident particle crosses a unit 
cross sectional area, perpendicular to p’, per unit time, i.e., it denotes the 
flux. Thus we obtain the differential cross section by dividing the right-hand 
side of (15.5.37) by p’/p, 
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do 


qq = le(p.P')!’ (15.5.38) 


Pp = |p’|n, where fo(p, p’) is given in (15.5.22). 

Defining the angle 6 by p+ p’ = |p’|? cos, in (15.5.22), (15.5.38) gives 
do | Tae 

dQ 4|p’|4sin*(6/2)’ 


(15.5.39) 


It is remarkable that the quantum mechanical answer in (15.5.39) for 
Coulomb scattering agrees with the classical one.?’ Due to the long range 
nature of the potential, the cross section does not exist since it is not in- 
tegrable at @ ~ 0. In practice, the Coulomb potential, however, is usually 
screened at large distances and this divergence problem does not arise. 

For the scattering particles of two indistinguishable particles, the solution 
of amplitude fo(p, p’) = fc(@) in (15.5.22) allows us to determine the explicit 
expression of the interference term (the exchange effect) of the differential 
cross section discussed at the end of §15.3. 

In terms of the scattering angle 6 in the center of mass, (15.5.22) may be 
rewritten as 


2 2 * | 
fo(@) = at exp ( i— infsin? 6/2) Wo (15.5.40) 


for particles of charge te, and we have used the fact that = m/2. Also the 
scattering angle of the incident particle is given by J = 6/2, d(cos @)/dcos 0 = 
4cos¥, and v denotes the speed of the incident particle. 

According to the symmetrization to be carried out based on the statis- 
tics of the particles, we have from the discussion at the end of 815.3 the 
following result for the differential cross section for the scattering of two in- 
distinguishable charged particles of charges +e and of spin 0 or spin 1/2, in 
the laboratory frame with one of the particles initially at rest,?° 


do _4etcos¥f 1 1 cos[(e?/hv) In tan? VJ] 
dQ|ran mvt [ sin? — cost d sin? 0 cos? 0 


(15.5.41) 

Here ¢ = 1 for spin 0 particles, ¢ = —1 for spin 1/2 particles polarized 
in the same direction, « = 0 for spin 1/2 polarized in opposite directions, 
€ = —1/2 for completely unpolarized spin 1/2 particles. 

For fi > 0, the rapid oscillations of the last term in (15.5.41) gives rise 
to a vanishing contribution when averaged over a small angular breadth thus 
recovering the classical result. This same conclusion of the vanishing of the 
exchange term in (15.5.41) also follows for small scattering angles and for 


27 Tn quantum electrodynamics, the differential cross section is modified. 
?8 Such scattering processes are usually called Mott scatterings for the pioneering 
work of N. F. Mott on the Coulomb scattering of identical charged particles. 
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small speeds. As discussed in §15.3, the doubling of the differential cross sec- 
tion, over the classical one, for spin 0 particles and its vanishing for polarized 
spin 1/2 particles, in the same direction, at 0 = 7/4 (i.e., 0 = 7/2 in the 
center of mass system) should be noted. 


15.6 Functional Treatment of Scattering Theory 


The functional treatment of the transformation function (xt|p’t’) based 
on the quantum dynamical principle studied in §11.2 will be used to write 
down an explicit expression for the scattering amplitude involving functional 
differentiations with respect to external sources.?? The functional approach 
turns to be quite useful for studying, in particular, scattering at small de- 
flection angles at high energies. It is also useful for the determination of the 
asymptotic “free” propagators for long range interactions such as the Coulomb 
one in a straightforward manner without using the explicit solution of full 
Green function. These applications of the general functional treatment pro- 
vided here will be given in the next section. 

For recasting our functional solution in (15.6.6), (15.6.20) derived below 
into a path integral form see Problem 15.13.79 

From the analysis carried out in §11.2, the transformation function 
(xt|p’t’), for a particle in a given potential V(x) is explicitly given form 
(11.2.22), (11.2.4), (11.2.27), to be 


(xt|p't’) = exp (-7 | dr V (-inss)) cet|p'ty| (15.6.1) 


where (E(p) = p?/2y, with y denoting a reduced mass) 


(xt|p't’) = exp [x -p’ — E(p’)(t — t)| 


x exp (a fe *dr![t — 5 ]F (0) -F(7)) (15.6.2) 


T= Max(r, 7’) (15.6.3) 


and F(r) is an external (vector) source coupled linearly to x in the Hamil- 
tonian 


H = —+4+V(x) —x-F(r) (15.6.4) 
24 


2° See also Manoukian (1985, 1987a). 
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By using the identity in (11.2.28) and the translational operation property 
of the functional differential operator 


exp [ow ix Pe )- aa (15.6.5) 


we may rewrite (15.6.1), (15.6.2) as 


(xt|p’t’) = exp t x-p’ — E(p’)(t—#’)] 


| 


xen (Sy i af ee lary RG) 
x exp (- if arv(x- Pen +R ))) 


in analogy to (11.2.31). An interesting application of (15.6.6) to scattering at 
small deflection angles will be given in the next section. 

Since in (15.6.6), we finally set F = 0, the theory becomes translational 
invariant in time and (xt|p’t’) is a function of t— 1’. 

For t >t, 


(15.6.6) 
F=0 


(xt|x't’) = G4 (xt,x’t’) (15.6.7) 
with G(xt,x’t’/) = 0 for t < t’ (see (9.7.13)) and 


(xt|p't’) = G4 (xt, p’t’) 
= / d3x/eiP*'/ G, (xt, x’t’) (15.6.8) 


From the Fourier transform of G4(xt,x’t’) in (15.1.32), (15.1.35), and 
(15.6.8), we then have 


eS Ta, 1h 
G,(p,p';p°) = — iE AT ell? He)T/h ex e P/F (xT | p'0) 


h (2rh)3 Jo oo 


for ¢ — +0, with the ie prescription involving the boundary condition in 
(9.7.12). 
Upon writing (T =t— 1’) 


(xT | p’0) = exp (; x-+p’— e()T)) K(x,p’;T) (15.6.10) 


rl 


thus defining K(x, p’;T) from (15.6.6) to be 


Keep) ae (5 far f arlt—slaey ae) 


840 15 Quantum Scattering 


: t / 
x exp (-7/ dr Vix —P(t—7) 4+ F(7))) (15.6.1) 
hy M F=0 
We then obtain 
1) 2 _ 1 [™ gg gile”- 20’) +icla/n 
Gi(p, PsP) com | dae 
x [etx e-(P—P)/P I(x, p's a) (15.6.12) 


where a plays the role of time and this notation for it is introduced quite 
often in the literature.°° 

In the integrand in (15.6.12), we recognize [p° — E(p’) + ie] as the inverse 
of the free Green function in the energy-momentum description. 

To use (15.6.12) in deriving an expression for the scattering amplitude 
f(p,p’), we note from (15.3.11), (15.3.9), (15.1.29), (15.1.36) 


f(p,p’) = eT evr" V(p—p")Gi(p", p's p®) [p° — E(p’)] 


p°=E(p’) 
(15.6.13) 
This suggests to multiply G(p, p’;p°) in (15.6.12) by [p° — E(p’)] thus 
giving 


1 ca 0 : oO , iz 
G+, PP”) — BP) = — Gaps I “ & eas ") 


x [ex e7(P-P')/h K(x, p's a) 
(15.6.14) 


where we have used the fact that 0/0a of the first exponential generates the 
factor i[p° — E(p’)|/h in the integrand (for « — +0). 
Now from (15.6.10), and the fact that (x|p’) = expix- p’/h, we have 


K(x, p50) =1 (15.6.15) 
We now consider the cases for which 


lim | d°x e*-P)/2 K(x, p’; a) (15.6.16) 


a—oco 


exits. This, in particular, implies that 


lim eve f atx e *(P-P/NK (x, p/s a) = 0. (15.6.17) 


a—oo 


3° This is especially the case in field theory. 
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We may then integrate (15.6.14) over a by parts to obtain on the energy 
shell p° = E(p’), « — +0, 


(27h)*G4(p, p's p°)[p° — E(p’)] 


p°=E(p’) 


= —[(27h)?63(p — p’) af aa f atx cee PIS Kx pl a)| 


(15.6.18) 
or 
G+(p, psp") [p° — E(p’)] 
p°=E(p') 
1 , 
= im. ape | x e(P—P)/h K(x, p’; a) (15.6.19) 


From this the scattering amplitude f(p, p’) in (15.6.13) may be then obtained 
from the expression for K(x, p’; a) to be 


lim | d3x e~*(P-P9/" V(x) K(x,p';a) — (15.6.20) 


Oth? asoo 


f(p,p’) = 


where K(x, p’;t — t’) is given in (15.6.11), and we have integrated over 
p” in (15.6.13). The presence of the K(x,p’;q) factor in the integrand 
in (15.6.20) gives an obvious modification to the Born approximation in 
(15.3.11)/(15.3.12) in the exact theory. 

In case that the limit in (15.6.16) does not exist, as we will encounter for 
Coulomb scattering, (15.6.14) cannot be integrated parts. To deal with such 
situations, one may study the behavior of G',(p, p’; p°) near the energy shell 
directly from (15.6.12). To this end, define the integration variable 


f= el ° — E(p’)| (15.6.21) 
to obtain 
1, 0\f,0 Ny) 1 iz(1+ie) 
Gi(p,psp )[p — E(p’)| (nhs | dze 
x Jos BPP IEG (x' Se! (15.6.22) 
p? — E(p’) 


for p° — E(p’) > 0, e > +0. 
Applications of (15.6.20) and (15.6.22) will be given in the next section. 
The scattering amplitude for the Coulomb interaction was already de- 
rived and explicitly given in $15.5. A functional treatment of scattering am- 
plitudes for long range interactions, in general, including the Coulomb one 
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may be given by a modification of transformation functions, such as the one 
n (15.6.1), based on an extension of the quantum dynamical principle, which 
takes into account the unescapable fact that asymptotically in time (see, e.g., 
(15.5.18)) a particle still feels the presence of the interaction and the corre- 
sponding asymptotic “free” states depend on the coupling parameter(s) of the 
interaction.?! 

For a path integral treatment of scattering theory see Problem 15.13, and 
the reference below.*! 


15.7 Scattering at Small Deflection Angles at High 
Energies: Eikonal Approximation 


As mentioned in the previous section, the functional approach turns out 
to be quite suited for studying scattering at small deflection angles at high 
energies. Such an analysis will be carried and the resulting approximate ex- 
pression for the scattering amplitude will be obtained and is shown to satisfy 
the optical theorem at high energies as an important consistency check of the 
conservation of probability. This approximation is referred to as the eikonal 
approximation.°? An application will be also carried out to determine the 
asymptotic “free” Green function for the Coulomb potential without the use 
of the explicit solution of the Coulomb Green function. 


15.7.1 Eikonal Approximation 


We have seen in the previous section that the scattering amplitude f(p, p’) 
for a scattering process may be rewritten as in (15.6.20), where the factor 
K(x, p’;q) in the integrand, defined in (15.6.11) involving functional differ- 
entiations, may be conveniently re-expressed as 


K(x, p':T) )=en (5, i wef Ee) 5a 5a) 


x exp (- rede )) | a 


(15.7.1) 


Here as in (11.2.31), we have made the substitution F(t) ~ Vh Q(r). The 
latter, in the argument of V takes into account the deviation of the dynamics 


from a straight line [x — a _ 7]. The effect of this deviation is obtained 


3! See Manoukian (1985). 
82 The word eikonal (from Greek) means image and this term is borrowed from 
optics where some similar formulae as in here appear. 
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by the functional differentiations provided by the exponential factor involving 
the functional differential operators (6/5Q(7)) - (6/5Q(7’)). 

For scattering at small deflection angles, we may as a first approximation, 
neglect the contribution of the functional differential operators arising in 
(15.7.1) and set Q = 0. Corrections to this approximation may be then 
systematically taken into account by carrying out the functional differential 
operations spelled out in (15.7.1). 

Accordingly from (15.6.20), the scattering amplitude takes the simple 
form 


f(p,P') =~ 5 7 


x exp (-; i dp v(x 2 = 3)) (15.7.2) 


where we have taken the limit a — oo in (15.6.20) and introduced a new 
integration variable (. 

We note that (15.7.2) modifies the Born approximation in (15.3.12) / 
(15.3.11) by the presence of a phase factor in the integrand, depending on 
the potential, accumulated during the scattering process. 

It is convenient to write the integration variable x as 


ex ex (P-P ‘)/R V(x) 


p’ 
x=b+E— (15.7.3) 
m 
where b is orthogonal to p’, —oo < € < oo, and 


/ 
dx = 2 dé d®b. (15.7.4) 
Lb 


Also we note that 


(p—p’)-p'= 5(P p’)* 4 5 (p p”) 
= —5(p —p’)” (15.7.5) 


since p? = p”, and hence for small deflection angles, we may replace x + (p— 
p’) by b- (p—p’) in the first exponential in (15.7.2). By making a change of 
the variable integration 3 to 7 = G — €, we may then rewrite (15.7.2) as 


f(p,p’) = -1f = ag fee v(b+ee “Je ~ib-(p—p’)/h 


ee lee} / 
x exp eas dn v(b- ey) 
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ilp’| [~ Ff oi. ibe =piyE 
= dé | ab ei -P 
27h Joo é 5 


a i re p’ 
x pe eas dn v(p- n)) : (15.7.6) 


Carrying out the €-integration, this gives 


» _ i|p'] d2b en ib:(P-P’)/h 
IPP) = 5 e 


x E —exp (= iS dp vb.»)) | (15.7.7) 


where we have finally introduced the integration variable p. The expression 
in (15.7.7) is usually referred to as the impact parameter representation of 
the scattering amplitude, where |b| plays the role of the impact parameter. 
If for some R, |b| > R, V(b, p) = 0, for all p, then the b-integral in (15.7.7) 
will be restricted for |b| < R. 

For V(x), depending on the distance |x| = r between two particles, the 
angular part of the integral in (15.7.7) may be explicitly carried out giving 


: / Co 2 / 0 
fp.) = BEL pa Jy (AP tosin 3) [1 —expix(6,t'D] (05.7.8) 
h Jo A 2 
where Jo is the zeroth order Bessel function and 


x@IP) =- zea / 7 dp V(VP +p?) (15.7.9) 


is called the eikonal phase function. The integrability of Vv (VP + P?) on p 


restricts the class of potentials for the validity of the treatment. 

It is remarkable that the eikonal approximation in (15.7.8) satisfies the 
optical theorem at high energies. To this end, in the forward direction, i.e., 
for 6 =0, Jo(0) = 1, 


Anh 90 
=] Im f(p’,p’) =4n [ bdb[1 — cos x (b, |p’|)] . (15.7.10) 
0 


On the other hand, 


0 
dQ = 27 sin 6d0 = 87 sin : a(sin 5) (15.7.11) 


for the element of the solid angle, and with « = 2 |p’|sin(0/2)/h, we have 


feat)? =2n [bar [~ b! db! [1 — eC IP'1)] [a — efx ("IP") 
0 0 
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2|[p'|/n 
at KK Jo(Kb) Jo(Kb’). (15.7.12) 
0 
At high energies, i.e., for |p’| large, 


5(b — B') 


15.7.1 
; (15.7.13) 


2|[p'|/n 
/ KK Jo(Kb) Jo(Kb’) — 
0 
from which 
factory? = anf bdb[1 — cos x (0, \p’|)] . (15.7.14) 
0 


This, unlike the Born approximation expression, satisfies the equality in 
(15.4.20) of the optical theorem as a consistency check. The Born approxi- 
mation, however, is not restricted to small deflection scattering angles. 

For an eikonal approximation for the Coulomb potential see Prob- 
lem 15.15. 


15.7.2 Determination of Asymptotic “Free” Green Function of the 
Coulomb Interaction 


The starting point for the determination of the asymptotic “free” propa- 
gators is provided by examining (15.6.22) near the energy shell. 
Upon substituting 


K (x,p'; 2h/(p° — E(p'))) 


5 pzh/(@®-E(P')) p’ 
~ exp; | dg v(x _ es) (15.7.15) 
0 


in (15.6.22), we obtain 


[eve eP/na (p pp") ~ oe a dz ei tie) 
van [pP° — E(p’) + ie] Jo 


5 pzh/(P°- EP’) p’ 
x exp—7 | dg v(x - es) (15.7.16) 
h Jo H 


for p° — E(p’) > 0, e > +0. 
For the Coulomb potential V(x) = A/ |x|, 


zh/(p°-E(p’)) / \ 2\p'| zh 
| as v(x Fa) = Hin ( , bale ) 
0 bb Ip’| L(p° — E(p)) |x| (1 — cos 4) 


(15.7.17) 
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for (p° — E(p’)) ~ 0. Hence 


5 pzh/@-E(P')) p’ 
0 


1 2p2zh 
~ ——- exp intn ( a ) (15.7.18) 
[p° — E(p’) + ie] (|p"| |x| — p’ + x) 


where 7 = Au/fi|p’|. 
Finally using the integral (« — +0) 


| dz ei#(H+ie) (z)—i7 — je™/? T(1 — iy) (15.7.19) 
0 
this gives from (15.7.18), (15.7.16), 

Je e'P*/" Gi (p,p';p”) 


iv In(2p’2 \ |p’ |lx|—p’-x 
e7iyin(2p/p) in| ese 


~ eb p’/h ee 
[po Bip) fae) 


eT =i9) 
(15.7.20) 


near the energy shell, which is to be compared with (15.5.10), (15.5.11), with 
the latter obtained from the explicit solution of G;. Equation (15.7.20) gives 
the asymptotic “free” Coulomb Green function G9°(p’) in (15.5.11). 

For additional applications of the eikonal approximation see Prob- 
lem 15.14, 15.15. See also Problem 15.16. 


15.8 Multi-Channel Scatterings of Clusters and Bound 
Systems 


In the present section, we extend our earlier analysis in this chapter to de- 
scribe the scattering of bound particles and clusters (§2.5), in general, as well, 
where the initial and final particles in a process are not necessarily the same. 
The internal states of the particles are also allowed to change, in the analy- 
sis, upon scattering. For example, a hydrogen atom may be excited from its 
ground state or be completely ionized, with its electron being stripped from 
the proton, upon the impact of the atom by an electron. To carry out such 
analyses, we first introduce the concept of channels and channel Hamiltoni- 
ans. This will allow us to discuss various possible processes, such as the ones 
mentioned above, and define the Hamiltonians corresponding to the asymp- 
totic time evolutions immediately after the preparatory stage and just before 
the detection stage in scattering. This is followed by finding the solutions of 
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the fully interacting theories, corresponding to the preparatory states, ob- 
taining the expressions for the differential cross sections and generalizing the 
optical theorem of §15.4 to accommodate the above cases. We then study 
some subtleties occurring in carrying out the Born approximation for three- 
particle systems and develop equations for handling such a problem. Finally, 
we extend the phase shift analysis carried out in §15.4 to extract information 
on inelastic processes. 


15.8.1 Channels and Channel Hamiltonians 


We consider a system of particles interacting via two-body potentials. To 
describe the general scattering processes intended in this section, we follow 
the analysis carried out in $2.5 and group the particles in consideration into 
clusters. The Hamiltonian of the system may be then written quite generally 
in the form in (2.5.48), where Va, Ve in (2.5.49), (2.5.50) are the intra- 
clusteral and inter-clusteral interaction potentials. 

For example consider a system of three particles forming two clusters, with 
one consisting of one particle and the other one consisting of two particles. 
Such a grouping may, for example, correspond to studying a scattering process 
involving a positron, defining the first cluster, and a hydrogen atom, defining 
the second cluster, composed of an electron and a proton. 

The Hamiltonian of the system of the three particles, interacting with 
two-body potentials, may be written as 


3 
H => Sp; /2m; + Vio (x1 — X2) + Vi3 (x1 = X3) + V23 (X2 os X3) (15.8.1) 
i=1 
For the two cluster system, the kinetic energy term of H in (15.8.1) may 
be rewritten (see §2.5, in particular, (2.5.48)), consistently with Galilean in- 
variance, as 


2 mi 2 ( ie = (maim p) 
Pp (Py a m1?) P2 + P3 M 


Ay ({1, (2 — 
o({ mt ,3)}) oy Vana m1 + 2(mz + ms) 
m a oe 2 
(Pp. Goa Pa Ps)) (ps =F ag Pet Ps)) 
+ + 
2m2 2m3 
(15.8.2) 
where 

P=p,+P.+Pp3, M=m,+m2+ms (15.8.3) 


and pi denotes the momentum of the cluster involving the one particle. 
The physical interpretation of the terms in (15.8.2) is as follows. P?/2M 
represents the kinetic energy associated with the center of mass motion. The 
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second and third term denote, respectively, the kinetic energies of the two 
clusters with the center of mass motion removed. The fourth and fifth terms 
denote the kinetic energies of the two particles making up the second cluster 
with their center of mass motion removed. 
We note that the second and the third terms in (15.8.2) may be combined 
to yield 
((m2 + ms) Pi — m1 (Py + P3))” 
2M? p14 


(15.8.4) 


where 
1 = ™y1 (mg + m3) /M (15.8.5) 


Similarly the fourth and the fifth terms in (15.8.2) combine to yield 


(m3P2 = m2P3)° 


5 (15.8.6) 
2 (m2 + m3)” He3 
where 
}123 = Mgm3/ (m2 + m3) (15.8.7) 
Also note that 
do3 = (M3Py — MeP3) / (M2 + m3) (15.8.8) 


denotes the relative momentum of particle 2 with respect to particle 3, and 


G1 = [(™m2 + ms) Py — M1 (Py + P3)]/M (15.8.9) 


is the momentum of particle 1 relative to the center of mass of the cluster 
consisting of particles 2 and 3. 

To introduce the concept of a channel and the corresponding channel 
Hamiltonian, we introduce the variables 


vr =X2—X3 (15.8.10) 
R2 = (mox2 + m3x3) /(m2 + m3) (15.8.11) 
R = (m 1X1 + M2X2 + m3x3) /M (15.8.12) 
n=x,—R, (15.8.13) 


where r, 1) are conjugate to qo3, q,, respectively, to rewrite the Hamiltonian 
in (15.8.1) /(15.8.2), as (see (2.5.58)) 


h? h? h m3 
H= Vi. Vv. V24+YV; = 
OE Re Sia Yigg aoe ne ma + ms 
m2 
+ V; + ——r]+V; 15.8.14 
13 (1 Mn + Mg r) 23 (r) ( ) 
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and ji, [423 are the reduced masses defined in (15.8.5), (15.8.7). Here Via, 
Viz are referred to as inter-clusteral interaction potentials describing the in- 
teraction of the first cluster, consisting of one particle, with the two particles 
making up the second cluster. On the other hand, V23(r) is referred to as the 
intra-clusteral interaction potential responsible for the interactions between 
the two particles within the second cluster. 

The concept of a channel and the corresponding channel Hamiltonian 
naturally arises from (15.8.14) as follows. If for |n| — oo, Viz, Vig — 0 in 
(15.8.14), then we may define a channel Hamiltonian A, 

‘ae h 


Ay = VR 


h2 
2 
2M ¥ 


an Saag Vt V2 (®) (15.8.15) 


with one cluster remotely separated from the other. Here there is no break 
up of the second cluster made up of particles 2 and 3 interacting via the 
potential Vo3(r). This Hamiltonian may be then used to describe the ini- 
tial (or final) stage of a scattering process when the two clusters are widely 
separated. The grouping of the three particles into two groups (clusters) de- 
fines a possible arrangement channel. This latter channel may be denoted 
by {(1) , (2,3)} consisting of two clusters, one involving particle 1, and the 
other involving particles 2 and 3. By interchanging particle 1 and 2, we may 
introduce the channel {(2) ,(1,3)} and, in turn introduce the corresponding 
channel Hamiltonian H2. Another possible channel is {(1) , (2) , (3)} consist- 
ing of three clusters each made up of one particle and so on. 

A multi-channel scattering from channel {(1),(2,3)} to channel 
{(2), (1,3)} is depicted in Figure 15.3. 

Most importantly, we note that the total Hamiltonian H in (15.8.1), may 
be equivalently rewritten in terms of the channel Hamiltonians H,, H2 intro- 
duced above as 


PE Hey, =a (15.8.16) 
where 
Ay = Ho ({1, (2,3) }) + Vo3 (x2 — Xs) (15.8.17) 
Vi = Viz (X1 — X2) + Vis (x1 — Xs) (15.8.18) 
Hz = Ho ({2, (1,3) }) + Vis (x1 — Xs) (15.8.19) 
V2 = Via (x1 — X2) + Voz (X2 — x3) (15.8.20) 


where Hp ({1, (2,3)}) is defined in (15.8.2). Here V, defines the inter-clusteral 
interaction between the clusters (1) , (2,3), and similarly, V2 defines the inter- 
clusteral interaction between the clusters (2) , (1,3). 

For the collision of two clusters containing, respectively, n1, n2 particles, 
the Hamiltonian, consistent with Galilean invariance, may be written as 
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Fig. 15.3. The figure depicts the scattering of two clusters from channel 
{(1) , (2,3)} to channel {(2) ,(1,3)}, with channel Hamiltonians H, He, re- 
spectively, as described in the text. 


2 


2 
pe (Pe FEPL) we (Pas — FEP2) 
aM * 2m1¢ = a 
B=1 B=1 


2Mag 


P,.— 4p)? (Pp, —- Mp) & 
(Pi. — 7P) (Po — ¥P) + So Viag (X1a — X1g) 


2M, 2Mp = 
neg n2 ny 

+ S> Vaag (X20 — X28) + S> So Vaas (1 — X20) (15.8.21) 
a<B a=16=1 


in the notation of (2.5.48), where, in particular, the last term is responsible 
for the inter-clusteral interaction. The fourth and fifth kinetic energy terms 
may combined to 


MoP1.—M,P2.\7? 1 
15.8.22 
( M 2H12 ae 
where ae we 
1 2 
eee ie So 15.8.23 
M2 = WE Me ( ) 


is the reduced mass of the two clusters. (M2P1. — M,P»2.) /M denotes the 
relative momentum of the first cluster with respect to the second one. 
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On the other hand, the five kinetic energy terms in (15.8.21) may be also 
simply combined (see, for example (2.5.21)) to 


2 Se. 
S7 SS Pag/2map (15.8.24) 


a=1 B=1 


which allows us to re-group together the momenta, as done in the kinetic en- 
ergy terms in (15.8.21), in different manners to define new clusters. Finally, 
we note that we may also re-group the potential energy terms in (15.8.21) 
consistently with the emerging newly chosen set of clusters and, in turn, 
extract the inter-clusteral part of the potential energies as done for the ex- 
ample corresponding to the one depicted in Figure 15.3, and carried out in 
(15.8.16)—(15.8.20). 


15.8.2 Interacting States Corresponding to Preparatory Channels 


Consider a system of non-interacting clusters with total energy Ea 
as the sum of their binding energies and their kinetic energies. Let 
|@2 (E,)) exp (—iE,t/h) denotes the state describing the system, where 
|@2 (E,)) satisfies the time-independent Schrédinger equation 


(Ha — Ea) |®i, (Ea)) = 0 (15.8.25) 


where H, is the channel Hamiltonian for the system of the non-interacting 
clusters. Note that, H, includes the intra-clusteral interaction potentials re- 
sponsible for keeping (binding) the particles within the clusters together. Let 
VY, denote the inter-cluster potential responsible for the interaction of clusters 
when they eventually merge together in a scattering process. 
The total Hamiltonian of the interacting system is, by definition, given 
by 
H=H,+V, (15.8.26) 


The interacting state corresponding to the initial system of non- 
interacting clusters may be written as |e (E,)) exp (—iEgt/h), where 


1 
(Ba — Ha tie 


|? (Ea)) = |®f, (Ea)) 4 5 Va |W (Ea)) (15.8.27) 


as seen below. 
Upon multiplying this equation by (EF, — Ha + ie) and by re-arrangement 
of terms we get 


(Ea — H + ie) |b% (Ea)) = (Ea — Ha + ie) |®f, (Ea) (15.8.28) 


where we have used (15.8.26). By taking the limit « — +0 and using (15.8.25), 
we verify that [ve (Ea)) satisfies the time-dependent Schrédinger equation 
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(H — Eq) |v% (Ea)) =0 (15.8.29) 


involving the fully interacting Hamiltonian H. 

Formally, if we replace E, in (15.8.27) by p®, multiply the resulting equa- 
tion by x (p° — E,) exp—ip®t/h and integrate over p®°, for some function 
X (p° — E,) which is appreciably non-zero for p° ~ E,, we obtain 


[. a [|v (p")) —|@in (»"))| x (p® — Ea) exp (—ip°t/h) 


ar dp" (p° — E,) et Va |b2 (p°)) — (15.8.30) 
wo 2h * P a (p° — H, + ie) a — \p oO. 


For t < 0 we may, in the complex p°-plane, close the contour of integration 
from above, where Im p” > 0, thus avoiding any pole that may arise from the 
Hamiltonian H, for e — +0. We may thus formally infer that for t — —oo, the 
right-hand side of (15.8.30) would be zero. This gives the correct asymptotic 
boundary condition for the interacting state. In the sequel we consider the 
limit where x (p? — Eq) becomes (27h) 6 (p° — Eq) as done in (15.8.27). 
Eventually the initial clusters merge together and the inter-cluster inter- 
action V, becomes non-negligible. After their interactions, some new clusters 
may emerge in the process with a corresponding new channel Hamiltonian 
Hy. By definition the same Hamiltonian H in (15.8.26) may be rewritten as 


H=H,+V, (15.8.31) 


where Vj, is the inter-clusteral interaction potential of the new clusters. 

To obtain the amplitude for transition from the a channel to the b one, we 
first multiply and divide the right-hand side of (15.8.27) by [E, — Hy + ie], 
to get 


JW (Bal) = gag pag [Ba — Ha ti) + (Ha ~ Hi) 1 (Ba) 
1 1 
+ Tm Hy pig) | (Ba ~ Ha + ie) + (Ha — Ho)] pap az Vald* (Ex) 
(15.8.32) 


From (15.8.26), (15.8.31), Ha—H, = V,—Va. Accordingly, by using (15.8.29), 
and the defining (15.8.27) all over again on the right-hand side of (15.8.32), 
we obtain from the latter 


1 


* (E.)) = =} ie |®%, (Fa Va |o® (Ea 15.8.33 
lw (Ea) = eae ig lielh (H)) + Volv" (B))] (158.83) 
and note the presence of the ie factor multiplying |®%, (E,)) on the right-hand 


side of the equation. 
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The transition amplitude from the a channel to the b channel, with the 
latter described by an asymptotic state [erat (E)) exp (—itE,/h), is then 
obtained by considering the limit t — +00 of 


e-i(Ea—Ep)t/h 


(Bout (t) | w% (t)) = 5 (a,b) + [E, — B, +1] ( 


Bi, (Eo) | Vo YW (Ea)) 
(15.8.34) 


where we have multiplied the equation by exp —i(E, — E») t/h to define the 
corresponding time-dependent states, and have used the equation 


(®° , (Ep)| (Ea — Hy + ie) = (Ba — Ey + ie) (88, (Ep) (15.8.35) 


Note the cancellation of the ie factor multiplying |®%, (Z,)) in (15.8.33) on 
account of the formal orthogonality relation 
(8 


out 


(Ey) | ®%, (Ea)) = 6 (a,b) (15.8.36) 


where 6 (a, b) is symbolic standing for all the variables defining the state. Even 
for elastic scattering, where the initial and final clusters are the same, if there 
is a change in the momentum of a cluster in the process then 6 (a, 6) = 0. 
The transition probability rate (probability per unit time) from the a 
channel to the 6 channel is then given from (15.8.34) to be (t — +00) 


d b a 2 ee 1 2 1 
dt (Bout (t) |b (t)) | = 2Im {To d(a, b) hi \Tha [Ea = E, _ ic] \ 
= 26 (a,b) Im (Tra) + “5 (fy = By ral 
(15.8.37) 
where 
Tha = (Bou, (Ep) |Vi 0% (Ea) - (15.8.38) 


For 6 (a,b) = 0, corresponding to any changes occurring in the variables 
of the process by the collision of the clusters, (t + --co) 


& 
dt 


2 
a é (2, = Boia)? (15.8.39) 


\(o%,, (| ¥* ©) 
where the conservation of the total energy is evident. 

For an application, consider the collision of a cluster of momentum p’, 
with wavefunction normalized within a unit volume of space, with a cluster 
at rest. In the center of mass frame, if ~, denotes the reduced mass of the 
two cluster system, then p’/j1q denotes the incident flux. Upon summing 
(integrating) + ©) over the group of final states observed in the experiment 
pertaining to the 6 channel, we obtain from (15.8.39) 
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=oh FO) 2 
a = Oh S- ey (15.8.40) 


for the cross section for the transition from the a channel to the b channel. 

On the other hand, if we sum the left-hand side of (15.8.37) over all pos- 
sible channels with all configurations that may make transitions to from the 
a channel, we have to obtain zero on account of the conservation probability 
and the presence of the time derivative d/dt. Accordingly, (15.8.37) leads 
from (15.8.40) to the equality, 


b 
c=) ti= a se Sie 5 (a, b) Im (—T’,,) (15.8.41) 
b b 


where o is the total cross section for transition from the a channel, and 
T/, corresponds to the transition for the elastic scattering where no changes 
occur in the variables of the scattering process. Equation (15.8.41) is the 
generalization of the optical theorem in (15.4.6). 


15.8.4 Basic Processes 


As an application of (15.8.40), we first consider the scattering of a particle 
off a bound state. More specifically we treat the problem of the scattering of 
a positron (e*) off a hydrogen atom. Let m denote the mass of e* or e~, and 
mo the mass of the proton, taking into account that mg >> m. Labelling e*, 
e~ to be particles 1, 2, we have for the Hamiltonian in (15.8.14), 


iH py 2m+mo Fa ps (m+mo) e? 
2m (m+mo) 2m mo In - mo 
motm 
e? e? 
+ 
mn m ee I 
p? pe ne geo -g2 


+ (15.8.42) 
2m 2m |n—-z| |r| 
where n, r denote the position vectors of e+, e~ from the origin where the 
proton is situated. 
We consider the elastic scattering of et off the hydrogen atom with the 
latter remaining in its initial ground state. Then 


2 2 

Y= -— + TC Hh (15.8.43) 
In-rl [nl 

signalling the fact that the electron remains bound to the proton in the 

scattering process. 
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Let p’ denotes the initial momentum of e*, and (see (7.1.4)), (mo >> m) 


®o (x) = ay = h? /me? (15.8.44) 


1 
— exp (~ |x| /ao), 
TAG 


is the ground-state wavefunction of the hydrogen atom, with Eo, below, de- 
noting the ground-state energy. 
As an approximation to (15.8.40), (15.8.38) we may take?% 


do Qnm f° pedp p? p” 2 
_ te E I(p,p" 15.8.45 
dQ Alp Jo (amas O°" 2m am pee” =e 
with (q = p—p’) 
2 2 
rp.p) = fain ae oy? [+] 5.80 
In-r| |n 
Using the integrals 
Oe kl ee eee 


F 1 272] 
pa e4?/F JG (x)? = f+ <4 a 


4 he 
= F (|q)) (15.8.48) 
we obtain from (15.8.45) 
do met 
= [1 — F (|q|)]° (15.8.49) 


dQ — 4(p’)‘ sin* (4/2) 
lq] = 2|p’|sin 0/2 (15.8.50) 


with @ denoting the scattering angle of eT. 

F (|q|) is called the atomic form factor. For |p’|sin 0/2 < h/ao, F (\q|) ~ 
1, and the hydrogen atom appears neutral to et, do/dQ ~ 0 and no scattering 
occurs. For |p’| sin 6/2 >> h/ao, F (|q|) ~ 0, and the differential cross section 
in (15.8.49) approaches the ordinary Coulomb scattering one in (15.5.39) 
with e* experiencing the charge of the proton. For intermediate values of 
|p’| sin 0/2, e+ sees the atom with an effective charge 


Cet = €r\/ Z (|q]) (15.8.51) 


33 Here we have also neglected logarithmic distortions as occurring in (15.5.10) 
accompanying charged particles. 
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with 
Z (Iq) = [1 - F(la/)]° (15.8.52) 


as an effective charge “renormalization” factor, with total screening of the 
charge of the proton Z (|q|) — 0 at zero momentum transfer |q| > 0. 

As another application of (15.8.40), we treat the problem of the ionization 
of the hydrogen atom upon the impact of an electron, with the atom initially 
in its ground-state. 

The Hamiltonian is given by the expression (mo >> m) 


2 2 2 2 2 
Pi Pd e e € 
A= t t 15.8.53 
2rn 2m |n-xr| nl (rl ( 
and 2 2 2 2 2 
Pt Po e e€ e€ 
Bee tea, Vy, =——— = — 15.8.54 
2m 2m r| In-r| |n| ( 


with r denoting the position vector of the initially bound electron. 

Let ®o (x), as in (15.8.44), denote the ground state of the hydrogen atom. 
We take H, = H,, with ®(x, p.) denoting an eigenfunction corresponding to 
the continuous spectrum of the hydrogen atom. We know from (1.8.51), that 


(® (+, P2)| Bo (+)) = 0. (15.8.55) 
As a first approximation to (15.8.38), we take the amplitude 


; e? e? 
A(P1,P),P2) = [ee d®r eiPin/? O* (r,p aoe - — 
( 1 1 2) ( 2) In —r| in| 


x Bo (r) eiPim/? (15.8.56) 


For the exchange amplitude, for the two-electron system, we may then 
define 


' 2 2 
Aexe (P1;P1P = fa d°r &* (n,p ecient | Zz =| 
pee a mH inl 
x Bo (r) elPin/? (15.8.57) 


by interchanging n « r in the final states in (15.8.56). 

Due to the orthogonality relation in (15.8.55), the second term —e?/ |n| 
in the square brackets in (15.8.56) vanishes upon integration over r. Using 
the integral (15.8.47), we obtain 


Are h? 


where 
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fF (a,p,) = f ar e-"/" &* (2, p2) Bo (2 (15.8.59) 


qQ=P)—P\- (15.8.60) 


In (15.8.57), we cannot dismiss with the —e?/ |n| term in the square brack- 
ets since ®* (n,p,) and ®o (r) depend on different variables. Using, (15.8.47), 
and the integral 


pe eiQ7/h By (r) = 84/743 [1+ Q?a3/n?| 


= 9 (|Ql) (15.8.61) 


we may rewrite 


d3q! 4re?h? 
(20h)? a”? 


Aexc (P1, P41, P2) = / an e-i(a pi )-n/h @* (1, Po) 


x [g (la’ — pil) — 9 (pil) - (15.8.62) 


As a second approximation, we replace ®* (x, p.) in (15.8.59), (15.8.62) 
by exp (—ip. -x/h). This gives 


Are*h? 
A(P1, P13 P2) = ae 9 (Ia + Pol) - (15.8.63) 


Conservation of energy, implies that 


[Pil = pe? — p} + 2mEo (15.8.64) 


Hence for an initial incident electron of sufficiently high energy p’a2/h? > 1, 
p3 not large, g(|p;|) ~ 0, and we obtain from (15.8.62) 


Ane? h? 
Aexe (P1;P1;P2) = mo Pe g (la + Pol) (15.8.65) 
— 
or 
q’ ! 
Aexe (P,P, P2) = pee A(P1, P41, P2) (15.8.66) 
_ 


where q is given in (15.8.60) and A(p,,p{, Pp.) in (15.8.63). 
For an unpolarized beam of electrons, we have to take the following com- 
bination of A and Aexe (see (15.3.39)) 
|A? + | Aexe|” — Re (A* Aexe) = P (15.8.67) 


in computing do for the process. 
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In detail, (15.8.63), (15.8.66) give 
Ane*h? A 
P=( x ) 9 (la+ Pal) 
q 
From (15.8.40), (15.8.68), 


olay cal ae Pi” Pi? Po” 
? : 15.8. 
mpl! | 2nh)° (enn)? Cole Or be og ee 


and by setting 


= p3/2m (15.8.70) 


we obtain, by integrating over |p,| in the process over the delta distribution 


n (15.8.69), 
3 


do 2mm g PalPel f an dQ, P (15.8.71) 
deg A(2rh)® |p| 


where |p,| is given in (15.8.64), P in (15.8.68), (15.8.61). For carrying out 
the angular integrals under a simplifying assumption made on the correction 
due to exchange in P, as given in (15.8.68), see Problem 15.19. 


15.8.5 Born Approximation, Connectedness and Faddeev 
Equations 


Consider a two-particle system with a two-body potential Viz. The opera- 


tor form of the Green function** G = |[p° — H + ie] ~! up to a multiplicative 
constant, may be written as 


G=G°+G@Vi2G (15.8.72) 
or formally in a perturbative expansion as 


G= 5° (@V2)" (15.8.73) 


n2>0 


where G? = [p° —Ao+ ie] * and H is the free two-particle Hamiltonian. 
In the center of mass of the two-particle system, the matrix element 
(P1,P2|G°Vi2|p,, pS), in detail, is given by 


0 0 
(P1,P2|G°Vi2G° |p}, Py) 
34 To simplify the notation further, we omit the + sign in G4, G4, corresponding 


to € — +0, in the remaining part of this section. G is related to the resolvent of 
HA. 
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5° (pi + Po — Pi — Pd) 1 
as pea (di2|Vizlai2) 5. Be Bee 
[p?- BE - PE +ie [p? — Fe — BEE ie 


(15.8.74) 
up to a finite multiplicative constant, and where, in the notation of (15.8.8), 


Gi2 = (M2p, — MP2) / (m1 + m2) (15.8.75) 


denotes the relative momentum of particle 1 with respect to 2, and note that 
Giz di2 =Pi- Pi (15.8.76) 


denotes the momentum transfer to particle 1. The 6° (p,; + ps — pi — p4) 
factor arises as a consequence of momentum conservation. 

The expression on the right-hand side of (15.8.74) is depicted diagram- 
matically in Figure 15.4 (a), where the horizontal lines represent the two 
particles with initial and final momenta p{,p4 and pj, py, respectively. They 
are connected by the wiggly line, representing the interaction potential, giv- 
ing rise to a net momentum transfer p, — p| = — (Py — ps). The right and 
left of the interaction lines are multiplied by G°, with Ho replaced by the 
sum of the kinetic energies of the particles with momenta p{, p and pj, po, 
respectively. An overall 53 (p, + py — p — p5) arises in G?VG°VG®, and for 
the higher orders as well, as a consequence of momentum conservation. 


Pi. Ge Vio Ge pi Pie Ge Vio Ge Vio Ge pi 


/ 
ePo 


Fig. 15.4. (a) This figure depicts the expression on the right-hand side of 
(15.8.74). The wiggly line represents the potential energy Viz, connecting the 
two particles, with Ho in the G®, on the right and left of Vi2, evaluated, respec- 
tively, at the sum of the kinetic energies of the two particles with momenta 
P,P» and p,, py. The figure in (b) represents the product GV 2G°Vi2G°, 
with p/ = p, + Py — ps, as a net integration variable, and Ho in Go in the 
middle, is evaluated at p/’*/2mi + py? /2mo. 


Equation (15.8.72) may be depicted diagrammatically as shown in Fig- 
ure 15.5, where the two horizontal unconnected lines represent two free par- 
ticles. The remaining part of figure represents the connected part of G where 
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the two particles are necessarily interacting connected by at least one wiggly 
line as seen in Figure 15.4. 


= + 


Fig. 15.5. A diagrammatic representation of G in (15.8.72), where the first 
two horizontal lines on the right-hand side show two non-interacting parti- 
cles, i.e., not connected with wiggly lines as in Figure 15.4. The second term 
gives the connected part of G, represented by graphs of the type shown in 
Figure 15.4, and others of arbitrary orders, where the two lines, representing 
the particles, are connected by at least one wiggly line. 


From (15.8.72) and Figure 15.4, the connected part C of G is given by 


1 1 


ie V; 
(p9 — H+ie) **(p9 — Hp + ie) 


(15.8.7) 


Now let us investigate the situation corresponding to an expansion as in 
(15.8.73) for a three-particle system interacting with two-body potentials. 
As in (15.8.8), (15.8.9), we define the momenta 


qi; = (msp; — mip,) / (mi + mj) (15.8.78) 
q; = [(mj + mx) p; — Mi (P; + Px)] /M (15.8.79) 

with i,j,k =1,2,3, jAiFk. 
The free three-particle Hamiltonian may be written in various forms (see 


(15.8.2), 


Pi | Ps | PR ie | 


Ho = = 
o Imy » 2m2 ? 2m3 2ti2 23 
a 33 a re 
2u23 0 2h 
2 2 
Sa DE gp (15.8.80) 
2H13 22 


where 


[ig = mum, / (mM; + m5) (15.8.81) 
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fs = Mi (mM; + mx) /M (15.8.82) 


with the indices defined as above. 
The total potential energy V may be defined as in (15.8.1) consisting of 
three terms Viz, Vi3, V23. We may also introduce four channel Hamiltonians 


see 
A, = Ho + V23 

15.8.83 
Hz = Ho + Viz ( ) 
Az = Hy + Viz 


corresponding, respectively, to all the particles are free, particle 1 is free, 
particle 2 is free, and particle 3 is free. The inter-clusteral potentials V,, 
a = 0,1,2,3, such that H, + Va = H, with the latter denoting the total 
Hamiltonian, are then given by 


Vo = Vi2 + Vig + Vo3 = V 
VY, = Vie + Viz 

15.8.84 
V2 = Vie + Voz ( ) 
V3 = Vig + V23 


From (15.8.27), (15.8.38), the transition operator, whose matrix elements 
are given by Tq in (15.8.38), from the a channel to the b channel, may be 
written as 


1 
(ia t a 15.8. 
Uy Vo ay 7 (15.8.85) 
off the energy shell. 
As before 
G=G°+@ VG (15.8.86) 


where now V consists of the three terms in (15.8.84), and G? = 
(p° — Ho + ie)? for a free three-particle system. 

A perturbation expansion for G as in (15.8.73) is not useful as will be 
seen below. For example, up to a finite multiplicative constant, 


0° (py + Py + P3 — Pi — PS — P5) 


3 
(» ~ p2/2mi + i) 
i=l 


(P1,P2,P3 |G°v G° |p‘, Pd, PS) = 


x [(di2 |Vizlai2) 6 (p3 — P3) + (ay3|Vislais) 5° (Py — ps) 
1 


3 
» -> pi? /2m; + ie 


t=1 


+ (do3|V23lq93) 5° (P, — P')| (15.8.87) 


where the first term within the square brackets indicates that particle 3 is 
just a “spectator” and does not participate in the interaction, while particles 
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1 and 2 interact via the potential Viz. A similar interpretation is given to 
the other two terms involving 6° (p, — p4) and 63 (p, — p/,) as factors. The 
expression on the right-hand side of (15.8.87) is represented diagrammati- 
cally in Figure 15.6 (a). The deltas within the square brackets corresponding 
to “spectator” particle will be referred to as dangerous deltas for reasons 


discussed below. 


Fig. 15.6. (a) The first graph shows that particle 3, a “spectator”, does 
not participate in the interaction, while particles 1 and 2 interact via the 
potential Vi2, depicted by the vertical wiggly line connecting lines 1 and 2. A 
similar interpretation is also given for the other two graphs. All these graphs 
are disconnected including dangerous deltas associated with the “spectator” 
particles. (b) A connected graph, where all the particles interact, representing 
the term G°V23G°V2G", involving no dangerous deltas, coming from a higher 
order expansion of G. 


The problem with (15.8.87) and similar terms of higher orders is 
as follows. In computing transition amplitudes, one would take the ab- 
solute values squared of terms such as in (15.8.87). The overall delta 
6° (p; + Po +Pp3—p), —Pps—p) is harmless in the sense that in com- 
puting a physical quantity, such as a transition amplitude, per unit vol- 
ume v, as vu — oo, as done in going from (15.5.37) to (15.5.38), 
and also in going from (15.8.39) to (15.8.40), in obtaining the differen- 
tial cross sections, the squaring of 5° (p,; +p .+p3— pi — py — Ps) gives 
5°(0)0° (p, + Po + P3 — Pi — PS — P54) = 
(v/ (2rh)*)63 (p, + po + P3 — pi — Py — ph), v — 00, and the volume v does 
not contribute in such a computation. This is unlike the squaring of the deltas 
within the square brackets in (15.8.87) and any manipulation involving them 
becomes meaningless. 

It is thus clear that an expansion such as in (15.8.73), for the present case, 
is not useful, and we have to obtain the connected part of G, as done for the 
two-particle case in (15.8.77). We also have to find an iterative summation 
procedure in evaluating the connected part of G involving no dangerous deltas 
of the types mentioned above. 

The connected part of G may be found as follows. We collect all the 
graphs where particle 3 is a spectator, and denote the resultant graph by 
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A3. Similarly define Ay and Aj. Finally, we note that G° represents the three 
particles as free. The connected part C' of G is then given by inspection to 
be 


C=G=A3-—Ag—Ay-—G (15.8.88) 
From (15.8.77), we note that 
A3 = Gi2Vi2G° (15.8.89) 
Ag = G13Vi3G° (15.8.90) 
Ay = Go3Vo3@° (15.8.91) 


where now Ho, of course, is the free three-particle Hamiltonian, 


Gy= |p == Ve) (15.8.92) 


The connected part C of G is depicted in Figure 15.7. 


COLETS 


Go 


Fig. 15.7. The connected part of G, where A3, Az, Ai are defined in (15.8.89), 
(15.8.90), (15.8.91), respectively. 


The explicit demonstration of (15.8.88) is straightforward. To this end, 
we note that the transition operator from the channel 0 to channel 0, Uoo 
in (15.8.85), i.e., for which the three particles are initially and finally free, is 
given by 


Uno =V + V GU = V + UnnG°V 


=V4iVGV 
= (15.8.93) 


In particular, 
VG=UGC (15.8.94) 


which, in turn implies that 


G=G°+GU @ (15.8.95) 
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Define two-particle amplitudes 
Uy = Vir + VirGyrVir (15.8.96) 


with cyclic interchange of 7,7,k, 7 < k, i,j,k = 1,2,3, with Gj, defined in 
(15.8.92), involving the three-particle free Hamiltonian Ho, and the potential 
Vik. 
Also set 
Vix + VjnGOU = U0 (15.8.97) 


and note that 


vOuy@s.y® —y (15.8.98) 


as implied by the first equality in (15.8.93), and the fact that Viet+Vi3+Va3 = 
V. 
Upon substituting 
G° = Vix — GjxVjn@ (15.8.99) 
in (15.8.97), using the latter equation all over again in the resulting expres- 
sion, and (15.8.98), we obtain 


U® = Vip + VinGye (v; 4UuO 4 yu) 


=U; + VinGix (UM +0) (15.8.100) 
where we have, in the process, used (15.8.94). Finally, since 
VinGje = UiG° (15.8.101) 
as in (15.8.94), we have 
U® =U; +U,;6° (u® th ue) (15.8.102) 
These equations are referred to as the Faddeev equations.?° We note, in par- 
ticular, that each U is coupled to two different U)’s with j 4 i, only. 
Equations (15.8.102) provide an iterative procedure to determine the U“) 


in terms of two-particle amplitudes U;, U2, U3 defined in (15.8.96). 
From (15.8.95), (15.8.97), (15.8.98) and (15.8.102), we then have 


G = G+ G(U, + Us + U3) Go + CU, (u® + yu) G° 
+ QU, (u® a u®) G+ G°u,c (u® is u) G° (15.8.103) 


Now it is easy to see that 


35 Faddeev (1960, 1961, 1962). 
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G° (U, + Up + Uz) G® = Ay + Ag+ Az (15.8.104) 


with A;, Ag, Ag defined in (15.8.89)—(15.8.91), from which we obtain 
C= GU, (u? ue ) a + GU2G® (U uy +u®) Ge 
+GQUs (u® 4: uy) @ (15.8.105) 


for the connected part of G. That C is indeed connected follows by noting 
that from iterating the Faddeev equations in determining U, U@), U®), in 
terms of U;, U2, U3, we always obtain for C' the sum of terms of the form 


-- (G°U;) (G°U;) G°--- (15.8.106) 


with i # 7, for any two consecutive terms in a product in the sum. The fact 
that 7 ~ j in (15.8.106), as just discussed, implies that such terms are nec- 
essarily connected and thus do not involve the dangerous one particle deltas 
mentioned earlier. Equations (15.8.105), (15.8.102) allow thus the determi- 
nation of the connected part C of G.%6 


15.8.6 Phase Shifts Analysis 


Consider a two-cluster system in the center of mass frame. The total 
angular momentum in this frame consists (see (2.7.39) /(2.7.40)) of the orbital 
angular momentum of relative motion and the internal angular momenta of 
each of the two clusters. For simplicity of presentation, we study the collision 
of a cluster with a target cluster both of zero internal angular momenta. A 
generalization including a target with non zero internal angular momentum 
is given in Problem 15.20 using the angular momentum decomposition in 
(5.10.99), (5.10.103). 

Let Tua denote the elastic T-scattering matrix, in the center of mass, of 
the collision of the two clusters with zero internal angular momenta. In this 
case the angular momentum will be just the orbital angular momentum of the 
relative motion. Assuming rotational invariance, the same analysis leading to 


(15.4.23) gives 


2 re 1 
Pie (20 +1) —sS Py (cos 0) (15.8.107) 


=) 


where [tq denotes the reduced mass of the two-cluster system, and, as we will 
see below, when other channels are available for the scattering process, nf (p) 
is not just a phase for all 2. 


36 For analyses corresponding to more than three particles, cf., Weinberg (1964); 
Yakubovsky (1967); Grinyuk (1980). 
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The elastic cross section gq is given from (15.8.40) to be 


THe i dp p? pi” 2 
ae Gog Pa ph Peon ep 15.8.108 
poh J (2nh)?  \ ta 2a oe ( ) 


where E, is the sum of internal energies of the two clusters. The integration 
OVEY Pa gives 


Coa = eT sf a ale (15.8.109) 
which allows us to define the elastic scattering amplitude (pz = |p,| = |p/]) 
Ha 
aa a9 = —-a— Tua 15.8.11 
foa (Pas 8) = — 3% (15.8.110) 
as in (15.3.11), with 
faa (Pa, 9) (2 + 1) [n? (p) — 1] Pe (cos 8) (15.8.111) 
=0 


From (15.8.109) the elastic cross section Og_ may be rewritten as (see 
(15.4.25)) 


a ox 2 + 1) |n¥ (p) — 1) (15.8.112) 
Pa (= 


On the other hand, the optical theorem in (15.8.41) relates Im faa (Pa, 0) to 
the total cross section o. The latter may, in general, be different from Oga 
when other channels are available for the scattering process to go into from 
the a channel. One may then introduce the concept of the reactive cross 
section 0;¢, to account for this, defined by 


Ore = O — Og (15.8.113) 


where O.¢ = Ogq is the scattering cross section for the a channel in question. 
From (15.8.41), the total cross section is given by 


Anh 
o = —~ Im faa (Da, 0) (15.8.114) 
Pa 
or from (15.8.111) 
FD 2 + 1) Re (1 — n? (p)) (15.8.115) 
Pa =0 


For the reactive cross section we then have 


Fre = (2 +1) [1 — |n’ (»)?| (15.8.116) 
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as is readily checked from (15.8.113), (15.8.115). Therefore, when there are 
other channels available to go into from the a channel, we must have 


In¢ (p)|? <1 (15.8.117) 
That is, we may write 


ne (p) = |n¢ (p)| e782) (15.8.118) 


with |77 (p)| < 1 in this more general case. It is interesting to note that if 
inelastic processes are permissible, i.e., |nf (p)| < 1, then from (15.8.111) this 
implies that the “elastic channel” a is available as well for the collision process 
as faa (p,@) would be non-zero. Note also that o,- attains its maximum, for 
Ore = Oeg. 

For further generalizations, allowing non-zero internal angular momenta 


see Problem 15.20. 


15.9 Passage of Particles through Media; Neutron 
Interferometer 


An application of great practical importance connected with the passage 
of charged particles through matter is that of their energy loss by collisions 
with the underlying medium. We here consider the one associated with the 
passage of charged particles in hydrogen as a direct application of the formal- 
ism developed in the previous section. The analysis may, however, be applied 
to other media as well. Finally, we study the problem of neutron interferom- 
etry which has been quite important in recent years in quantum physics and 
is expected to have further applications in the future. Here we are interested 
in the splitting and re-combination of a beam of neutrons in the Earth’s 
gravitational field and in the determination of the gravity induced quantum 
mechanical phase shift arising from the interference of the combined beams. 
The phase shift depends, in particular, on the gravitational and the Planck 
constants. 


15.9.1 Passage of Charged Particles through Hydrogen 


Consider the collision of a charged particle of mass M and charge eg witha 
hydrogen atom. The amplitude for the excitation of the atom from the ground 
state Bg to an excited state ®, is given, approximately from (15.8.58) to be 


Amegeh? 
A(p,p’) = = (®, Je-#*/*|o ) (15.9.1) 


where q = p,, — p’ is the momentum transfer to the charged particle. The 
cross section of excitation is then (see, for example, (15.8.69)) 


868 15 Quantum Scattering 


2 32 274 3 12 2 
spa ae a e a (8 re eee Fn) 
h\p’| (2rh)>  \ Qu Qu 


| 2 


—iq:x/h 
g Male 2120) (15.9.2) 


where 
Mm 


M+m 
is the reduced mass of the system, with the reduced mass of the atom taken 
to be the mass of the electron. Here Eo, E, denote the energies associated 
with the states ®g, ®,,, respectively. 

The |p,,|-integration may be readily carried out by using the property of 
the delta function to set 


IPnl = VP? + 2u(Eo — En). (15.9.4) 


b= (15.9.3) 


To carry out the angular integration, we note that for given |p’|,|p,,|, with 
the latter defined in (15.9.4), 


a’ = p;, — 2|P,||p'|cos8 + p? = ¢? (15.9.5) 
we have 
sranedes (15.9.6) 
IP,.| |P’| 
and 
dmax =|P'|+|Ppl,  dmin = [P’] — [Ppl - (15.9.7) 
Hence, 
oe ee / Se (15.9.8) 
[p’| Gmin ia 
where : 
Fro(q) = (Pn feo!" |0 | (15.9.9) 


and we have arbitrarily chosen the direction of q to be along the z-axis in 
computing F,0(q). 

Consider a system of atoms with ./ denoting the number of atoms per 
unit volume. We may then introduce the energy loss per unit path length of 
the charged particle in passing through the medium by 


— = (En — Eo) NV ono (15.9.10) 


where the summation (integration) goes over all the accessible states for the 
discrete as well as the continuous energy levels. 
We may rewrite (15.9.10) in an equivalent form by setting 
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2m 


“p (En ~ Bo) Fnol4) = Fno(4) (15.9.1) 


to obtain 


dt an as. i oS Fate) (15.9.12) 
4. 


db m|p'|? es 
where we recall that dmin/max depend on n. 
The function f,0(q) has the following important completeness property 


S~ fao(g) = 1 (15.9.13) 


for all g to be shown later below. 

Because of the singular nature of Fy,o(q)/q> for q — 0, and its rapid 
damping for large q, because of the 1/q® factor and the matrix element of 
the exponential exp —igz/h in F,9(q) (see, for example, (15.8.48)), the main 
region of integration in (15.9.8) comes form small g. The latter corresponds 
to a small momentum transfer to the charged particle resulting in an almost 
undeflected straight path of the particle. 

We consider the high-energy limit |p’| >> |p’| — |p,,|, from which 


dmax = Ip’| a IP nl =2 Ip’| _ (\p’| _ IP») 
~2|p'| = daax (15.9.14) 
and from (15.9.4), 


L(En — Eo) rool W(En — Eo) 
Qmax |p’ | 


2 
dmin = lp'| ~~ IP,| = = te (15.9.15) 
Since the main contribution to the integral in (15.9.8) comes from small 
q, we may effectively carry out the so-called dipole approximation to Fj,9(q) 
and replace the latter by q?|(®,,|z|®g)|" /h2 by expanding the exponential 
exp (—igz/h) ~ 1—igz/h, and using the orthogonality of |®,,) and |®g). This 
gives the q-independent quantity for f,o(q) 
2m 
Fz (En — Eo) |(®n|z|®o)|” = fro (15.9.16) 
referred to as the oscillator strength, while fno(q) in (15.9.11) as the gener- 
alized oscillator strength, with d in 455 corresponding to the dipole approxi- 
mation. 
Accordingly, we may rewrite (15.9.12) as 


dé = Arepe” u2=N 
dL mip’? 
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F 2m f 2 ! 
In (dmax) — x Dy (En — Bo) |(Pn|2/®o)|" In (din) | (5.9.17) 


n 


where in writing the first term we have used the completeness summation 
formula in (15.9.13) and the fact that q/,,, is independent of n. 
Upon defining an “average ionization potential’ J through 


2m 


a '(En — Ep) |(®p|z|®o) 2 In(E, — Ep) =n I (15.9.18) 


we have from (15.9.13), (15.9.14), (15.9.15), (15.9.17) 
dé Anege? (ee ) 
= n 


= 15.9.19 
dL mv? I ( ) 
where p’ = pv, v is the initial velocity of the incident charged particle relative 
to an atom, M in (15.9.3) denotes its mass and m denotes the mass of the 
electron in the atom. 
It is interesting to note that for M >> m, (15.9.19) reduces to 


dé Anege Qmv? 
dL somv? se I 


(15.9.20) 


which is independent of the mass of the incident charged particle. 
On the other hand for an incident electron = m/2, and (neglecting the 
exchange effect), 


dé 4 252 2 
= mee A (= ). (15.9.21) 


dL smu I 


It remains to establish the completeness summation formula in (15.9.13). 
To this end, we note that from the Schrédinger equation, 


, h2 : 
(En — Eo) | d3x B* e—147/hG, = igs [oxoeiy - (VO* by — &* VGo) 
m 


_ihg 3 e igz/h 0 * * O 
See d°’x Agen Do Pn, Do 
h? [2iq a) q? 
a —igz/h| qt 2 tees 
2m | A (#6 +0) a h (®, le )| , 
(15.9.22) 


Upon multiplying the above equation by (® 


o'/" 0, ) 2m/q? and sum- 
ming (integrating) over n, we obtain from (15.9.11) 

0 

—® 

Oz 0) 


3 fla > (aoe) (a 


oe ige/h 
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2 
as B “(0 om "|Bn) (® je) 
h? [ 2ig a) q’ 
—" |99/6,|£a,\+ L(a,|6 
F | w (20| geo) + Fe (Pol) 
~] (15.9.23) 


where we have used the fact that (®9|O®o/0z) = 0, (®p|®o) = 1. 

A similar analysis as above may be carried out for multi-electron atoms, 
where one of the changes to be made in the analysis is to replace exp —iq - x/h 
by >0, exp (—iq- x;/h), with a summation going over the electrons in the 
atom (see Problem 15.21). For the hydrogen case, J ~ 15 eV. For other 
atoms, I is proportional to the atomic number Z with a proportionality 
factor roughly of the order of 10 eV. 

Needless to say, the charged particle would also emit radiation but we will 
not go into this here. 


15.9.2 Neutron Interferometer 


Consider a neutron beam which strikes the surface of a (cubic) crystal at 
an incident angle 6 as shown in Figure 15.8, with wave vector k = p/h. For 
the so-called Laue scattering,®” the transmitted beam wave vector k; is the 
same as the incident one, while the one of the reflected beam k,. differs from 
k, by a reciprocal lattice vector Q, with the latter parallel to the face of the 
crystal. The initial incident state may be written as 


wy = el = e171, + ixe) (15.9.24) 
where x1, X2 are standing waves given by 
X1 = COs koa, xo. sin kon (15.9.25) 


and 


|ko| = Bas (15.9.26) 
2 ag 


with atomic locations given by 


ayn = (n1a1, N2d2, 2343) (15.9.27) 


where n1,72,nN3 are integers. We note that?® 

37 For additional details of Laue scattering from crystals see Greenberger and Over- 
hauser (1979); Olariu and Iovitzu Popescu (1985); Rauch and Petrascheck (1978). 

38 Without loss of generality it is assumed that there is an atom located at x2 = 0 
as indicted in (15.9.27). 
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Face of crystal io ee 


Fig. 15.8. The figure depicts the so-called Laue scattering of a beam of 
neutrons from a (cubic) crystal where the associated transmitted wave vector 
k; is the same as that of the incident one, while the wave vector k, of the 
reflected beam differs from k; by a reciprocal lattice vector Q. The incident 
angle @ is referred to as the Bragg angle. 


x1]? = cos? "2 (15.9.28) 


a2 
takes its maximum value at atomic sites, while 
Ta 
ly2|? = sin? —* (15.9.29) 
a2 


takes rather its minimum value of zero at atomic sites. That is, i(expik,21)x2 
essentially goes through the crystal undisturbed (see Figure 15.9), while 
(expikya1)x1 interacts relatively stronger with the crystal. Accordingly, 
where the beam emerges from the opposite face of the crystal, we may write 


wp =e" (Nyy + ix2) (15.9.30) 


where 7) is a complex number 
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n= ae'? (15.9.31) 


with a, @ real, are characteristics of the crystal and its thickness, and where 
a is the absorption coefficient. For crystals with negligible absorption of neu- 
trons, such as silicon of several millimeters thickness, one may take a = 1. 


eif 


/9 f 
/“cos 5 


Fig. 15.9. For the standing wave x1, |y1|? takes its maximum values at 
the atomic sites, and hence (expiki21)x1 interacts rather strongly with the 
crystal. On the other hand, for the standing wave y2, |y2|” takes its mini- 
mum value of zero at the atomic sites, and hence i(expikix1)x2 propagates 
essentially undisturbed in the crystal. For a crystal of negligible absorption of 
neutrons, when the beam reaches the opposite face of the crystal, it emerges 
from it with a reflection coefficient i(expi@/2) sin 6/2 and transmission coef- 
ficient (expi3/2) cos 3/2, where @ is real and is a characteristic of the crystal 
and its thickness. 


With a = 1, we may rewrite wy as 


, 


wp =pe®* +nelk™ (15.9.32) 


where 
p = e8/? cos 7 « = iel/? sin 3 (15.9.33) 


and 
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x’ = (41, —22) (15.9.34) 
k- x’ =k,-x (15.9.35) 
k,. = (ky, —k). (15.9.36) 


One may then introduce the transmission and reflection probabilities 

T = |p|? = cos? - R=|s|? = sin? 7 (15.9.37) 
Note the presence of the i multiplicative factor in « given in (15.9.33). 
Referring to the experimental situation of a neutron interferometer, de- 
picted in Figure 15.10, we note that since the amplitude to go from A to B, 
for example, outside the crystals is, up to an unimportant multiplicative fac- 
tor, just the phase factor exp (iM(xg — x,))?/2htg,, where M is the mass 
of the neutron, the intensities [,, Iz at the detection sites D,, Dz are given 
from (15.9.32) /(15.9.33), up to the same overall unimportant factor 


Ty = |n3 + pxp|” = R(T — R)? (15.9.38) 


Tp = |n2p + pr? |? = 4T R? (15.9.39) 


using the symmetry between the segments BC, AD and similarly of the 
segments AB, DC for propagation outside the crystals. 

Now we rotate the whole system in Figure 15.10 by an angle 4, raising 
the segment BC at a height H above the segment AD, both in horizontal 
direction, in the Earth’s gravitational field as shown in Figure 15.11. 

For the amplitude of propagation between two points in a given potential 
in almost a straight line, we may take the corresponding approximation from 
the expression already derived in (11.2.31), 


t 


(xt|x't/) = (xt|x't)}exp—5 arv(x + (x — x’) 
t’ 


rT-t 


(tt) 


where (xt|x’t’), is the free Green function. 

For the subsequent analysis we consider the thickness of the slabs a < d. 
We choose the potential energy on the segment AD to be zero as a reference 
point, and take 


ae T—tp MgH 
= dr V = ————— | ~ (tc — tg) ——— 
a8 T (xz + (xc — XB) (tc = (tc — te) i 


=( d jae 


vcos@ h 


=A (15.9.41) 
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20 
d/cos0 D 


(TOP VIEW) 


Fig. 15.10. (Neutron Interferometer) A beam of neutrons strikes a slab of 
crystal in a system of three parallel slabs equally spaced at distances d apart 
and each of thickness a. The beams BC’, DC interfere at C' and the intensities 
of neutrons emerging from C' are measured by the detectors D;, Dz. Even- 
tually the whole system will be rotated about the direction of the incident 
beam by an angle ¢ raising the beam BC above the beam AD, both moving 
horizontally, in the Earth’s gravitational field (see also Figure 15.11). 


corresponding to the line segment BC for neutrons travelling at a height H as 
given in Figure 15.11. outside the crystals, where v is the speed of a neutron. 

By symmetry, (xctc|xptB),y = (xptp|xata), for the free parts of the 
Green function, while (xgtg|xata) = (xctc|xptp), for propagation outside 
the crystals, which according to (15.9.40), are, up to unimportant numerical 
factors, just phase factors which as easily seen below may be factored out in 
computing the amplitudes corresponding to the intensities [,, Iz determined 
at the detection sites D,, D2, and hence are unimportant. 
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From Figure 15.11, (15.9.40), (15.9.41) and (15.9.33) we have, up to an 
unimportant common multiplicative factor, in analogy to (15.9.38), (15.9.39), 


= [nei + prr| 


= R{1 —2TR(1+ cos A)] (15.9.42) 
and 
i= x20. + pr? |” 
= 2TR?(1+ cos A). (15.9.43) 


d/ cos 0 


H = 2d sin@sin ¢ 


Fig. 15.11. By a rotation of the whole system in Figure 15.10 about the 
direction of the incident beam by an angle ¢, the beam BC is raised above 
the beam AD, both in a horizontal direction, by a height H, as indicated, 
in the Earth’s gravitational field. For 6 4 0, the difference Iz — I of the 
intensities measured by the detectors D2 and Dj, oscillates as a function 
of ¢ with a phase depending on the gravitational constant g and the Planck 
constant h. 


We note that J, + Ig is independent of the phase, while 
Ip — 1, = R(ATR—-1)+4TR’ cosA (15.9.44) 
has an oscillatory character in ¢, and may be recast in the form 


(Iz — 1,)(¢) 
(12 — 11)(0) 


ee (15.9.45) 


= a1 + A208 
1 2 ( hw 
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and is a function of a gravity induced quantum mechanical phase depending 
on the gravitational constant g and the Planck constant h. 

An experiment of the sort described above has been carried out®? and 
the induced phase shift has been observed. The parameters taken were a = 
0.2 cm,d = 3.5cm,6 = 22.1°,h/Mv = 1.445 A. If a is not neglected in the 
above analysis, then A in (15.9.44) is to be multiplied? by the corrective 
factor (1+ a/d). 


Problems 


15.1. 


15.2. 


15.3. 


15.4. 


15.5. 


15.6. 


Find the scattering amplitude and the cross section in the Born ap- 
proximation for the Yukawa potential A(exp —j|x|)/ |x|, and the ex- 
ponential potential X’(exp — |x|), 4 > 0. Use the sufficiency condition 
(15.3.26) to set bounds on the parameters in question for the absolute 
convergence of the Born series. 

Form (15.4.20) you have seen that the optical theorem relates different 
orders in the potential. As a matter of fact, the Born approximation 
violates the optical theorem, as seen, for example, from the previous 
problem, since Im fg = 0, while og # 0. Accordingly, it is necessary 
to go to second order in the potential in determining Im f(p’,p’) to 
verify the optical theorem. Use (15.3.10)/(15.3.11) to second order in 
V to verify the optical theorem in (15.4.6) with o determined in the 
Born approximation op. 

For a potential of short range, i.e., for one which vanishes faster than 
the Coulomb potential, show that for |x| > co (p° ~ E(p’)) 


ip’ x/h 
(p° — B(p') tie)’ 
Show that the radial partial Green function Ge(r,r’) in (15.4.39) sat- 
isfies the differential equation (15.4.40) and hence (15.4.42) provides 
a solution for ge(r). 
Restrict the sum in (15.4.29) for the scattering amplitude f(p,@) to 


S and P waves, i.e., for 2 = 0, = 1, to write the differential cross 
section as 


Jer elP'*/2 Gs (p, p'; p°) 


do fi? 

dQ p2 
Find bounds on the coefficients A, B, C. Also express these coefficients 
in terms of the phase shifts 59,5; and find the general structure for o. 
For the Born approximation to the phase shift 6¢ in (15.4.45), for 
small 5¢ such that tan &¢ ~ 5¢, (exp 2i5¢) —1 ~ 2i5¢, use the expansion 
formula 


(A+ Bcos6 + C cos” 6). 


3° Colella et al. (1975). See also Greenberger and Overhauser (1979). 
40 Greenberger and Overhauser (1979). 
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15.7. 


15.8. 


15.9. 
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Sr(20+ 1) (ie (Ry) ee ner 


where Q = 2(p/h) sin(@/2), to obtain an expression for the scattering 
amplitude and compare it with the one given in (15.3.12)/(15.3.11) 
for a spherically symmetric potential V(r). 

For the = 0 contribution in (15.4.29), set go(r) = g(k,r) in (15.4.33) 
with p/h = k. Use the Schrédinger equation to show that for R large 


drg(k,r)g(0,r) 
g(k, R)g(0, R) 


Q(k, R) = Q(0, R) is fo 


where Q(k, R) = (dg(k, R)/dR) /g(k, R), and g(0,R) may be taken 
to be (R — a) for some constant a. Use this equation, together with 
the asymptotic form for g(k, R) in (15.4.34) to obtain the so-called 
effective range expansion 


1 
kcot 89 = —— ma a+ 


at low energies k? ~ 0, where 


g(0,r) — u(r) g(0,r) = 0. The parameters a, ro are respectively called 
the scattering length and the effective range. Interpret the significance 
of these parameters, and obtain the @ = 0 contribution to the cross 
section at low energies. 

Consider a particle in the spherically symmetric potential 


Oe Vo, r<R 
0, r2e>R 


at low energies. 

(i) For Vo < 0, determine the scattering length a and effective ro 
introduced in the previous problem. 

(ii) Carry out a qualitative analysis of the sign of a versus the sign 
and magnitude of Vo in a plot of go(r) (ie., for 2 = 0) versus r. 

(iii) At low energies, determine the € = 0 contribution of the cross 
section. Under what condition the latter and the phase shift 5o 
vanish? The vanishing property of the € = 0 contribution to the 
cross section is referred to as the Ramsauer effect. 

For the spherically symmetric potential in the previous problem with 

Vo < 0, consider the low energy ¢ = 0 scattering amplitude fo(p, 6) in 

(15.4.53). Investigate the nature of the poles that fo(p,@) may have 


15.10. 


15.11. 


15.12. 


15.13. 


15.14. 


15.15. 


15.16. 


15.17. 


15.18. 


Problems 879 


on the positive imaginary axis in the complex p-plane and the bound 
states. Investigate also the analyticity property of fo(p, @) in the com- 
plex energy E = p?/2m plane. 

Expand the scattering amplitude fc(p,0) for the Coulomb potential 
in (15.5.22) in terms of the Legendre polynomials P,(cos @), where 
|p — p’| = 2psin 6/2, and show that up to an overall phase factor 


Co 


Ah Tr(é+1+ iy) 
fo(p, 0) = Dip Dale + yas eae 0) 
where y = Ay/hp. For an attractive potential y = —|A| u/hp, and 


T(€+1+ iA] u/h) develops poles at non-positive integers for p = ik, 
k > 0. Use this fact to show that the corresponding energy values 
p* /2u coincide with the discrete energy levels of the hydrogen atom. 
For the potential V(r) = co for r < Rand zero otherwise, compute the 
differential cross section. Investigate the high energy behavior of the 
cross section and show that it does not coincide with the one obtained 
from a naive classical computation. Interpret this result. 

Use the intuitive argument given below (15.5.18) to obtain the asymp- 
totic “free” time evolution operator for a potential V(r) which vanishes 
at infinity like r~> for some 0 < 6 < 1 in analogy to the Coulomb case 
exp(—iAoc(t)/h) with Hoc(t) given in (15.5.18). 

[Ref.: Manoukian and Prugovecki (1971). See also this reference for 
related technical details and physical considerations.] 

Use the analysis carried out in $11.4 to recast the expression for the 
scattering amplitude as obtained from (15.6.6) in a path integral for- 
malism. 

Develop the eikonal approximation in (15.7.7) for the Yukawa poten- 
tial V(x) = V(b, p) = \exp (-10 vb” + p?) /V/b? + p?, uo > 0. 
[Hint: You may express x(b, |p’|) in (15.7.9) in terms of the Bessel 
function Ko({0 |b]).| 

Use the asymptotic behavior of the Bessel function Ko({Uo |b|) for poo > 
+0 to obtain an eikonal approximation for the Coulomb potential up 
to an overall unimportant (infinite) phase factor by the application of 
the previous problem. 

Use the method developed through (15.7.15)—(15.7.20) to derive an 
expression for the asymptotic “free” Green function corresponding to 
potential V(r) = Ar~°, for some 0 < § < 1. See also Problem 15.12. 
Repeat the procedure worked out through (15.1.41)—(15.1.46), to 
prove the existence of the modified Moller wave operators Q4c¢ for 
the Coulomb potential as the (strong) limits t — too of Qc(t) = 
ettH/he—iHoc(t)/h where Hoc(t) is given in (15.5.18). 

[Ref.: Dollard (1964).] 

The scattering S' operator is so defined that (®°,,, |:S|®2,) = (ph | we); 
where |p2) = 02 |62), ph = 05 |®2,,), and QF are given by the 


out /? a 
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15.19. 


15.20. 


15.21. 
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(strong) limits t — -too of exp (itH/h) exp (—itH./h), where H, is 

the corresponding c-channel Hamiltonian. Relate this operator to the 

transition matrix T,, in (15.8.38) for an incoming |®%,) state and an 

outgoing | Be) state. Formulate and spell out precisely the conditions 

for which this operator is unitary. 

To obtain a better agreement with experiments, one may replace* the 
q’ q 


7a 7a | in (15.8.68), taking into account 
IP2— P| [Po — P| 


factor |14 


2 2 2 

of the exchange effect, by | 14 ( a 7) a : . Make such 

p*— P2 p*—P2 
a replacement in (15.8.71) and carry out the angular integrations. A 
change of an angular integration variable as in (15.9.6) is useful. 
[*Ochkur (1963, 1964).| 
Carry out the decomposition of Tyq in (15.8.107) for the scattering of 
a particle of spin zero off a cluster whose internal angular momentum 
Ss in not zero. 
[Hint. Follow an analysis as in (15.4.21) and use expansions as in 
(5.10.99) /(5.10.103) in terms of Clebsch-Gordan coefficients. See also 
85.9] 
Can you extend the analysis to the case for the scattering of one 
cluster off another if the internal angular momenta of both clusters 
are non-zero? 
Recapitulate the analysis carried out in $15.9 for the energy loss per 
unit path length by a charged particle passing through a medium 
with multi-electron atoms. Simply replace exp (—iq- x/f) in (15.9.9) 
by J, exp (iq - x;/h) as a sum over all the electrons in an atom. The 
corresponding completeness summation formula to (15.9.13) would be 
equal to the atomic number Z. The “average ionization potential” I 


will then involve (on og |®o) 
the resulting new expression corresponding to the left-hand side of 
(15.9.18) by Z. You should then obtain the same expression as in 


(15.9.19) with -”simply replaced by .VZ, where the latter denotes 
the number of electrons per unit volume present in the medium. 


2 
instead, and is defined by dividing 
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Quantum Description of Relativistic Particles 


This chapter is involved with the method of providing a quantum de- 
scription of relativistic particles. One is confronted with the requirement of 
developing such a formalism, as imposed by nature, when the energy and mo- 
mentum of a particle are large enough so that the Schrédinger equation, with 
a non-relativistic kinetic energy, becomes inapplicable. A relativistic theory, 
as a result of the exchange that takes place between energy and matter, allows 
the creation of an unlimited number of particles and the number of particles in 
a physical process need not be conserved. An appropriate description of such 
physical processes for which a variable number of particles may be created or 
destroyed is provided by the very rich concept of a quantum field. For exam- 
ple, photon emissions and annihilations are explained by the introduction of 
the electromagnetic quantum field. The theory which emerges from extend- 
ing quantum physics to the relativistic regime is called Relativistic Quantum 
Field Theory or just Quantum Field Theory. Quantum Electrodynamics is 
an example of a quantum field theory and is the most precise theory devised 
by man when confronted with experiments. The essence of special relativity 
is that all inertial frames are completely equivalent in explaining a physical 
theory as one inertial frame cannot be distinguished from another. This in- 
variance property of physical theories in all inertial frames as required by 
the special theory of relativity is also readily implemented in quantum field 
theory. 

This chapter is not involved with the intricacies of quantum field the- 
ory, except for the moderate exception in §16.9, but with the details of the 
precursor of such a theory. It provides the bridge between quantum physics 
developed so far in the text and the theory of quantum fields. As a first step, 
we have to find replacements for Schroédinger’s equation when the energies and 
momenta of the particles to be described are too high for a non-relativistic 
treatment. This will lead us to the Klein-Gordon equation for spin 0 particles 
and the Dirac equation for spin 1/2 particles, and later on (§16.8) relativistic 
wave equations will be developed for higher spins as well. These equations 
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are Lorentz covariant in the sense that they have the same form in all iner- 
tial frames. In our presentation, massive as well as massless particles will be 
considered. 

We have already dealt with the Dirac equation in §7.4, 87.5 in describing 
relativistic corrections to the hydrogen atom spectrum. The relativistic wave 
equations predict the existence of negative energy states with negative mass, 
with energies going down to —oo implying the instability of the corresponding 
systems. Dirac, in his spin 1/2 theory, was led to postulate that the negative 
energy states are completely filled by electrons, according to the Pauli exclu- 
sion principle, so that no transitions to these states are possible and stability 
would be achieved. This vacuum state thickly populated by electrons has been 
called the Dirac sea. In 1930, he has interpreted the absence of an electron in 
the sea (a “hole”) as an anti-particle — the positron. The physics community 
found it difficult to accept his prediction until C. D. Anderson discovered the 
positron in 1932 who apparently was not aware of Dirac’s prediction! at the 
time of his discovery. It is also interesting to note that G. Gamow referred'to 
the electrons in the negative energy states, postulated by Dirac, as “donkey 
electrons” because they would move in the opposite direction of the applied 
force. It is important to realize that a theory which started with the hope of 
providing a quantum description of a single relativistic particle led eventually 
to a multi-particle theory. 

Historically, the Klein-Gordon equation was introduced before Dirac in- 
troduced his. Because of the emergence of negative energy states and the 
difficulty in defining a positive definite probability density, the Klein-Gordon 
equation was ignored until Dirac developed his formalism and led to his pre- 
diction of the anti-particle (the positron), and its discovery, at which time 
the importance of the Klein-Gordon equation was recognized and consistent 
field theories were formulated. 

Dirac’s theory provided tremendous insight into the nature of a combined 
theory consisting of quantum physics and relativity leading eventually to the 
birth of quantum field theory. As far as the achievements of the Dirac theory 
were, we may summarize by saying that besides being a Lorentz covariant 
theory and describes spin 1/2 particles, it also predicted approximately the 
gyromagnetic ratio g = 2 of the electron,” it gave the correct fine-structure of 
the hydrogen atom and, of course, led to the discovery of anti-matter. These 
achievements were so great, that apparently Dirac himself remarked? in one 
of his talks that his equation was more intelligent than its author. 

The first six sections of this chapter deal with the Dirac equation. Al- 
though some knowledge of the basics of special relativity is assumed of the 
reader, the intricacies of the so-called Lorentz transformations are spelled 
out in §16.2. Special emphasis is put on the concept of helicity in this latter 


' cf, Weisskopf (1980). 
? A correction to the g = 2 value was derived in §8.5. 
3 Weisskopf (1980). 
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reference, where one learns that helicity as the spin component taken parallel 
to a particle’s momentum, as the latter is determined in each of the corre- 
sponding inertial frames, has a Lorentz (relativistic) invariant meaning and 
is well defined for massive as well as for massless particles. §16.6 provides a 
detailed physical interpretation of the Dirac theory and, in particular, to the 
origin of the relativistic corrections to the hydrogen atom worked out in 87.4, 
87.5. The exact bound Coulomb problem is solved in the Appendix to §16.6. 
The Klein-Gordon equation and some equations which follow from it are the 
subject of §16.7. Relativistic wave equations for any mass M > 0 and higher 
spins are developed in §16.8. The last section, §16.9 deals with the spin and 
statistics connection whose importance can never be overemphasized. We use 
the ingenious approach due to Schwinger in examining typical Lagrangian 
densities in relativistic quantum field theories to investigate the nature of the 
spin and statistics connection. Here familiarity with Lagrangian dynamics is 
essential. In the appendix to this section a detailed treatment of the so-called 
action integral is given, however, for systems admitting Grassmann variables 
as well. 

The Spin and Statistics Theorem, in its simplest form, states that no 
two identical particles of half-odd integer spins (fermions) can occupy the 
same state, while any number of identical particles of integer spins (bosons) 
may do so without limitation. The practical effect of this theorem prevails 
over the whole of science and provides the basis for explaining the periodic 
table of elements from which our world and we are made of. Matter is stable 
because of the spin and statistics connection (the so-called Pauli exclusion 
principle) as applied to electrons as particles of spin 1/2. It also explains 
as to why matter occupies such a large volume of space.* We have seen in 
Chapter 14, §14.4 on The Collapse of “Bosonic Matter” as to what happens 
to matter if the spin and statistics connection (the Pauli exclusion principle) 
were abolished. In regard to matter without the spin and statistics connection, 
and before carrying out the analyses intended in this chapter, it is worth 
recalling? F. Dyson’s words: “Matter in bulk would collapse into a condensed 
high density phase. The assembly of any two macroscopic objects would release 
energy comparable to that of an atomic bomb.... Matter without the exclusion 
principle is unstable.”, and also quote from the translator’s Preface of the 
classic book® by S.-I. Tomonaga on The Story of Spin: The existence of spin, 
and the statistics associated with it, is the most subtle and ingenious design 
of Nature — without it the whole universe would collapse. 


4 See §14.4 and the Introduction to Chapter 14. 
5 See the Introduction to Chapter 14. 
5 T. Oka in the Translator’s Preface of the book by Tomonaga (1997). 
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16.1 The Dirac Equation and Pauli’s Fundamental 
Theorem 


Let us go over the method used in $7.4 to obtain the Dirac equation of 
spin 1/2. Using the constraint between the energy E', and the momentum p, 
of a free particle of mass M 


p?c? + M?ct- E=0, (16.1.1) 


where c denotes the speed of light, we formally obtain, upon the substitutions 
E — ihd/0t, p— —ihV, the equation 


3) 
ins W(x, 1) = (V-ReV? - Mc!) u(x, t) (16.1.2) 
where for spin 1/2, w(x, t) is a two-component spinor. The positive operator 


V/-Re2V? + M2c! is a well defined operator,” by Fourier transform, for 
example, and by using the relations 


a3 ' 
JV -ReV? + M24 6°(x — x’) =) oe /h22p?2 + M2c4 ep &-*)/h 


(27h) 
= K(x —x’) (16.1.3) 
w(x,t) = [ox 63(x — x’) w(x’, 0), (16.1.4) 
(16.1.2) leads to 
in x, t)= [ox K(x — x’) W(x’, t). (16.1.5) 


This equation is non-local in x. It is also not easily handed, in general, in 
the presence of interactions unless one is dealing with very weak interactions 
which would allow one to carry out an expansion under the square root. 
Accordingly, to obtain a more manageable equation, one may go back to 


(16.1.1), multiply the latter by Vere + M?ct+ E| to get 
(p?c? + Met — E?) b=0 (16.1.6) 
with E — ihd/0t, p — —ihV. Using the identity in (7.4.4), involving the 


Pauli matrices, and the defining coupled equations (7.4.6), (7.4.7), we obtain 


” That the square root of such an operator is well defined is not sufficiently em- 
phasized in the literature. For handling such a square-root operator in a mathe- 
matical rigorous way, see, for example, Daubechies and Lieb (1983). 
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h h 
(v4 ES + Me) w= (16.1.7) 
i ic Ot 


where W is a four-component spinor, and it is understood that Mc is multi- 
plied by the 4x4 unit matrix diag[1,1,1, 1]. The matrices 7°, y = (y', 77, 7) 


| 9 ‘3 


are given by (see also the Appendix to §7.4) 


se a ) . oe (16.1.8) 


where J in here is the 2 x 2 unit matrix, 0 = (01, 02,03) are the Pauli 
matrices, and the gamma matrices have the properties (4 = 0,1, 2,3) 


Tr(y)=0, ()l=7, (lay (xt)h = 797°, (16.1.9) 
(Y=, Yar, )=-7%, (=, (16.110) 
=, A) =-7, )o=a7, O)o=-7, (6.1.11) 


=r y=) =) =-5 (16.1.12) 


where I in (16.1.12) now is the 4 x 4 unit matrix. 
Using the notation, 


a) a 
ay Oi, §=1,2,3, =o (16.1.13) 


we may rewrite (16.1.7) in the form 


mn 
(2 On 4 =) v=0 (16.1.14) 


1 


with a summation over upper and lower repeated indices p = 0,1,2,3, un- 


derstood, or simply as 
O M 
(2+ x) v=0 (16.1.15) 


where yO = y“0,,. Equation (16.1.15) is the celebrated Dirac equation. 

Here we emphasize the positions of the indices y in 7" and 0,. This point 
will be clear when we study the relativistic invariance of the theory in the 
next section. 

To obtain the Dirac equation for a charged particle interacting with an 
extend electromagnetic field specified by ¢, A, we carry out the minimal 
substitutions ih0/Ot — ihO/Ot — ed, —ihV ihV — (e/c)A, where e is 
the charge of the particle in question. That is, we make the substitutions 


iis 5 Oi Megs, (16.1.16) 
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where® 
Ap = —4, A= (Aj, As, As) (16.1.17) 
to obtain from (16.1.14) 
re) e€ Mc 
1 A —|v= 16.1.1 
"(4 eAn) +9 | Toetd8) 
where again we emphasize the positions of the indices w = 0,1,2,3 in 
(16.1.18). 
The matrices y“ in (16.1.8) satisfy the key anti-commutation relations 
{4 ,y’} = —2g"” (16.1.19) 
where? 
—1000 
0 100 
pv) _ = 
0 001 


is the so-called Minkowski metric on which more will be said in the next 
section. 

The Dirac equations (16.1.14) may be equivalently rewritten in terms of 
other sets of gamma matrices 7" that satisfy the same anti-commutation 
relations (16.1.19) as the original ones in (16.1.8), in the form 


(ve ns x) w= (16.1.21) 
1 


for some new spinors W’ related simply to WV, thus showing the representation 
independence of the Dirac equation. That is, one may appropriately choose, in 
general, different sets of gamma matrices 7/" satisfying the anti-commutation 
relations in (16.1.19) in setting up the Dirac equation as well. This statement 
is the content of a theorem called Pauli’s Fundamental Theorem which is 
proved in the appendix to this section. It states that there exists a 4 x 4 
non-singular matrix G (i.e., det G # 0) such that 


o'" = Gy#G (16.1.22) 
Uv’ =GU (16.1.23) 


8 We will see in the next section that, consistently, A‘ = Ai, A? = Az, A® = As, 
A° = —Ap. 

° Several authors, such as J. Schwinger, S. Weinberg, W. Pauli,..., have adopted 
the definition in (16.1.20) for the metric. Equivalently, some other authors prefer 
to use their g’” as the negative of the one in (16.1.20), with their right-hand side 
of (16.1.19) being replaced by 2g"”. It is interesting to note that Dirac has used 
both (signatures) definitions, see [Dirac (1959, 2001)]. 
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relating (7/",W') to (y",W), which are consistent with (16.1.14), (16.1.21) 
with the latter obtained by multiplying the former from the left by G and 
inserting the identity G~' G = I just to the left of in (16.1.14). 

The matrices 7“ in (16.1.8) are said to provide the Dirac-Pauli represen- 
tation of the gamma matrices satisfying (16.1.19). Given the metric g'” as 
defined in (16.1.20), and the anti-commutation relation in (16.1.19), some re- 
lations involving gamma matrices which are representation independent are 
given in Table 16.1. In the Dirac-Pauli representation y°, defined in the Table, 
is given by 


p= (; ; (16.1.24) 


Table 16.1. Given the definition of the metric g’” in (16.1.20), and the 
anti-commutation relations of gamma matrices in (16.1.19), the Table gives 
some relations involving them which are representation independent. The 
Greek indices go over 0, 1, 2,3. 


[g”] =diag[-1,1,1,1], {7,7} =—294", 
qe’ =o T+ 5 [7], 

(Prat, (PP =-1, 1=1,2,3. 
dag a 

Quv "(97)" = 2”, 

Suv (VV) VY = 49", 

gv (PV) = 27? a7", 

[es [y7.7"| = 4(-y7 gt? — 7g), 

Telyey |= 4g", 

Be pyre | Sage g agg ag a"), 
Tr [odd number of +’s] = 0. 

For 7° = iy yy, (9°)? =1, {7°,7"} =0, 
(y"a,)? = —I[a? - (2°)?], (y+ a)? = —Ia?, 


O 1 . 
a = (a1, a2, a3), ao=—-a, a=a, i=1,2,3. 
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As an example of a G transformation which will find important applica- 
tions to problems with zero mass particles is the unitary matrix 


= (117). ot=c=(j 1) (16.1.25) 


leading to the so-called chiral representation in which y° is diagonal, 


10 0 —I p O°o 1 I 0 


where note that y’ coincides with the one in the Dirac-Pauli representation. 
Another useful representation is the so-called Majorana representation 
provided by the unitary matrix 


1 Io 
G=G1=Gt=— 7 16.1.27 
5 (e077 eae 

leading to 
10 0 a nd [02 0 
Ao (2 faa e oy (16.1.28) 
mi _ (103 0 12 (0 a2 13 (io, 0 
es ae q at) Y ee a) 


(16.1.29) 
in which 7" is diagonal, and (y’"/i)” = (7'""/i) which makes the Dirac 
operator (7/0/i+ Mc/h) real. 

As a final example, we introduce the unitary matrix of the form 


G=Icos6+y-nsiné (16.1.30) 

where n is a unit 3-vector, and @ is an arbitrary angle. Since y' = —y, 
(y-n)? = —I, we have 

Gi = G~' = Icos0—y-nsiné. (16.1.31) 


In Problem 16.1, the reader is asked to find the new representation of the 
gamma matrices for fixed n and 0. 

The matrix G in (16.1.30) will be also applied in $16.6 in a slightly dif- 
ferent context in studying, in the process, the physical content of the Dirac 
equation, where we use the fact that for any dimensionless non-zero three- 
vector a such that 


n= — (16.1.32) 


(16.1.33) 
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1/2 
1 
sind =|a 16.1.34 
al 2Va2 + 1(Va2+1+1) ( ) 
we have (see Problem 16.2) 
Gy (y-at1)G th =7Va2 +1. (16.1.35) 


Appendix to §16.1: Pauli’s Fundamental Theorem 


Consider the Dirac-Pauli representation of the gamma matrices y" in 
(16.1.8) satisfying the anti-commutation relations in (16.1.19). In the vector 
space generated by all 4 x 4 matrices, we may choose as bases the following 
set of 16 matrices constructed out of the 7: 


{Pal ={LP i PVP RVR i iti Pr, 
IPV iP iy ie iyi ier} 
mea, Sy ial ae 
(A-16.1.1) 


3 oe 3 
= Lette ints Pal roa 


j,k,m = 1,2,3, A=1,2,...,16, where y° is defined in (16.1.24). We note 
that for all A=1,...,16 


PS... “iy Shs (A-16.1.2) 


It is interesting to actually spell out the explicit forms of the ordered set 
of elements in {Ia}: 


rd={(01)- (04): Go) 30) 
(2.8): (88). (58). (5.2)}- aera 


The simplicity of the structure of the matrices in the set {I 4} involving first 
the unit matrix followed by matrices involving the Pauli ones, explains the 
ordering of the elements originally taken in (A-16.1.1). 
In particular, we note that for the product of any two elements I'4, Ig 
in {Pats we have 
I'alp=€Eapleo (A-16.1.4) 


for some numbers €4g = +1,+i, I is in {Ia}. Since every matrix I", is 
its own inverse (see (A-16.1.2)), we also have 
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Ip faa 050g" = (Fale) =p 
B 


To. (A-16.1.5) 


Now for matrices 7/", which satisfy the anti-commutation relations 
{o/ o/"} = 29, (A-16.1.6) 


We may, in one-to-one correspondence to the elements [4 in ere e defined 
in (A-16.1.1), introduce the ordered set 


i ikem pk 7m 
{l4} = {1 A ag any i: (A-16.1.7) 


Because of the representation independent properties of the gamma ma- 
trices in Table 16.1, we have 


Bmeigt. “eo (A-16.1.8) 
faB 
where the €4p are the same numbers appearing in (A-16.1.4), (A-16.1.5) for 
the corresponding elements I'4, Ig, Ic. 
The matrices in {Ta}, and similarly in eens are linearly independent. 
Given a 4 x 4 matrix U, we may introduce a matrix 


U= STAR (A-16.1.9) 
A 


Upon multiplying the latter from the left by Ig, and from the right by Ig, 
we obtain by using (A-16.1.4), (A-16.1.8) 


CCU rss SolGUIc = U (A-16.1.10) 
Cc 


where we have noted that the sum over A is equivalent to a sum over C 
corresponding to the generated matrices IG, Ic. That is, 


rgU =UTz. (A-16.1.11) 


It is easy to see that the linear independence of the matrices [4 implies 
that the matrix U may be chosen in (A-16.1.9) so that U is not the zero 


matrix. Accordingly, suppose that some matrix element (U ) ae of U is not 


ZEYO, 1.€., (U) ung 4 0 for some pair (ao, bo). Now introduce a matrix V 


such that all of its matrix elements (V) are zero except the one element 


(V) a, = 1/4a. 
Given the above constructed matrix V, we may introduce, in turn, the 
matrix 


ab 


Sy rev rs (A-16.1.12) 
A 
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Almost an identical analysis as the one carried out in going from (A-16.1.9) 
to (A-16.1.11) then shows that 


pV! =V'Tb. (A-16.1.13) 
Upon multiplying the latter from the right by U an using (A-16.1.11) give 
IgV'U =V'UTs, (A-16.1.14) 


for all B = 1,...,16. That is, V’'U commutes with all the elements of the 
set {Ia} in (A-16.1.1), which forms a complete set of 4 x 4 matrices, and 


hence V’ U is, necessarily, some multiple of the identity matrix J.1° This is 
the content of Schur’s Lemma, and we may write 


ViU=6I (A-16.1.15) 
for some number ( to be determined. 2 
Taking the trace of V’U in (A-16.1.15) and using the definition of V’ in 
(A-16.1.12) give 


48 =Tr [WU] = So Tr [Pa V4] 


=o [MVUry] 


= Sor [VU (La)? 


aA (A-16.1.16) 


where in writing the third equality we have, in the process, used (A-16.1.11). 
Hence @ = 1, and from (A-16.1.15), we may take G = U, G-!=V’ in the 


notation of (16.1.22). We multiply (A-16.1.13) from the left by G and choose 
Trp =y", [bg =7'", to obtain 


GyGt=7* (A-16.1.17) 


thus establishing the theorem. We note that G may be multiplied by any 
non-zero constant for the validity of (A-16.1.17). That is, G is defined up to 
an arbitrary finite multiplicative constant. 


10 Actually, it is sufficient that v' U commutes with all the y" for w = 0,1,2,3 
since this will imply that V’U commutes with all the matrices I'4 in {Ta} in 
(A-16.1.1). 
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A basic physical requirement which goes into the heart of the theory of 
relativity is that physical laws should be the same in all inertial frames. This 
means, in particular, that dynamical equations should not change as one goes 
from one inertial frame to another up to a mere relabelling of the variables 
of the underlying theories. Because of such relabellings of the variables, the 
equations are said to transform covariantly from one inertial frame to another 
and the corresponding rules of transformations are called, in general, Lorentz 
transformations. This will be spelled out for the Dirac equation below. In 
particular, we will develop the rules for the transformation of a Dirac spinor 
under pure Lorentz transformations (relativistic boosts) and under spatial 
rotations. 


16.2.1 Lorentz Transformations 


If an event is labelled by (t,x), in one inertial frame, then this same event 
will be labelled, say, by (t’,x’) in another inertial frame. The transformation 
rules which connect the different labellings (t,x) to (#’,x’), for the same 
event, for two inertial frames are called Lorentz Transformations. One inertial 
frame F’ may move with a uniform velocity v with respect to another frame 
F, as determined in F’, with a possible orientation of the Cartesian space 
coordinate axes of F” as also determined in F' at time t = 0. Ifa (t = 0,x = 0) 
reading in F’ corresponds to a (t’ = 0,x’ = 0) one in F”, then the underlying 
Lorentz Transformations are called homogeneous ones, otherwise they are 
called inhomogeneous. In the former case, the origins of the coordinate axes 
set up by F and F” coincide at the time readings t = 0, t/ = 0 by observers 
located at the corresponding origins of the respective coordinate systems of 
F and F’. 

We use the notation 2° = ct, x = (z!,27,23), (x") = (2°,x) = 2, p= 
0,1,2,3. In Euclidean space, the distance squared between two points labelled 
by x and y in a given coordinate system remains invariant under rotation of 
the coordinate system for which the labelling of the points x, y change to, 
say, x’, y’, ie., (x — y)? = (x’ — y’)?. Similarly, for Lorentz transformations 
x,y —2',y’, the following quadratic form remains invariant 


0 


saya aur aay lai =")? (16.2.1) 
which one may conveniently rewrite as 
(2! — y")guv(2” —y”) = invariant (16.2.2) 


where gy = Jv is referred to as the Minkowski metric and the matrix [g,..], 
with matrix elements g,,, = g"”, u,v = 0,1, 2,3, is defined in (16.1.20). One 
may set 

Iuvt" = Ly, (16.2.3) 
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and rewrite the left-hand side of (16.2.2) simply as (x" — y")(tp — Yy). That 
is 9,» may be used to lower the index v in x”. We note, in particular, that 
(16.2.3) implies that 


typ =—2°, 2, =2',1=1,2,3 (16.2.4) 


One may raise the index jz in x, by multiplying it by g’”, ie., 2” = tpg”, 
where g"” is numerically equal to g,,. We also note that 


Govg’™ = ne (16.2.5) 


where [5,“] = diag[1, 1,1, 1] is the identity 4 x 4 matrix. 


? ? ” 
Under a homogeneous Lorentz transformation x — x’, we have 


al! = AM a” (16.2.6) 

For a pure boost along the z-axis, for example, x/° = n(x°—2v/c), a!* = !, 
a! = 2?, x2! = n(x — vt), and hence (1 = (1 — v?/c?)~/) 

A’; = 6°; + (n —1)65;3 (16.2.7) 

Mo=n, Mo=—d"nu/e, A: =—banu/e (16.2.8) 


More generally, the elements A“,,, depending on an arbitrary uniform relative 

velocity v of F” relative to F', as determined in F’,, and an arbitrary angle of 

rotation y about an arbitrary unit three-vector n, are given in Table 16.2. 
From (16.2.6), 


= MH, 16.2.9 
ae ( ) 

since A“, is independent of x, and from the chain rule 

O Ox’! O 
= 16.2.10 
Ox’ — Ox” Ox! ( ) 
we obtain the simple rule, to be used below, that 

Oy = MOOG; (16.2.11) 


where 0, = 0/0x” in the notation in (16.1.13). 
From the inverse transformation x’ — «x to the one in (16.2.6), it is readily 
shown that 
Ave = (A71)HY (16.2.12) 


and hence 
AP RO Se GPP, (16.2.13) 
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Table 16.2. Explicit expressions for the elements A“, (including the infin- 
itesimal ones for small 5v, dy) of the homogeneous Lorentz transformations 
in (16.2.6). The rotation matrix elements R” are given in (2.1.4). Note that 
AY = A’, (see (16.2.4)), 1,7 = 1,2,3; u,v =0,1,2,3. 


AYE (A7tyHY AP,» 6, + bw, 
AP, AY = ge’ bw” = —bw"" 
ij ij ik v'y ij ijk, k 
AY = RY +(n-1)R 2 bw = en" dp 
2\-1/2 
A’ =n= (1-%) wo =0 
c 
A°, =—n Uv Te = _ bu 
c Cc 
5 v i bu’ 
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From (16.2.11), we may replace 0, by A”, Of, 
(16.1.14) to obtain 


m 
eS (7 + x) ve (carat oe mn *) v (16.2.14) 


in the Dirac equation 


Let 
Mane Sar (16.2.15) 


and on account of (16.2.13), 
{7",7'?} = =QAVA APs 
= —2g"? (16.2.16) 


Hence from Pauli’s Fundamental Theorem in the last section, there exists a 
non-singular matrix U such that 


pees Oita a 6 (16.2.17) 


Here we find it more convenient to use U~! for G in our previous notation 


in §16.1. 
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Upon multiplying (16.2.14) from the left by U, using (16.2.15), (16.2.17), 
and setting 


UW(x) = W'(2’), (16.2.18) 
where x’ is defined in terms of x in (16.2.6), we obtain 
vA M 
(2 oe = W(x’) =0 (16.2.19) 


This amounts into a mere relabelling of x and W in the original Dirac equation 
in (16.1.14). Pauli’s Fundamental Theorem guarantees the existence of a non- 
singular matrix U satisfying (16.2.17) such that (16.2.19) holds true. 

For example, for transformations implemented by A”,, corresponding to 
pure boosts (RY? — 6”), (16.2.15), (16.2.17) lead from the expression for A” ,, 
in Table 16.2, (i = 1,2,3) 


Biy-B 
GB? 


n(y° —y+B) =U-*y°U (16.2.21) 


(i+ 1) ) nBry? =U y'U (16.2.20) 


where we have set v'/c = 3°, |v| /c = B."! Multiplying (16.2.20) by N* = 
B'/B gives 
ny:-N-—7@7° =U7~'y-NU (16.2.22) 


We will see that (16.2.21), (16.2.22) are solved for U by setting 
v 
U = Icosh : —7°y-Nsinh 7 U1! = Icosh +7°y-Nsinh 5 (16.2.23) 


where ¥ is to be determined, and (16.2.23) is consistent with the identity to 
be shown later 


Ut = 7U-1y (16.2.24) 
Upon substitution of the expression for U in (16.2.20)—(16.2.21) gives 
0 0 
cosh” 5 + sinh? x cosh ¥ = 7 (16.2.25) 
., 0 ae 
2 sinh cosh an sinh ¥ = 78 (16.2.26) 
or 
9 = In(n8 +n) 
eae 
= In| 2 —— 16.2.2 
n ( nG+ ir 5] (16.2.27) 


" Note that @ is a standard relativistic notation for |v|/c and should not be con- 
fused with the notation for 7° sometimes used in the literature. 


896 16 Quantum Description of Relativistic Particles 


Using the facts that (7°43)? = J, and that for an arbitrary matrix A, cosh A— 
sinh A = exp(—A), we obtain from (16.2.23) 


Up = exp (-5 1s) (16.2.28) 
where B corresponds to boosts. 

In (16.2.20)—(16.2.28), v denotes the velocity of the frame F” relative to 
the frame F' as determined in F’. 

On the other hand for a pure spatial rotation by an angle y about a unit 
three-vector n, A’) = RY (see (2.1.4)), A°9o = 1, A°; = 0, A*o = 0, and 
almost an identical analysis as given above for pure boosts, with hyperbolic 
functions replaced, in the process, by trigonometric ones gives 


Up = exp (5 pn: =) (16.2.29) 
where 
_f{990)\_ 56 Tag ee, ee ee 
z= & 2 =77°y, Ei = ge [y*,¥'] (16.2.30) 


which should be compared with the rotations of spinors in (2.8.1) discussed 
there in a non-relativistic context. Since 


(n-S)°=2,  (n-B)*=n-d (16.2.31) 


we note again from (16.2.29) the double-valuedness of the spinor under ro- 
tation with 47 rather than 27 radians to return the spinor to its original 
state. 

Since } is Hermitian, Up is clearly unitary, this is unlike Ug for a pure 
boost given in (16.2.28) where for the latter U, # Uz’. Up, however, sat- 
isfies the relation in (16.2.24) (see also (16.2.53)). Since yY9H7° = EB, Ur, 
apart from being unitary, also satisfies the relation in (16.2.24) with U in it 
replaced by Up. We will use this fact in (16.2.61) to formulate quite generally 
the invariance of the scalar product of spinors under homogeneous Lorentz 
transformations, where 7° is introduced and so designed to make up for the 
non-unitary character of Ug without spoiling the unitary property of Up. 

We will encounter the matrix © again in (16.2.46) and in sections to follow 
as well, and, as expected from its expression in (16.2.30), it is associated with 
spin. 

For more general transformations given in Table 16.1, but for infinitesimal 
ones we have 

AY OM + bw", Sw” = —bwh” (16.2.32) 


where dw”, — 0 for v — 0, y — 0. Since in this limit, U — J, we may also 
set 
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Unxtt+ 5p oY Suv (16.2.33) 
Ux T- 5p oY Suv (16.2.34) 


where S\,, is to be determined. 
By equating (16.2.15), (16.2.17) and using (16.2.32)—(16.2.34), we are led 
to 


buy, = —5, 50 [Sxen¥”| (16.2.35) 


Since dw”, is anti-symmetric, the left-hand side of (16.2.35) may rewritten 


as 
Sur! 


2 
which upon comparison with the right-hand side of (16.2.35) gives 


1 
5 (6" bu # yy, + 6” ,bwhry) = (OY XY — 8’ Ir) (16.2.36) 
[Sy] = ih (g’’y" — gh’) (16.2.37) 
The solution is readily verified to be given by 
ih 
gu = + [74]. (16.2.38) 
To find the interpretation of this matrix we first note from (16.2.6), 


(16.2.32) 
g'h ~ oh + bw ya” (16.2.39) 


Hence from (16.2.18), we have 


W’ (2? + Sw? 2%) ~ (1 + ae” Sy) U(r") (16.2.40) 
or : 
1 

W' (a?) & (1 + 5550S W (x? — dw? yx”) (16.2.41) 


Now we may carry out a Taylor expansion of WV in dw on the right-hand 
side of (16.2.41) to get 


W(x) ~ U(x) + 5p oe uv V(@) (16.2.42) 


where 


h A 
Juv = Suv + (x40, = 120) (16.2.43) 
In particular for 7,7,k = 1, 2,3, 


1 
Ji = geigk Tj 
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=§,+L; (16.2.44) 


where L is the orbital angular momentum and 


1 
Si = 5 Eijk Sik 


ih 
= g eigk [yis Ye] (16.2.45) 
is identified with the spin. The remaining part of J,,,, that is Jio, is usually 
referred to as the “booster”. 
In the notation of (16.2.30), we may rewrite the spin S as 


h 1/1 
=> 2 AF (2 41/7 16.2.4 
sa-'s, stow (L431) (6.2.46) 


Upon setting P“ = hd" /i, the following commutations relations are read- 
ily derived 


[PoP a0 (16.2.47) 
[P4, JA] = ih (gt P” — gt” P*) (16.2.48) 
[FPP seed OP | ae (GAO fhe gh fg Pt FAO gAt JP) (16.2.49) 


establishing the algebra of the generators of the inhomogeneous Lorentz 
transformations, where the commutation relation of any two of the gener- 
ators gives rise to a linear combination of the generators. 


16.2.3 Lorentz Invariant Scalar Products of Spinors, Lorentz 
Scalars and Lorentz Vectors 


We will establish the identity in (16.2.24) and consider some of its conse- 
quences. 
Using the reality of A”,, in (16.2.15), and hence the identity 


of” =P (ory? (16.2.50) 
as obtained from (16.1.9), we have from (16.2.50) 
—liv —liv T 
U-ly aa hy U) 7° 
= Ut yoy"? (ut) (16.2.51) 
where we have used (16.1.9), (16.1.12), or 


NE (Uy°Uty°) om (Uy uty)" (16.2.52) 
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This in turn implies that 
’ (U7y°Uty°) = (UyUty°) y¥ (16.2.53) 


That is, Uy°Uty° commutes with all the y” and hence the former is some 
multiple of the unit matrix (see also Appendix to §16.1) 


(UUly°) = al (16.2.54) 
From (16.2.54), we may also write 

Ut = ay°U- 17° (16.2.55) 
Upon substitution of the expressions for 

ut ~r— +4," st (16.2.56) 
2h a 

and U~", given in (16.2.34), for infinitesimal transformations, where we recall 
that dw” is real, in (16.2.55), and using the property 

SiS Sa (16.2.57) 


as follows from (16.2.38), give a = 1. Thus from (16.2.55) we have established 
(16.2.24). 

Now we may formulate the Lorentz invariant scalar product of two spinors 
W(x), x(x) under homogeneous Lorentz transformations in analogy to the 
invariance of the quadratic form 


xe" guvy’ = invariant (16.2.58) 


in Minkowski spacetime. To this end, we note from (16.2.18) that under a 
homogeneous Lorentz transformation U(x) — W’(2’), x(x) — x/(2’), there 
exists a non-singular matrix U such that 


W'(2') = UV(2), v(e') =O x(a) (16.2.59) 
and from (16.2.24) that Ut = y°U~1y°. The later implies tat 
Ut U = 7°. (16.2.60) 
Hence for the combination ©’ (’)7°x'(x’), we have 
W' (2! )y°x/(a!) = Ut (a)Ut YU x(a) = Wt (a)7°x(c). (16.2.61) 


That is, 
Wl (x)y°x(x) = invariant (16.2.62) 
The presence of y° in (16.2.62), which is in analogy to the metric in 
(16.2.58), stems from the fact that although the corresponding transforma- 
tion for pure spatial rotations Up, given in (16.2.29), is unitary UL = US 
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but nevertheless satisfies the relation UL = 7°UR'7° in (16.2.24), the one 
involving a pure homogeneous Lorentz transformation (a boost) Ug, given in 
(16.2.28), is not unitary Uf, 4 Ug! but satisfies the relation UL = y°Upz!y° 
in (16.2.24). This explains the essential presence of 7° in (16.2.62) to make 
up for this distinction between the two transformations and, in turn, ensures 
the invariance property stated in (16.2.62). Because of the importance of the 
combination Wty°, one introduces a special notation for it 


Wi (a2) = (16.2.63) 


In particular, the invariant combination W(x)W(x) is referred to as a 
Lorentz scalar. 
Similarly, by using in the process (16.2.15), (16.2.17), we obtain 


(W'(a")y"V' (2!) = AM, (U(x) V(2)) (16.2.64) 


(see Problem 16.5). That is, U(x)y”W(a) transforms as x” in (16.2.6) and is 
referred to as a Lorentz vector. 


16.3 Spin, Helicity and P, C, TJ Transformations 


In the next section, we will see that neither the spin S nor the orbital 
angular momentum L introduced in (16.2.44) are conserved but the total 
angular momentum J, however, is. The spin component parallel to the mo- 
mentum of a particle is referred to as the helicity (see also §5.10). The helicity 
as the component of the transformed spin taken parallel to the momentum 
of the particle in question, with the momentum as determined in each cor- 
responding inertial frame, has a Lorentz invariant meaning. The concept of 
helicity is of utmost importance when dealing with massless particles as spin 
measurement along arbitrary directions has no meaning for such particles. 
The application of helicity to massive Dirac particles will be given in the 
next section and to massless ones in $816.5. The investigation of helicity for 
massless particles of higher spins turns out to be quite interesting and will 
be discussed in §16.8. In the present section, we also study the nature of the 
Dirac equation under parity P, time reversal T (see also §2.6), and under the 
so-called charge conjugation C whose physical meaning will emerge later in 


§16.6. 
16.3.1 Spin & Helicity 


The spin matrix S was introduced in (16.2.44)—(16.2.46), and from 
(16.2.30) is given by 


_h(o0\_fh 5 9 
s=3(5 a) = ety. (16.3.1) 
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This may be taken as the spin operator for a massive particle in its rest frame. 
In a momentum description (§16.4), we may define the spin in the frame 
in which observations are made (the “laboratory”) and in which the particle 
in question is moving with momentum p by applying the reversed boost 
operation given by the transformation matrix U in (16.2.23), (16.2.17), as 
reformulated in the present context, where now v in it represents the particle’s 
velocity. That is, we have to find the matrix USU~! = S’ rather than U-'SU 
and the reason for applying the former operation than the latter is, as just 
mentioned, we are going from the particle’s frame to the observer’s frame. 
To the above end 


h 0 
S’ = { Icosh ae 7°y-Nsinh oe 7°7°y { Icosh : +7°y+Nsinh — 
2 5/3 Dy 2 
(16.3.2) 
(see (16.2.23)), where p= Mnv, p® = Mnc, n= (1 — v?/c?)-V/? 


N=2 PP =/p+iPe (16.3.3) 


Ip| 
and from (16.2.25), (16.2.26), 


0 
P IP| 
cosh 3 Me’ sinh 0 Me (16.3.4) 
_, 0 p? — Mec 0 pi + Mc Wy) |p| 
=} h - = SS sh - = h a s 
SS Vite 2 a“) ome. 6 ap ee 
(16.3.5) 
The expression on the right-hand side of (16.3.2) simplifies to 
9 S- h 
gi - Pg_ _P(S:P) Pls ty(y-N)+N]. (16.3.6) 


~ Me Mc(p°+ Mc) 2Mc 


The last term on the right-hand side of this equation may be expressed in 
terms of $° defined in (16.2.38) (see Problem 16.6). 

We consider the components of S$’ = ($4),8‘,) parallel and perpendicular 
to p. For Si), we have 


Si = S| » Pe (16.3.7) 
Mc Mc(p® + Mc) 


and the coefficient of Sj is one, giving 


Si = S\- (16.3.8) 


This result is remarkable. It states that the component of spin parallel to p 
has a Lorentz invariant meaning and quite importantly it is independent of 
the mass of the particle and hence exists rigorously for M — 0 as well. It 
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is also Hermitian. In any other inertial frame, say, F”’, if the momentum of 
the particle in question is determined to be p”, then the component of the 
transformed spin S$” parallel now to p” will also satisfy S{{= S. 

For the component perpendicular to p, 


0 
, _ Pp h Ip| 5 ; 
0 
P i |p| 


and S’, 4 S,, that is, it is frame dependent. It has no zero mass limit, and, 
as is easily verified, the second term is not Hermitian. 

Summarizing then, we have found the privileged direction for spin mea- 
surement, to be given by the direction of momentum of the particle itself, with 
the momentum as determined in each of the corresponding inertial frames, 
and has a Lorentz invariant meaning, and is applicable both for massive as 
well as for massless particles and the corresponding operator is Hermitian. 
Applications of this important result will be given in the forthcoming sections. 


16.3.2 P, C, T Transformations 


Let 2’ = (of —x), then we may rewrite the Dirac equation (16.1.14) as 


-o 7a M 
( 0: (16.3.10) 
i i h 
Multiplying the latter from the left by 7° gives 
Oo M 
(2 : = U(x) = 0. (16.3.11) 
1 
Hence we may set 
W’(2') = np y°U(2) (16.3.12) 


and define the parity transformation by 
P: U(r) — np 7yV(z2') (16.3.13) 


where 7p is a phase factor. 

The products U(x)y? U(x) and V(x)7°7"V (x) transform, respectively, as 
a pseudo-scalar and a pseudo-vector. 

We consider next time reversal and this analysis will be followed by charge 
conjugation. Let 2’ = (—x°,x), then 


“ / 0 7 0 
(2 Ot ) U(x) =0. (16.3.14) 
T 1 
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Upon multiplying the latter from the left by y'y?K, where K denotes the 
operation of complex conjugation, and making use of the fact that in the 
Dirac-Pauli representation 7°, 1, y? are real and (y?)* = —7? (see (16.1.10)), 
it is easy to verify that 


1a 
( Zp =) yy KU(x) =0 (16.3.15) 


and we may set 


W' (a2’) = nr yy" (2) (16.3.16) 
and define the time reversal transformation by 
T: U(r) — nr y' PV" (2’) (16.3.17) 


up to a phase factor nr. 

For the so-called charge conjugation transformation, we consider the Dirac 
equation in (16.1.18) in a given external electromagnetic field A,,(x) given in 
(16.1.17). We first take the complex conjugate of the equation (16.1.18) to 
obtain from (16.1.10), 


Or e 0 e€ Mc 
aN | | 2 jl * = 
| 7 (3 Ay) ty (3 fa) + jv (z)=0 (16.3.18) 


1 1 


where here \ = 0,1,3. Multiplying this equation from the left by iy? gives in 
the Dirac-Pauli representation!” 


Lo" (FE ede) +A] Go) <0. (163.19) 


We may then define charge conjugation by 


T 


C: U(x) — We(2) = ne (i7’°) © (2), (16.3.20) 


where 7c is a phase factor, and we have used the fact that ou! = 


4° (iy)! = W*. That is, for a given external electromagnetic field, Ve (x) 
satisfies the same equation as (a) with the sign of the charge e in it simply 
reversed e — —e. The physical significance of this will be discussed in §16.6. 


16.4 General Solution of the Dirac Equation 
We consider the general solution of the Dirac equation 
O M 

(2 + x) U(x) =0 (16.4.1) 


'2 The i factor in iy? is chosen for convenience to make (io) a real matrix. 
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x = (x°,x), 2° = ct. We carry out a four dimensional Fourier transform 


U(x) = [oa eiPt/h Y(p) (16.4.2) 


with the Lorentz invariant measure 
(dp) = dp°dp'dp?dp? (16.4.3) 
and the Lorentz scalar 
pr = p-x— px", (16.4.4) 
p = (p°, p). From (16.4.1), (16.4.2), we have 


(yp + Mc) U(p) =0 (16.4.5) 


where yp = ¥“pu = Y-P+7°po = Y: p— 7°p®, and we note that in the 
Dirac-Pauli representation given in (16.1.8), no i factors appear in (16.4.5) 
in the p-description multiplying 7" 

Upon multiplying (16.4.5) from the left by (—yp + Mc) gives 


(p? + M?c”) U(p) = 0 (16.4.6) 

and the solution is of the form 
W(p) = 5(p* + M*c’) &(p) (16.4.7) 
where 6(p? + M?c?) is the Dirac delta distribution, and we have noted that 


af(a) = 0 implies that f(a) = g(a)d(a) = g(0)d(a). Since p? + M?c? = 
p? + M2c? — p°”, we may use the well known property 


6 (v9 - /p? + M?e?) +5 (v9 + Vp? + Me?) 
2\/p? + M22 


leading to two parts contributing to U(p), one for p° > 0 and one for p® < 0. 
Equivalently, (16.4.5) has a non-trivial solution only if yp+Mc has no inverse, 
ie., det(yp + Mc) = M2c? — p? + p? = 0. 

To obtain a more symmetrical expression for w(p), we make the transfor- 
mation p — —p corresponding to p° < 0, and integrate over p° in (16.4.2)) 
using the constraints imposed by the two deltas in (16.4.8). This gives the 
general structure 


6(p” + M?c?) — 


(16.4.8) 


W(x) = fate duy [e*/",(p) + e""/"b_() (16.4.9) 


using a rather standard notation for the Lorentz invariant measure 


dp 1 4 
— 0 16.4.10 
(27h)? 2p’ oe ( ) 


dw,p = 
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as obtained from the Lorentz invariant measure (dp) and the Lorentz invari- 
ant delta distribution 6(p? + Mc”), where now, and from now on, 


p° = ++/p? + M2c?, (16.4.11) 


px is defined in (16.4.4), and p° = —po as always. The 2Mc factor multiplying 
dw, in (16.4.9) is introduced for convenience. The restriction p? + M?c? = 0, 
ie., 

p= —-M?? (16.4.12) 


as provided by 6(p? + M?c?) in (16.4.8) is a restriction on the mass shell 
relating p° and p. Also, ®(p), ®_(p) satisfy the equations 


(yp + Mc) ®,(p) =0 (16.4.13) 


(—7p + Mc)®_(p) = 0. (16.4.14) 


Now it is straightforward to solve these two equations. We consider 
(16.4.13) first. 
For!? p=0, p® = Mc, and (16.4.13) gives 


(I — 7°) ®4(0) =0. (16.4.15) 


From the expression for y° in (16.1.8), we see that I — 7° = diag[0,0, 2, 2], 
and hence the last two entries of ®,(0) must be zero, ie., 


a 
©,(0,0) = (16.4.16) 
0 


which has only two rows and two independent (orthogonal) solutions may be 
chosen 


flee) = deo (16.4.17) 
such as (10)', (01). 
Clearly, the solution of (16.4.13) is then 
®,(p,o) = (-yp + Mc)®, (0,0), (16.4.18) 


with ©(0, o) given in (16.4.16), since (yp+Mc) (—yp+Mc) = (p? + M?c*) = 
0. 

Therefore, up to an arbitrary (one component) function a(p,o) of (p,c), 
we have from (16.4.18) 


,(p,o) = u(p, 7) a(p,c) (16.4.19) 


'3 Here we assume M #4 0. The mass zero case will be considered in detail in the 
next section. 
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where u(p,c) is taken the property normalized four component object 


0 +4 M So 
Pp Cc 
u(p,o) = Sie esis , (16.4.20) 
p? + Mc °° 


corresponding to two solutions in (16.4.16). The normalization condition 
adopted is given by 


u'(p,o)7°u(p, 0’) = u(p,o) u(p, 0’) = doo! (16.4.21) 


where 
u(p,o) = ul(p,a)7. (16.4.22) 
With such a normalization given in (16.4.21), 


0 


ul(p,o) u(p, 0’) = 7 650" (16.4.23) 
Mc 
and obviously 
(yp + Mc)u(p,o) = 0. (16.4.24) 
Also 
u(p,o)(yp + Mc) =0 (16.4.25) 


A more interesting way of deriving (16.4.20) is to apply the booster op- 
eration U~! with N = p/|p| to u(0,c) = (& 0)" to obtain u(p,c). As 
mentioned in the previous section, the reason why we must apply U~! rather 
than U is that the latter gives rise to the motion of the observation frame 
(the “laboratory”) with respect to the rest frame of the particle in question, 
while we are interested in carrying out observations on the particle itself (in 
the “laboratory” frame). 

To the above end, with (B = v/c) 


Bp 
Ne .3 16.4.26 
8 [pl 
p= Mnv (16.4.27) 
yw —1/2 
(1 oa =) (16.4.28) 


and with the expressions for functions of J in (16.3.4), (16.3.5), 


fd * bo 


v 
aif} _ Brace 
U 4 = cosh 5 o-p (16.4.29) 
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which upon normalization as in (16.4.21) gives (16.4.20). It is important to 
emphasize such that such a matrix U may be defined afresh in a momentum 
description where the two frames under consideration are the particle’s (for 
M # 0), and the observation frame. Massless particles will be described in 
the next section. 

Given the solution u(p,c) in (16.4.20) of (16.4.24), we may introduce its 
charge conjugate transform obtained from (16.3.20) given by 


o(p,0) =i? ul (p,c) (16.4.30) 


up to a phase factor, which upon using the identity 


o20' =—002 (16.4.31) 
gives 
o-p g! 
__ [p®°+ Me | p°+ Mc ~% 
v(p,o) = / Mc (16.4.32) 
él 
where 
et = (—ion) és (16.4.33) 


for the corresponding €, in (16.4.20), and 
Ele, = beg, (16.4.34) 
v(p, 7) satisfies the equation (16.4.14), i.e.,!4 
(—yp + Mc)v(p, a) =0, U(p, o)(—Yp + Mc) =0 (16.4.35) 
and u(p,c) = v'(p,c) 7°. It satisfies the normalization condition 
U(p,o) u(p,o’) = dg" (16.4.36) 
and 


0 
u'(p,o) u(p,o’) = P— dgqr. (16.4.37) 
Mc 


Note the minus sign in (16.4.36). 
We also have the orthogonality conditions 


u(p, 7) v(p, a’) = 0 (16.4.38) 


for both values of o, o’ adopted. From (16.4.20), (16.1.8), we note that 
U(p,o7) = u'(—p,c), and we may rewrite (16.4.38) as 


M Note that for p= 0, (I+°)v(0,o) = 0 and hence the first two entries of v(0,c) 
must vanish and is consistent with the expression in (16.4.32) as expected. 
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ul(—p,o)u(p, 0’) = 0 (16.4.39) 


The action of (—io2) on €* in (16.4.33) is clear. If the €, in (16.4.20) are 
given by 


cos ; 7 i¢/2 —sin ; pets 
Ein = F » €N= : (16.4.40) 
sin 5 ei?/2 cos 5 ei?/2 
(see (8.1.16)) with a unit three-vector 
N= (sin 0 cos ¢, sin @ sin ¢, cos 0) (16.4.41) 


then 


Elin = (-ioa) fin = 4EEN. (16.4.42) 


That is, in particular, (—io2) acting on £7. reverses the direction of spin. 
The solutions ®_(p,o) of (16.4.14) are given by 


&_(p,c) = v(p,c) b*(p,c) (16.4.43) 


with b*(p,o) an arbitrary function of (p,o). 

We have thus obtained four orthogonal spinors u(p,c), v(p,o) two for 
each value taken by co in the sense of (16.4.21), (16.4.36), (16.4.38). 

The general solution of (16.4.1) may be then written from (16.4.2), 
(16.4.9), (16.4.19), (16.4.43), (16.4.20), (16.4.32) as 


U(r) = S- [ome dw, [e*/"u(p, a) a(p,o) +e '?*/"y(p, o) b*(p,c) 


(16.4.4) 
For the moment, if we adopt the normalization condition! 


~ [ox Wi (a) W(e) =1 (16.4.45) 


thus providing, in the process, specific units for V, then (16.4.23), (16.4.37), 
(16.4.39) together with (16.4.45) gives the following restriction on the coeffi- 
cients a(p,a), 6*(p,o) 


ye 2 |2Me dw, [|a(p, o)|? + |b(p, 2) |7] = 1 (16.4.46) 


'® The alert reader might wonder why we have not considered the Lorentz scalar 
Ui (a) y°U(x) (see (16.2.62)) instead of U1 (x) W(x) in (16.4.45). The reason is 
that neither U1 (x) U(x) nor d*x are Lorentz invariant but their product is. More 
will be said about the normalization condition adopted in (16.4.45) in §16.6. 
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where because of the normalization factor p°/Mc in (16.4.23), (16.4.37), the 
2Mc factor multiplying dw, appears again in (16.4.46). 

The Hamiltonian may be obtained from (16.4.1) by multiplying the latter 
by —hey® giving 


ih (a) = 40 (ar a Mc) V(x) = HV(a). (16.4.47) 


We may rewrite (16.4.44) as 


V(r) = oS [ome du, eiP*/h le“ 2"/Bu(p, a) a(p,c) 


+ eiP’@*/hy(_p, a) b*(—p,a)]  (16.4.48) 
to obtain the two equations 
Hu(p,o) = p°cu(p,o) (16.4.49) 
Hv(—p, co) = —p°cv(—p,c) (16.4.50) 
where the Hamiltonian in the p-description is given by 


Mc? co-p 
H =~ (cy: p+ Mc’) = (16.4.51) 
co-p —Mec? 


with u(p,a) and v(—p, oc) corresponding, respectively, to positive and nega- 
tive energies Ex = +p%c. The significance of a negative energy solution will 
be discussed in $16.6. Equations (16.4.49), (16.4.50) also follows directly from 
(16.4.24), (16.4.35) if one uses the definitions of H in (16.4.51). 

We consider the parity operation in (16.3.13) as applied to u(p,), v(p,c), 
giving 


Pu(p, 7) = np u(—p,c) (16.4.52) 
Pu(p, 7) = —np v(—-P, ¢) (16.4.53) 


Without loss of generality we may set np = +1 or —1, and the numerical 
factor np chosen is referred to as the relative, intrinsic parity of a particle. 
Equations (16.4.52), (16.4.53) show that the intrinsic parities associated with 
the negative energy solution is opposite to the one adopted for the positive 
energy. The interpretation of this will be also discussed in $16.6 when studying 
the particle content of the theory. 

For the spin related matrix © in (16.2.30), (16.2.46) we note that 


x-Nu(0,c) = tu(0,0) (16.4.54) 
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for €, corresponding to €4n, respectively, in (16.4.40), while 


x-Nv(0,c) = Fv(0,¢). (16.4.55) 


On the other hand for N taken along p, i.e., N = p/|p|, 


5 €4N 
p+ Mc 
= 4/ > 16.4. 
u(p, 7) Ime |, Ip ; (16.4.56) 
~ p°+ Me =“ 
with €, corresponding, respectively, to £:n, and 
a eae 
pi + Me p? + Mc °F 
= 4p 16.4.57 
v(p, 2) VE ( ) 
+é.N 


For the helicity S| (see (16.3.8)), with $) = hX'\/2, and with u(p,c), v(p,c) 
given above in (16.4.56), (16.4.57), 


di u(p, a) = +u(p, c) (16.4.58) 


+} u(p, 7) = Fu(p, c). (16.4.59) 
Unlike the non-relativistic case 
[H ,S] £0, [(H ,L] 40 (16.4.60) 


where the spin S and orbital angular momentum L are defined in (16.2.44)— 
(16.2.46), but the total angular momentum J = L +S is conserved, i.e., (see 
Problem 16.9) 


[H,J] =0. (16.4.61) 
The spin component $j parallel to p, i.e., the helicity (§16.3, (16.3.8)) 
has the distinction that it commutes with H: 


[H,S)] =0, (16.4.62) 


as is readily verified. 
We introduce the following matrices 


Py (p) = > up, c) ap, 0) (16.4.63) 


P_(p) = —S > v(p,¢) W(p,o). (16.4.64) 


They are orthogonal projection operations, i.e., they satisfy the following 
equations 
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P,(p) Py(p)=Py(p), —- P_(p) P_(p) = P_(p) (16.4.65) 
P,(p) P_(p) = 0 = P_(p) P;(p). (16.4.6) 

They project out, respectively, positive and negative energy solutions 
P.(p)u(p,c)=ulp,c), Py (p) o(p, 7) = 9 (16.4.67) 
P_(p)v(p,o)=v(p,c),  -P_(p) u(p,o) = 0. (16.4.68) 


The explicit expressions for these operators may be worked out directly 
from their definitions in (16.4.63), (16.4.64) and are given by 


(—yp + Mc) 
P = + 16.4. 
.(p) = (16.4.69) 
(yp + Mc) 
P_(p) = ~———— 16.4. 
(p) = 4 (16.4.70) 
and satisfy the completeness relation 
Pi (p) + P_(p) =I. (16.4.71) 


We close this section by noting that u(p, o) in (16.4.20) may be rewritten, 
in general, as 
bo 
po + Mc 
2Mc IP| 
pi + Mc 


u(p,o) = (16.4.72) 


o-p& 


where p = p/|p|. At low energies, i.e., for |p|/M/c < 1, the upper component 
is large in comparison to the lower one with the tater suppressed by a factor 
of the order |p|/Mc, and exactly the opposite happens for v(p,o), giving 


bo 0 
u(p, 7) ~ ,  -(p,o) ~ (16.4.73) 
0 & 


For later reference, we note that by using the elementary property 
(HY? = Ip? (16.4.74) 


for the Hamiltonian in (16.4.51), the Dirac time evolution operator takes the 
simple form 


et H et 
e-tH/h — T cog (EE ) i= sin (EE ) (16.4.75) 
pre 


where p® = +,/p? + M2c?. 
In the next section, we consider massless particles in the light of the Dirac 
equation. 
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16.5 Massless Dirac Particles 


To investigate the nature of Dirac massless particles, it turns out to 
be more suitable to work in the chiral representation of the y~ matrices in 
(16.1.26) obtained via the unitary operator G given in (16.1.25). As we have 
seen in $16.3, it is essential to deal with the helicity, i-e., with the component 
of spin parallel to the direction of the momentum p of the particle. 

To treat massless particles it is most instructive to consider first the zero 
mass limit M — 0 of the solution of the Dirac equation and then compare 
the solution obtained with the one derived by studying the Dirac equation 
with the Mass M set equal to zero at the outset. 

In the chiral representation, we have explicitly from (16.1.25), (16.4.20), 


u'(p,o) = Gu(p,c) 


po+Me | o-p é 
2Mc J2Mc(p? + Mc) | 
tages / (16.5.1) 
v2 p+ Me 


bo 


o-p 
2Mc V2Mc(p® + Mc) 
Using the fact that 


O _ 
: Vor ne (16.5.2) 


po +Me Ip| 


and the expression for cosh/2 and sinh v/2 in (16.3.5) leads, using in the 
process (16.2.28), to 


exp (+5 o-N) fs 
oe 
—exp|—xo- _ 
2 


where N = p/|p|, and from (16.2.27), (16.4.26), (16.4.27), 


2|p| 1 — |p|/p® 
0=1 16.5.4 
(Ble ee veg 


where p® = +1/p? + M2c?. u'(p,c) satisfies the normalization condition 


(16.5.3) 


u'(p,o) u'(p, 0") = deo! (16.5.5) 


with w = u'i+/°, as expected. 
For €, = £_n (see (16.4.40)) 
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i e9/2E 
u'(p,o) = = (16.5.6) 

V2 —e9/2¢ 

and from (16.5.4), we note that for M@ — 0 
0 
1 1 2|p| 
; l 16.5. 

u(p,c) =e [5 (22) | ak (16.5.7) 


for which using now the normalization condition 
ul(p,o) u'(p, 0’) = dao" (16.5.8) 
gives rigorously for M — 0, 


0 
u'(p,o) = ; (16.5.9) 
fn 


For the charge conjugate spinor, we have directly from (16.4.32) in the 
chiral representation 


v'(p,o) = —= (16.5.10) 
exp (-3 Oo: N) cr 


where €4.n, = £E4n (see (16.4.42)). Hence for 5 = €_n, 


0/2 [S+N 

v'(p,¢) > —T= 16.5.11 
(p, 0) \ ( ) 

for M — 0, and now with the normalization condition 
vl (p,o) v'(p, 0") = deo! (16.5.12) 

we obtain 
fin 
v'(p,o) = (16.5.13) 
0 


It is interesting to note from (16.5.9), (16.5.13), that the solution of pos- 
itive and negative energies decouple. 
Similarly, for €, = €4Nn, we have for M — 0, 


Cin 0 


u'(p,o) = ; vu (p,o) = : (16.5.14) 
0 €-N 
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When the mass M is set rigorously equal to zero at the outset in the Dirac 
equation, the Hamiltonian in the chiral representation is given by 


co-p 0 
H!=7°y'-p= (16.5.15) 
0 -co-p 
and 
ri _{o 0 1B IO 
B= B= (9 a 7 =(05 : (16.5.16) 


Now it is an easy matter to show (see Problem 16.11) that the only 
matrices in the complete set of the 16 matrices in the ordered set!® {1%} 
in the chiral representation that commute with H’, apart from the identity, 
are 7° and the linear combinations (N = p/|p|) 


o O o O 
aN- (5 Sy) +o0N- G :) (16.5.17) 


for arbitrary numerical factors a and b. The first matrix in (16.5.17) is nothing 
but AH’, up to a multiplicative numerical factor, while the second is © - N 
also up to a numerical factor in the momentum description. Also note that 
[7v°,u-N] =0. 

That is, we have the commuting set {H’,S-N,7°} of operators:!7 


[H’,S-N]=0, [H’,7*]=0, [7°,S-N] =0 (16.5.18) 


where S = fidU/2 (see (16.2.44), (16.2.46)), and we must find the simultaneous 
eigenstates of these operators to specify the state of a particle. For positive 
energy 


H'u'(p,o) = |p|cu'(p, 7) (16.5.19) 
S-Nu'(p,c) =haAu'(p,c) (16.5.20) 
7°u' (p, a)=¢u'(p,o). (16.5.21) 


From (16.5.15), (16.5.16), it is easy to see that (16.5.19)—(16.5.21) are 
compatible only if u’(p,c) has either an upper component or a lower one. It 
is straightforward to show that the solutions for u’(p,o) are given by 


0 E4N 
u'(p,-1) = ,  w(p,+1)= (16.5.22) 
£-N 0 


as before, with 


16 See the appendix to §16.1. 
1” The role of parity will be discussed below. 
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h 
S-Nu'(p,o) =—5 u'(p,¢) (16.5.23) 
7°u'(p, oc) = —u'(p,c) (16.5.24) 
and 
/ nh / 
S-Nuw(p,o) = +54 (p,o) (16.5.25) 
7°u'(p, oc) = +u'(p,c) (16.5.26) 
respectively. 


7° is called the chirality operator, and we see from (16.5.23)—(16.5.26), 
that for a positive energy solution we always have 


=-Nu'(p,o) = yu (p,o). (16.5.27) 


That is, for positive energy massless Dirac particles, helicity and chirality 
have the same sign. 
For the charge conjugate spinor 


v'(p,o) = iy?" (p, 0) (16.5.28) 
the solutions are 
€4N 0 
(16.5.29) 
0 fn 
for €, = €_n, €4n, respectively. 
The corresponding equations to (16.5.19)—(16.5.26) are then 
H'v'(—p,o) = —|p| v'(—p, 2) (16.5.30) 
/ h / 
S-Nv(—-p,o) = —5'(-p.¢) (16.5.31) 
7°ou'(—p, a) = +0'(-p,c) (16.5.32) 
and 
/ h / 
S-Nv (—p, a) = 15 Vv (—p, o) (16.5.33) 
7°u'(—p, a) = —v'(p,o) (16.5.34) 
respectively, and we always have 
=+Nv'(—p,o) = —7°v'(—p,¢) (16.5.35) 


with chirality and helicity of opposite signs in this case. 
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We note that the solutions in (16.5.22) obtained for u’(p, a) are connected 
by the parity operation 


0 £4N 
4/0 ee (16.5.36) 


€-N 0 
up to a phase, where 7° is given in (16.1.26). Massless particles that are 
produced through processes which do not conserve parity, so that a massless 


particle with only one of the helicities, say, —A/2 in (16.5.23) is observed, 
may be described by the spinor 


u(p,o) = ; (I-77) u'(p,o) (16.5.37) 


which is non-zero for u'(p,o) = (0 én), and that 


7u(p,o) = —u(p,c), (16.5.38) 
S-Nu(p,c) = —Fiilp.o). (16.5.39) 


A particle with negative helicity is said to be left-handed, while one with 
positive helicity is said to be right-handed.!® 

The charge conjugate spinor U(p,c) corresponding to u(p,a) would be 
then defined as in (16.5.31), satisfying 


S-(—N) 0(—p,¢c) = +7 v(-p,c). (16.5.40) 


The fact that the positive and energy solutions in (16.5.22), (16.5.29) 
decouple, a two-component formulation for massless particles may be set up, 
but we will not go into it here. 

For massless particles of higher spin s, we will see in §16.8 that the only 
possible values for helicities are +s and no intermediate 2s—1 values appear. 


16.6 Physical Interpretation, Localization and Particle 
Content 


In the previous sections of this chapter, we have developed the Dirac equa- 
tion to provide a quantum description of relativistic spin 1/2 particles in a 
Lorentz covariant manner. We have witnessed the existence of negative en- 
ergy states. This, in particular, implies the unboundedness of the spectrum of 


'8 The word chirality is derived from a Greek word referring to “hand” or in this 
context to “handedness”. 
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the corresponding Hamiltonian from below as the kinetic energies associated 
with such states become more and more negative. One cannot also simply 
exclude these states, in general, from the underlying Hilbert space due to the 
lack of its completeness without them for a correct statistical interpretation 
of the theory. As a matter of fact, quantum transition probabilities of oc- 
currence of basic fundamental relativistic processes turn out to be incorrect, 
i.e., in contradiction with experiments, if such states are not included in their 
calculations. The purpose of this section, is to study the nature of the nega- 
tive energy states, their non-trivial consequences, the idea of localization of a 
particle and that of vacuum fluctuations. As mentioned in the Introduction 
to this chapter, these concepts provide the first steps in the development of 
quantum field theory. 

In the light of the Dirac equation, we first derive expressions for the prob- 
ability density and probability current as follow from this equation and we 
consider the initial value problem. One will soon realize that a single-particle 
interpretation with such a probability density turns out to be not complete, 
and one is led to a multi-particle theory. This is due to the fact that relativity 
allows the creation of an unlimited number of pairs of particles, and we must 
take them into account even when discussing the motion of a single particle. 
We then study the concept of a position of a particle, its localization and 
its role in the nature of relativistic corrections of the spectrum of the hy- 
drogen atom, together, with the proper interpretation of the negative energy 
solutions of negative mass. 


16.6.1 Probability, Probability Current and the Initial Value 
Problem 


Consider the Dirac equation in an external electromagnetic field 


G (% _ <A, ES ae v=0 (16.6.1) 


and of its adjoint multiplied by 7°: UW = Wty, 


ad Or 
wn (S 54, =| =° (16.6.2) 


Multiplying (16.6.1) from the left by W, and (16.6.2) from the right by W, 
and subtracting one equation from the other, we obtain 


Op (U(x)y"U(z)) = 0 (16.6.3) 


S Wt(0) W(x) + V- (cW(x)yV(x)) =0 (16.6.4) 
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providing a conservation law for the probability four-current density J“(x) = 
W(x)y"U(x)/h, in the unit adopted for U(x) in (16.4.45). The latter implies 
that the decrease of probability within a given volume of space is compensated 
by the flow of the probability three-current density J(x) = V(x)yW(a)/h out 
of the volume in question. The product W1(x)W(x)/he = J°(x)/c allows one 
formally to adopt the normalization condition in (16.4.45). We will see below 
at the end of this section, however, that a single-particle probability density 
interpretation of this in the relativistic domain turns out to be not complete. 

The total probability three-current Jyo:(t) at any given time t, may be 
determined from (16.4.44), with u(p,a) and vu(p, oc) satisfying the normaliza- 
tion conditions in (16.4.23), (16.4.37), and the easily derived properties 


U(p, c)yu(p, 0’) = -6,,. (16.6.5) 
Mec 
Op, 0)y0(p, 0") = fe bo0!, (16.6.6) 


to be given by 


Itot(t) = - | d°x W(x)yW(2) 
= oe [2Me dup (2) [latp. 0)? + [p.0) 
an 5 | 2Me dw, [z(—p, o)evu(p, 0’) 


. e2ip®ct/hg*(_p, a) b*(p,o’) +c.c.| . (16.6.7) 


Due to the interference between the negative and positive energy solu- 
tions, the current Jqot(t) oscillates rapidly with angular frequencies w = 
2\/p2c? + M?2c*/h > 2Mc?/h ~ 107! sec~!. Such oscillations were referred 
to as “Zitterbewegung” by Schrédinger. We will say more about them below. 

Given an initial condition U(x, 0), describing, for example, the localization 
of a particle, we will see how the negative energy solution contributes to the 
solution for t > 0. Given an initial condition, the Dirac equation provides in 
(16.4.44) the solution of the problem for t > 0. By using, in the process, the 
normalization properties in (16.4.23), (16.4.37), (16.4.39) we obtain 


a(p,o) = ul (p,c) [ox e'P'x/h (x, 0) (16.6.8) 


b*(p,o) = vu! (p,c) [ex e'P*/2 U(x, 0). (16.6.9) 
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For example, if the electron is initially prepared with spin, say, along the 
z-axis, and 


1 
W(x, 0) = F(x) |p (16.6.10) 
0 
then 
p?+ Mc | 
a(p,o) = \/ ——— éf F(p) (16.6.11) 
2Mc 0 
cos 6! 
* p?+Mce |p| /t 
b*(p,o) = 4 fae 16.6.12 
(p,c) Site po 4 Ale iiae (—p) ( ) 


(see (16.4.20), (16.4.32)), where p = |p|(sin ’ cos ¢’, sin 6’ sin ¢’, cos 0’) and 
F(p) is the Fourier transform of F(x) in (16.6.10). That is, given an initial 
condition, the coefficients b*(p,a), corresponding to the negative energy so- 
lution, would, in general, contribute to U(x,t) for t > 0 in (16.4.44). They 
are suppressed by a factor |p|/(p? + Mc) relative to the positive energy co- 
efficients at low energies. On the other hand for higher energies at which |p| 
is, say, comparable to Mc, |p|/(p° + Mc) is of the order one and b*(p,c) 
would be equally important as a(p,o). Accordingly, if the wavepacket F'(x) 
is such that the particle is localized within a volume of extension R, an hence 
typically |p| ~ f/R, then energies for which |p| ~ Mc will contribute to the 
integral in (16.4.44) and make b*(p, a) of the same order as a(p, a). Roughly, 
as soon as a particle is localized within a radius of the order of its Comp- 
ton wavelength or less, the negative energy contribution to U(a) cannot be 
neglected. 


16.6.2 Diagonalization of the Hamiltonian and Definitions of 
Position Operators 


We carry out a diagonalization of the Hamiltonian in (16.4.51) of a Dirac 
particle by using, in the process, a transformation of the type given in 
(16.1.30)—(16.1.35), which provides a representation of the Dirac equation in 
which the positive and negative energy states may be separately represented 
by two-component spinors. This new representation of the Dirac theory is 
referred to as the Foldy-Wouthuysen-Tani!® representation, and provides in- 
sight into the problem of the position of a relativistic particle and into the 
physics of the relativistic corrections of the hydrogen atom studied at length 
in §7.4. 


19 Foldy and Wouthuysen (1950); Tani (1951). 
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Let a = p/Mc in (16.1.32)—(16.1.34), then the unitary transformation 
matrix G in (16.1.30) takes the form 


1 
G = ———_—— [p° + Mec + y- 16.6.13 
2p°(p° + Mc) Ip y-P| ( ) 


and ; 
Gt =G@i- — —. pi +Mc—y-p . 

Vv 2p°(p® + Mc) 
It is understood that (p° + Mc) in the square brackets in (16.6.13), (16.6.14) 
is multiplied by the identity matrix. 

From the expression of the Hamiltonian H = 7° (cy- p+ Mc?) in the 
usual Dirac-Pauli representation, and the identity in (16.1.35), we obtain the 
following expression for the Hamiltonian in the new representation 


(16.6.14) 


GHG =H! =7°\/p2c? + M22 = 7°p°c. (16.6.15) 
The Dirac general solution U(x) in (16.4.44) then transforms to 


W' (x) = GU (2) 


2Mc dw ip-x —ip?x° 
->/ p p:x/h [e p/h»? (1+7°) u(p,c) a(p,c) 


0 
s/ 2p? (p® + Mc) 
— eip®#®/h 0 (fp _ 40) a(— b*(— 
e p® (I— 7°) v(—p, 2) b*(—p, 0) 


+(x) 0 
= “ (16.6.16) 
0 _(z) 


where we have used (16.4.49), (16.4.50) after having multiplied these two 
equations by 7° from the left. Here we have identified +(x) with the two 
component spinors as extracted, respectively, from the projection operators 
(I+7°)/2 onto upper/lower components as occurring within the square brack- 
ets in (16.6.16). From (16.6.15), (16.6.16), we then have the two-component 
equations 


ihe (2) =t+V R20? + M2c4 bs(z) (16.6.17) 


where we recall the definition of 7° in (16.1.8). 
On the other hand, the Dirac position variable x in the new representation 
becomes 


x’ =GxG! 
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ihy h : 0 
= £ mes >> } 
se 2p° + 2(p°)?(p® + Mc) {Py ay eee 4 


=x+R (16.6.18) 


as is easily verified, where we have used the fact that in the p-description 
x = ihV,, and that the momentum p is the same in both representations 
since G in (16.6.13) commutes with p. In writing the last expression within 
the curly brackets in (16.6.18) we have also used the identity 


i(p+y-py) == xp. (16.6.19) 


The first term x on the right-hand side of (16.6.18) in the new represen- 
tation at time t is given by 


x(t) = elt /P x eit /h oy . : (16.6.20) 
(p°) 
and for the associated velocity 
d 
2) ee 2 (16.6.21) 


dt — (p®)? 
The latter is uniform in t, and is closely analogous to the classical concept 


of velocity, and gives the expression = c when applied to a positive energy 
Pp 


solution. 
The position operator x’ = Sass however, is far more complex. For a given 
well behaved spinor ©’(2), we may write 


Xop U (z )= fers k(x, x’) W/(2°, x’) (16.6.22) 
and formally define, in the sense of distributions, the kernel, 
. Vv? 
k(x,x’) = xd3(x — x’) + a I,(a) y: V Vi2(a) 
27? An? 
+ — SIRE * Vila) (16.6.23) 
where 
1 dz sin(za) 
I = 16.6.24 
1(a@) Gifs x Vz+1 ( ) 
1 fe zdz sin(za) 
Io(a) = 16.6.25 
2(a) a) (224.1) (V2? +141) ( ) 


ede sin(za) 
I3(a) = l @navwea (16.6.26) 
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ery (16.6.27) 
h 

The kernel k(x,x’) does not vanish for x # x’, ie., it is non-local. The 

functions Iy(a), I2(a), I3(a) are plotted in Figure 16.1. They become small 

for a > 1, ie., for |x — x’| >> h/Mc. That is, x’ defines a non-local operator 

with a non-locality spread roughly of the order of the Compton wavelength 


h/Mc of the particle. 


Fig. 16.1. Graphs showing the behavior of [;(a), I2(a), I3(a) in (16.6.24), 
(16.6.25), (16.6.26) on their dependence on a > 0 in (16.6.27). Their rapid 
vanishing properties for |x — x’| >> h/Mc are evident. 


The time evolution operator in the new representation may be explicitly 
written as ‘ : 
eRe tp°c tp’c 
eitH’/A — cog (=) — iy sin (=) (16.6.28) 


where we have used the fact that (7°)? = I, (7°)? = 7°. Accordingly, the 
time evolution of x’ is given by 


x! (t) = elt H/F x! git’ /h (16.6.29) 


and from (16.6.18), (16.6.28), this works out to be 


x x 
p +t PH 


/ 
th= 
5 a 2p°(p® + Mc) p 
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and coincides with the expression in (16.6.18) for t = 0, as expected. 
Equation (16.6.30) also gives 


dx'(t) Poy, f; ipy-p _ ( 2p°te ie 2p°te 
ae = 7 H’+c\ iy (po + Mo) sin i +1y° cos h ; 


(16.6.31) 


From (16.6.18), (16.6.21), (16.6.31) we may write 


d : ipy:p pope 26 2p" te 
qe) =C (iv = weet) sin ( hi + 1y COs hh 
(16.6.32) 


which gives the deviation of the velocity dx’/dt from dx/dt. 

Therefore in the new representation in which the Hamiltonian is diagonal, 
one may, in general, define two position operators x(t), x’(t). The former 
is closely analogous to the classical concept of position corresponding to a 
uniform velocity. The latter, however, executes a complex motion, involving 
a rapidly oscillating one with frequencies 2p°c/h > 2Mc?/h ~ 107! sec 
(the so-called Schrédinger’s “Zitterbewegung”) about x(t) and is responsible 
for the non-locality and the associated spread of a particle over distances 
roughly of the order of its Compton wavelength. Because of the reasons just 
mentioned, x is usually referred to as a “mean” position operator, while x’, 
with its inherited complex motion in time, is responsible for the “jittery” 
behavior of the particle. 

The orbital angular momentum in the new representation reads 


L’=x’xp 


=xxp+Rxp 


=L+Rxp (16.6.33) 
and for the spin 
S’= p GrGt 
2 
A ih A 
5 50 Oe dp°(p? + Me | xp) Xp 
=S-Rxp (16.6.34) 


where now x X p and AX/2 define “mean” orbital angular momentum and 
spin, and both commute with H’, 

h 
2 
(see (16.6.15)), i-e., they are separately conserved. It is interesting to note that 
the total angular momentum remains invariant under the transformation 


Ixxp,H’])=0, =[5,H’]=0 (16.6.35) 
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The derivations L’-L = Rx p, S’—S = —Rx p from the “mean” operators 
are expressed in terms of the deviation of the position operator x’ from the 
“mean” one R = x’ —x, and are of orbital angular momentum of origin. With 
the fluctuating character of R(t), with velocity as given in (16.6.32), one may 
formally interpret the above spin deviation as an additional orbital angular 
momentum associated with the complex motion of the particle over extended 
regions as discussed above. 

A similar analysis as above through (16.6.18)—(16.6.34) may be also car- 
ried in the Dirac representation. In particular, the operator X in the Dirac 
representation whose transformation in the new representation is x is clearly 
given by 

X=G1xG. (16.6.37) 


In the p-description, x = ihVp, and a straightforward application of (16.6.37) 
gives 
ihy A 


».< — 
a 2p° —-2(p°)?(p? + Mc) 


{ipy-p +p" x p} 


=x-n. (16.6.38) 


This position operator is referred to as the Pryce-Newton-Wigner?” operator. 
It satisfies the commutation relations 


and with X(t) = exp (itH/h) X exp (—itH/h), with H given in (16.4.51), 


d pH 
0 0 


qt) = = 24° (y+ pe+ Me) (16.6.40) 


pp??? 


This velocity operator is given by a satisfactory expression. It is uniform 
and gives the expected expression pc/p? when applied to a positive energy 
solution. The operator x(t), however, is far more complex as discussed below. 

By using the fact that x(t) = exp (itH/h) x exp (—itH/h), and the explicit 
expression for the Dirac time-evolution operator given in (16.4.75), we obtain 


ih tHp 
aaa (°yH — pc) + oF 
a ih ( OV FT ) re 2p" te iff r 2p°te 
— pe 3 sin 
2p? ¢ ae y h pc h 


(16.6.41) 


20 Pryce (1948); Newton and Wigner (1949). 
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where the identity 
(y° YH — ep) = — (Hy"y — ep) (16.6.42) 


should be noted. 
For the deviation n(t) = x(t) — X(t), we also have 


dn(t) io _ { 2p°te _ it 2p°te 
Ee = 50 ( yH - pc) sin t ur cos i (16.6.43) 


which has the oscillatory characteristic behavior with frequencies 2p°c/h > 
2Mc? /h. 

Therefore in the Dirac representation, one may also, in general, define 
two position operators X(t), x(t). The velocity corresponding to the former 
is analogous to the classical one, while the Dirac position operator executes a 
complicated motion involving a rapidly fluctuating one with high frequencies 
(the “Zitterbewegung”) about X(t) and is, again, responsible for the non- 
locality and the associated spread of the particle. These fluctuations were also 
encountered when studying the total probability current in (16.6.7) arising 
from the interference of positive and negative energy solutions. We note that 
due to the fact that X(t) is not in “diagonal” form,?! it mixes the positive and 
negative states, i.e., it does not commute with the sign of the energy operator. 
Also note that dx/dt|,_) = cy°y, and (cy)? = 3c? for its square. This is 
unlike the velocity operator dX/dt in (16.6.40). The operator X(t) is usually 
referred to as a “mean” position operator, while x, with its inherited complex 
motion, simply as the Dirac position operator. 

The “mean” orbital angular momentum in the Dirac representation is 


defined by 
L=Xxp 
=xxp-nxp (16.6.44) 


and the “mean” spin operator by 


h 

s-G--ne@ 
2 
ih h 

=S8 xX p4 x xX p) X 
apo’ XP + a 5G + Ma (UX?) XP 


21 By a matrix in “diagonal” form, in this context, it is meant a matrix of the form 
A’ 0 P , 
( 0 B’ where A’ and B’ are 2 x 2 matrices. Such matrices are also referred to 


/ 
as even matrices, while a matrix of the form ( Bl a is referred to as an odd 


matrix. 
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=S+n Xp. (16.6.45) 


These “mean” angular momentum operators are separately conserved. Also 
the deviation S—$ = —n x p may be formally interpreted as an additional or- 
bital angular momentum contributing to spin due to the complicated motion 
of the particle in an extended region as before. 


16.6.3 Origin of Relativistic Corrections in the Hydrogen Atom 


Consider the Dirac Hamiltonian in the presence of an electrostatic poten- 
tial ed(x): 


A, = 7° (cy: p+ Mc’) + e¢(x) 


On, , 
me ie i ec € 
ie it Mc Mc? 


o(x) 


=Mc?H, (16.6.46) 


We wish to diagonalize this Hamiltonian. We will eventually consider correc- 
tions up to the order 1/c? only. 

We first apply the unitary G transformation given in (16.6.13) to Hy, to 
obtain 


ee, ees 2 
Hl =GH,G> =7/1+-2. +. 6(Gxe) (16.6.47) 
Cc 


where we have used (16.6.15), and that 
Go(x)G" = o(GxG~") 
= 4(x’) (16.6.48) 


Here x’ is given explicitly in (16.6.18), where x is the “mean” position and 
x’(t) — x(t) has the complex motion elaborated upon earlier associated with 
“Zitterbewegung”. 

We are interested in finding the corrections in H, up to order 1/c”. That 
is, we have to find corrections in H; in (16.6.47) up to 1/c*. The ¢ term is 
already multiplied by 1/c?, hence we may solve for x’ in (16.6.18) up to the 
order 1/c?. This is given by 


ih h 
4M 2c? 


= xp (16.6.49) 


where we note that (1/p° — 1/Mc) is of the order (Mc)~°. 
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Up to 1/c*, we then have”? 


fo tncte > ih ih h 
300) = 50 (x- Sev- aad XP] 


l2 


Cc Mc 4M 2c? 
1 ih hn? 
~ a(x) = Meat V(x) + 8M2¢ — V"o(x ) 
A 
— a VP) +(x p) (16.6.50) 


expanded about the “mean” position in the Foldy-Wouthuysen-Tani repre- 
sentation. o 
Accordingly, up to 1/c*, Hi is given by 


2 4 
TT! ~ ~0 p p e 
Brey (1+ 2M22 ana) mee 
ihe 4). Vole) + V(x) 
~ oye 8M3c4 
he > 


This is almost in diagonal form except for the y - V¢(x) term which is in 
“non-diagonal” form and sticks out as a sore thumb, and we have to carry 
out a further transformation to diagonalize Hy}. 

To the above end, we use the following key equation. Given any odd ma- 
trix, as defined before, i.e., a matrix O of the form 


Bo 0 
= (36) 


where A’, B’ are any 2 x 2 matrices, then 


1 ae 
57°6.77 =-O (16.6.52) 


and note that minus sign on the right-hand of this commutation relation. 
Therefore to diagonalize H} and cancel out the y- V¢@ term in (16.6.51), 


which is of the order 1/c?, we choose O = (—ihe/2M?c’) y-V 4, and introduce 
the unitary transformation 


2 For any three-vector k which commutes with x, © and p, we note that k-x and 
k- (x p) commute. This allows us to carry out the Taylor expansion on the 
right-hand side of (16.6.50). & x p and y do not commute, however, but keeping 
track of their commutators is not important since this will lead to corrections of 
the order 1/c®. Also note that (y: V)? = —V?. 


928 16 Quantum Description of Relativistic Particles 
/ 1 0A 
G=I+ 37 O (16.6.53) 


correct up to order 1/c3. The term y°O/2 can act only on the 7° term, with 
coefficient one in (16.6.51), as the product of y9O/2 with any of the other 
terms in (16.6.51) are of the order 1/c° or higher. Hence 


it! ~o' ia 


2 4 
~7° [14 . Ls e 
: ( IMIe — Mid) + ea?) 


he 
8M3c4 


he 


x ~ AM3c4 


V" d(x) V¢(x) + (= x p) (16.6.54) 
where we have used (16.6.52) to cancel out the O term, proportional to y-V¢, 
identified above. 

Setting e¢(x) = U(x), and using the identity V¢(x) - (2 x p) = -—h- 
(V(x) x p), the diagonalized form of H; in (16.6.46), up to the order 1/c?, 
is given by 


I of Me p? p* U he v2u 
1 
. ( OM ee es) 8M?c? (x) 
+ S-(VU(x) x p) (16.6.55) 


2M 2c? 


which is obviously in diagonal form. When restricted to the upper component 
of the Dirac spinor, it should be compared with (7.4.32) (see also (7.4.38)), 
where we have generated the Darwin and the spin-orbit coupling terms, as 
well as the leading relativistic correction to the kinetic energy. H1/ is expressed 
in terms of the “mean” position operator x, in the Foldy-Wouthuysen-Tani 
representation, with the corrections arising from the deviation of the position 
operator in this representation about the “mean” one associated, in particular, 
with “Zitterbewegung”. This provides a beautiful explanation of the Darwin 
term arising, as shown in (16.6.50), from the fluctuations of the position of the 
electron which encounters a smeared-out Coulomb potential leading to the 
V’¢ term in (16.6.55). An exact treatment of the bound Coulomb potential 
is given in the appendix to this section. 

Now we generalize the above analysis leading to (16.6.55) by including 
a time-independent three-vector potential A as well. That is, we consider 
the Dirac equation given in (16.1.18) with time-independent A,,. The Hamil- 
tonian associated with such a system is then given by 


Hy = 7° (cy: p+ Mc?) — ey y+ A(x) — eAo(x) (16.6.56) 


where Ag(x) = —@(x). 
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As in (16.6.46), we rewrite (16.6.56) in the form 
gf _eG(x) & 


Hy = Mc? | “A 
sae Mc Mc? Ma" y- A(x) 


= Mei, (16.6.57) 


We will diagonalize Hy up to the order 1 /c* as before. 
The odd matrix in the square brackets in (16.6.57) is 


Le Oe € 
= : A 16.6. 
ae e (p C ) 028) 


The commutation relation in (16.6.52) suggests to consider the matrix 


A jas ol ofr _ YY 
<D! = =O! = 7. (p- <A) (16.6.59) 


We introduce the transformation 
1 
1G’ = exp (20') (16.6.60) 


as a first step to diagonalize Hy. 
Since we are interested in going up to the order 1/c*, we may use the 
expansion 


1 


D'/c —D'/c dl 
ePl’Te eT ee | DsT|s sa 


[p', [D'.7]] 


1 


+5 aa?" [> ['.71]| 


1 


+25 >. LD’ [D’. m.7)]| 


for any term T in Hp. For the terms proportional to 1/c? in Hp, we may 
restrict the right-hand side of (16.6.61) to the first three terms only and so 
on. 

The first transformation is readily worked out (see Problem 16.15), and 
is given by 


(16.6.61) 


Hi = e?'/¢ Hye P'/e 
er | epee saa’ [yy ( ~£a)] 
=7 + 7720) + aypeat [VY \P-- 


1 
Seat b°y-p]*- sal [0° yp, [yy: p, 4] 


930 16 Quantum Description of Relativistic Particles 
ha) (16.6.62) 
ede .6. 


where O” consists only of odd matrices, and all the other terms in (16.6.62) 
consist of even matrices. Is the explicit expression of O”in (16.6.62) impor- 
tant? The answer is no, since it already involves the factor 1/c?, and we may 
just use the commutation relation in (16.6.52) and define the transformation 


0 
gospel 16.6. 
2G +530 (16.6.63) 


correct up to the order 1/c*, to cancel out the O"/c3 term in (16.6.62). The 
term 790" /2c? in (16.6.63) is allowed to act only on the 7° term in (16.6.62), 
with coefficient one, as all the other terms in (16.6.62) are of the order 1/c? 
and of higher orders, and would otherwise lead to corrections of the order 
1/c° and higher. 

Finally, note that (V -A = 0) 


e€ 2 e.\2 eh 
n°y- (p - “A)| = (p = <A) - “35 -B (16.6.64) 
(see also (7.4.31)), where B= V x A, 
[yy p]* = (p?)” (16.6.65) 
and 
[n° -p, [7° =p, a] ~ _A2V26 — 2hE- (Vo x p) (16.6.6) 


All told, we obtain by restricting to the upper components, denoted by 
|,, the diagonalized Hamiltonian 


He |e 3G" (1c? Hi) 2G" 
+ 
1 2 p* 
~ Me? ( A) 
“OM ame 1 UO) 
2 ” 1 
+ VU + sag 8+ (VU XP) 
e€ 
—_——§-B 16.6. 
ne (16.6.67) 


up to the order 1/c?. If A has a built in 1/c factor one may linearize 
(p— eA/c)” in A thus obtaining the Hamiltonian in (7.4.32) up to the rest 
energy Mc?. As discussed in §7.4, upon rewriting the last term in (16.6.67) as 
- (2e /2M c) S-B, the theory provides, the approximate g-factor equal to 2 for 
the electron. The correction to g has been investigated in §8.5. In (16.6.67), 
S =ho/2, U=e¢. 
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16.6.4 The Positron and Emergence of a Many-Particle Theory 


The fact that the Dirac theory predicts negative energy states with en- 
ergies going down to —oo would imply the instability of such a system. For 
example, an electron in the ground-state energy of an atom would sponta- 
neously decay to such lower and lower negative energy states emitting radia- 
tion of arbitrary large energies leading eventually to the collapse of the atom 
with a release of an infinite amount of energy. To resolve such a dilemma, 
Dirac assumed that a priori all the negative energy states are filled with elec- 
trons, giving rise to the so-called Dirac sea or the Dirac vacuum, in accord to 
the Pauli exclusion principle so that no transitions to such states are possible, 
thus ensuring the stability of the atom. 

The consequences of the above assumption of a completely filled vacuum 
with negative energy electrons are many. A negative energy electron in the 
Dirac sea, may absorb radiation of sufficient energy so as to overcome an 
energy gap such as from —Mc? to Mc’, or the corresponding one to the 
problem at hand, thus making a jump to a positive energy state leaving 
behind a surplus of positive energy and a surplus of positive charge +|e| 
relative to the Dirac sea. This has led Dirac in 1930 to interpret the “hole” 
left behind by the transition of the negative energy electron to a positive 
energy state, or the absence of the negative electron, as a particle that has 
the same mass as the electron but of opposite charge.?* This particle called 
the positron (e+) was discovered shortly after in 1932 by C. D. Anderson 
who apparently, as mentioned before, was not aware of Dirac’s prediction at 
the time of his discovery. Incidentally the above argument has also provided 
an explanation of the so-called pair production y — ete~ by a photon (in 
the vicinity of a nucleus?*). The “donkey” electrons, as G. Gamow named 
them, led to the birth of the positrons with opposite intrinsic parity to that 
of electrons. Conversely, if a “hole” is created in the vacuum, then a positive 
energy electron may make a transition to such a state releasing radiation 
giving rise to the phenomenon of pair annihilation. 

In the field of a nucleus, a pair ete~ may be created, violating conserva- 
tion of energy for a short time At ~ h/2Mc?. An electron in orbit around 
the nucleus may jump into the “hole” thus created, while the electron cre- 
ated would travel a distance of the order cAt ~ h/2Mc replacing the initial 
electron in orbit during this time. This process may be repeated providing 
a simple explanation of “Zitterbewegung” as an exchange process between a 
“primary” electron in an atom and a “secondary” electron ejected from the 
Dirac sea in the field of the nucleus occurring within distances of the order of 
the Compton wavelength h/Mc of the electron. Pairs created in the vicinity 
of the nucleus would lead to a particle screening of the charge of the nucleus. 


23 Dirac initially assumed that the particle is the proton since there were no 
positrons at that time. Apart from the large mass difference between the proton 
and the electron, there were other inconsistencies with this assumption. 

24 The presence of the nucleus is to conserve energy and momentum. 
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An electron, in the atom, at large distances from the nucleus would then 
see a smaller charge on the nucleus than an electron nearby (such as one in 
s-state). This leads to the concept of vacuum polarization, by the field of the 
nucleus, and also to the concept of charge renormalization as a result of the 
charge screening mentioned above. 

Thus by putting relativity with quantum physics one has encountered neg- 
ative energy states, going down to —oco. By invoking stability of the atom one 
was led to the discovery of anti-matter which finally led to a multi-particle 
theory. The wavefunction of a single-particle quantum particle in the rela- 
tivistic regime turned out to be not complete, and a formalism which would 
naturally describe creation and destruction of particles became necessary. The 
so-called “hole” theory, with a filled Dirac sea of negative energy electrons, 
although it gave insight into the nature of fundamental processes involving 
relativistic quantum particles and of vacuum fluctuations, turned out to be 
also incomplete. For example, in the “hole” theory the number of electrons 
minus the number of positrons, created, is conserved by the simultaneous 
creation of a “hole” for every electrons ejected from the Dirac sea. In nature, 
there are processes, where just an electron or just a positron is created. Ex- 
amples of such process are 3~ decay or muon decay: ~~ — e7 + Ve + Vp, 
and 3+ decay: p> n+e* +, for a bound proton in a nucleus, where 1, 
Ve are neutrinos, and % is an anti-neutrino, associated with the respective 
particles. Also we have seen in (16.3.19) that the Dirac equation, in a given 
external potential A,,, may be transformed by charge conjugation to describe 
a priori a particle of positive charge + |e|. That is, if we were set initially to 
develop a relativistic theory for the positron and go through a “hole” theory 
with a sea filled with negative energy positrons would we have discovered the 
electron? 

The Dirac theory with its tremendous accomplishments led to the devel- 
opment of modern (relativistic) quantum field theory, where fields describe 
the creation and annihilation of particles and the number of particles need 
not be conserved. The concept of a potential, with its inherited inconsisten- 
cies, in a relativistic setting, in such a multi-particle relativistic theory with 
particles created and annihilated, was abandoned in describing the interac- 
tion of elementary particles. The interactions between the particles were thus 
described by the exchange of particles rather than by potentials. Lorentz in- 
variance turned out to be readily implemented in quantum field theory?’ and 
vacuum fluctuations were found to be a natural consequence of the theory. 


25 See, for example, the expression of the Lagrangian density in (16.9.32). 


16.6 Physical Interpretation, Localization and Particle Content 933 


Appendix to §16.6: Exact Treatment of the Dirac 
Equation in the Bound Coulomb Problem 


In §7.4-87.7, we have given a detailed treatment of relativistic corrections 
to the hydrogen atom spectrum and further physical insight into the prob- 
lem was gained through the analysis worked out in (16.6.46)—(16.6.55), in 
the problem of approximately diagonalizing the Dirac Hamiltonian with the 
Coulomb potential and in the light of the concept of “Zitterbewegung” of the 
electron. In this appendix, we give an exact treatment of the Coulomb po- 
tential problem in the Dirac equation. The exact expression derived for the 
bound energy spectrum, when expanded in powers of the fine-structure con- 
stant, agrees, as expected, with the corresponding one obtained in §7.4, §7.5 
to the order kept there. As we have already seen in §7.6, §7.7, however, there 
are additional physical effects that contribute to the hydrogen spectrum. The 
retention of higher order terns, to arbitrary orders, in the expansion of the 
exact energy expression obtained below cannot then be justified as there 
are other competing contributions to the spectrum not accounted for by an 
analysis of the Dirac equation in this potential alone. 

Upon writing the Dirac spinor WV as (wy we) |, then the eigenvalue equa- 
tion as obtained from the Hamiltonian H, in (16.6.46) gives the following 
two two-component coupled equations 


co- py + (U — Mc? — E) d2 = 0 (A-16.6.1) 


co py2+ (U+ Mc? — E) y, =0 (A-16.6.2) 


where U = e@. 
We use the identity 


(o-x)(o-p) =x-p+io- (x x p) (A-16.6.3) 


which upon multiplying from the left by o - x leads to 
(-p) = 0-x(x-p+ -o-L) (A-16.6.4) 
r 


where r = |x|, X is a unit vector along x. 
The Hamiltonian H; commutes with J”, J3, L?, and upon using 


A 
J=L+ 52 (A-16.6.5) 
we may write 
eek 2 2 349 
u-L= 7 € L ah ) (A-16.6.6) 


and also 
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1 2 2 349 
o-L= ip (s L af ) (A-16.6.7) 
where it is understood that the right-hand sides of the latter two equations 
are multiplied by unit matrices. 

In Appendix to §7.5, we have combined the orbital angular momentum 
and spin, and given the expressions for (0, | j,& = j = 1/2,m,;) in (A-7.5.4), 
(A-7.5.5) as two-component objects. It is easily seen that 


o-L (9,6) 5,0 = 5 $ 1/2,mj) = [Eh (j + 1/2) — A] (0,615,0 = 5 ¥ 1/2,my) 
(A-16.6.8) 


or 


(o-L+h) (0,9| 9,0 = 9 F1/2,mj) = £h(j + 1/2) (0,619, € = 9 = 1/2,m;) 
(A-16.6.9) 


We also use the identity in (A-7.5.7), 


o°X(0,0|j,C=5F 1/2,m;) an (0,¢|7,€=7= 1/2,m5) (A-16.6.10) 


The eigenvalue equation in (A-16.6.9) suggests to rewrite o- p in (A- 
16.6.4) as 


o-p=0-8(p.+=(o-L+h)) (A-16.6.11) 
r 


where p, is defined in (7.2.2). 
Clearly, if we write 


r(x) = (—i)F(r) (0, 6] 9, € = 9 = 1/2,m;) (A-16.6.12) 


where the (—i) factor is chosen for convenience, then 


, ihf. 1 : 5 
o-pwdi(x) =i b. + — (3 + 5)| Fi(r) (0,0|9,€ = 7 £1/2,m,;) (A-16.6.13) 
where we have used (A-16.6.9), (A-16.6.10) and noted that o-X is independent 
of r. 

Accordingly, we have succeeded in obtaining a separation of the variables 
r, (0,@) in (A-16.6.1), (A-16.6.2), by using in the process (A-16.6.13), by 


writing 


tbo(x) = G(r) (8,615, € = 5 £1/2,m5) (A-16.6.14) 
to obtain the radial equations 
1d kK Pe ia eee Mc?) = 
(Ar “) Fi(r)4 = G(r) =0 (A-16.6.15) 


(sare *) G(r) aM) =0 (A-16.6.16) 
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where & = +(j+1/2), and in obtaining (A-16.6.16), we have multiplied 
(A-16.6.2) by (—i). 


1d 
We apply the operator ane to both of the equations in (A-16.6.15), 
r dr 


(A-16.6.16), and rewrite the resulting equations in matrix form to obtain 


fas+ta-|()-[2+ ee] (6) 


1 (/—-K —Za\ (F 
+5 é: . ) GE (A-16.6.17) 


Here a = e?/hc is the fine-structure constant, and we have used in turn (A- 
16.6.15), (A-16.6.16) all over again to obtain the final form in (A-16.6.17). 


We set 
y= VK - 22a? (A-16.6.18) 


and we may diagonalize the last matrix on the right-hand side of (A-16.6.17) 
via the matrix?® 


> Ponte t a) eae Zax ) (A-16.6.19) 
whose inverse is 
z (n/n) Za x+y 
o K(k +7) a Za ) ; (A-16.6.20) 


In reference to the last 2 x 2 matrix on the right-hand side of (A-16.6.17), 
the Q matrix gives 


_1 [-k -—Za _ fy 0 
Q Ge : Je- G =) (A-16.6.21) 
Hence upon setting 
On ) = i (A-16.6.22) 
M2 AH E2 
=), p=2n, (A-16.6.23) 


with |E| < Mc?, and multiplying (A-16.6.17) by Q~+ and inserting the iden- 
tity QQ7! between (F a)" and the last 2 x 2 matrix in the latter equation, 
we obtain 


de Cod 1 EE Za yyH#1) 
; re) A-16.6.24 
E a pdp ( 4 her p pr )| . 


26 We use a method of Goodman and Ignjatovié (1997). 
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We carry out the substitutions 


ug = pte Ply. ui =pirteP/ty_ (A-16.6.25) 
and set # 
fas, SS os - 
n= hoe y (A-16.6.26) 
to obtain from (A-16.6.24) 
d? d ; 
p= 59+ + 2iyt+1) — el 94 — (1 — my =0 (A-16.6.27) 
dp dp 
ee Ma ae ag, Si ne (A-16.6.28) 
pau + By— el gy try =0. 6. 


These equations are particular cases of Kummer’s equation?’ 


< ee ee (A-16.6.29) 
Cage OY ap w= 6. 


where solution, regular at p = 0, is the confluent hypergeometric function 


a a(a +1) p? 


F(a,b;p) =14 bo. A-16.6. 
(a, b: p) eyo (A-16.6.30) 
and has the asymptotic behavior 
e T'(d) p ,a—b 1 
F(a,bsp) ~ ray ep [i+0(24)] (A-16.6.31) 


for p — co. It becomes a polynomial, however, for a a non-positive integer. 


From (A-16.6.25), (A-16.6.27), (A-16.6.28) we may infer that 
ys =cy F(1—n',2(y4+ 1);p) (A-16.6.32) 
y— =c_ F(=n',27; p) (A-16.6.33) 


The asymptotic behavior in (A-16.6.31), in conjunction with the defi- 
nitions of u+ in (A-16.6.25), necessitate that yz are polynomials. For (A- 
16.6.32) this means that the possible values for n’ are 1,2,..., while for (A- 
16.6.33) that n’ = 0,1,2,.... For n’ = 0, however, F (1,2(y + 1); p) diverges 
as e? for p — oo and hence c; must be taken to be zero for n’ = 0. 

All told, we may solve for E from (A-16.6.26) giving 


VAP —1/2 
=| (A-16.6.34) 


E=M¢e p+ 
(n+ 


27 Cf., Abramowitz and Stegun (1972), p. 504. 
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One may introduce the principal quantum number n = n’ + (j + 1/2), to 
finally rewrite (A-16.6.34) as 


-1/2 


72. 2 
E=Me |14 i (A-16.6.35) 


[» - (j+1/2)+ VU +1/2)" — Za? 


where we have used the fact that |«| = 7 + 1/2 (see below (A-16.6.16)). The 
reader may which to carry out an expansion of (A-16.6.35) in powers of Z?a? 
to make a comparison with the results in 87.5. 

The radial functions F'(r), G(r) may be now determined from (A-16.6.22), 
(A-16.6.25), (A-16.6.32), (A-16.6.33), with cy = 0 for n’ = 0, and may be 
then property normalized.?® 


16.7 The Klein-Gordon Equation 


In this section we consider the quantum description of relativistic spin 0 
particles. As in the spin 1/2 case, we encounter the expected negative energy 
states. For bosons, however, an argument based on a completely filled sea of 
bosons of negative energies to ensure stability breaks down as an arbitrary 
number of bosons may be put in a given energy level. Accordingly transitions 
to such negative energy levels would be possible and the argument collapses. 
A quantum field theory treatment, however, overcomes such a difficulty. To 
some extent the analysis given here follows the one of the Dirac theory in the 
previous sections. 


16.7.1 Setting Up Spin 0 Equations 


Under a homogeneous Lorentz transformation x — x’, as given in (16.2.6), 
a Lorentz scalar ®(), by definition, remains invariant, ie., 


@'(x') = (2). (16.7.1) 
For infinitesimal transformation given in (16.2.39), this becomes 
©! (x? + wa) ~ B(x) (16.7.2) 


or . 
B! (x) ~ B(x? — bw, 2) ~ B(x) + 5p oe” Ew ®(e) (16.7.3) 


where 


8 See Goodman and Ignjatovié (1997). 
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Lup = («50 _ 059.) (16.7.4) 


which upon comparison with (16.2.43), for example, one may infer the spin 0 
content of a Lorentz scalar (also pseudo-scalar). 

From the energy-momentum constraint in (16.1.6), followed by the so- 
called minimal substitution in (16.1.16) in the presence of an external elec- 
tromagnetic field, the wave equation of a spin 0 follows and is given by?® 


| (v" _ = A*) (P. = = Ay) - M?¢?] &(x) =0 (16.7.5) 


and is referred to as the Klein-Gordon equation of a spin 0 charged particle. 
Here we identify p, with h0,,/i. 

From one’s experience with the Dirac equation, the first thing that might 
come into one’s mind, regarding the above equation, is how to rewrite it in 
first order in space and time derivatives. This is easily achieved. 

To the above end, set 


1 e 1 
— [p* — -—A¥ |) 6 = —y# 16.7. 
WAG Cc ) es 1677-8) 


from which, we may rewrite (16.7.5) as 


1 


i 


(Pn - =A,,) x" +Mcb =0 (16.7.7) 


We may recast this equation elegantly in a form similar to the Dirac equation 
in (16.1.18) as 


[6* (py - <A,.) + Me] E(x) =0 (16.7.8) 


where 


(16.7.9) 


(i) 
lI 


00001 00000 
00000 00 00-1 

B°=i| 0 0000], B=i}]0 0000 (16.7.10) 
0 0000 00000 
—10000 0-100 0 


29 Recall the definition of our metric in (16.1.20), and the property of raising or 
lowering a four-vector index ‘yu’ as given in (16.2.4). Note also we have here 
divided the energy-momentum constraint equation in (16.1.6) by c’. 
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00000 0000 0 
00000 0000 0 

6? =i)/00 00-1], 6?=i/000 0 0 (16.7.11) 
00000 000 0 -1 
00-10 0 000-1 0 


as is easily verified. It is understood that Mc in (16.7.8) is multiplied by the 
identity element. 

The wave equation in (16.7.8) is referred to as the Duffin-Kemmer-Petiau 
equation. It is also readily shown, that the matrices G" satisfy the relations 


BY 8” BP + BP BY BY = — (BMg’? + BPg’"). (16.7.12) 


The matrix 3°, in particular, has no inverse. This is problematic in recasting 
(16.7.8) in the form of the Dirac equation given in (16.4.47) with the time 
derivative simply appearing on its left-hand side with no 3° multiplying it. 
This is due to the fact of the redundancy of some of the components in 
(16.7.9) as seen through the definition in (16.7.6). 

We may also rewrite the Klein-Gordon equation in another form which 
is of first order in the time derivative where no (singular) matrix, such as 
8° in (16.7.8), multiplies it on its left-hand side, and readily allows the de- 
termination of the Hamiltonian theory. This, however, turns out to sacrifice 
the relativistic appearance of the resulting equation. It is a two-component 
equation and is usually known as the Feshbach-Villars®° equation which we 


study next. 
To obtain the above mentioned equation, we rewrite (16.7.5) as 
7 n° +7 
14 1 ® 4 G= 16.7.1 

( in) ( in) M?c? : on) 

where j 
rH = (p" = < av") (16.7.14) 

c 


This suggests to set 


79 79 
(1 + i) P=, (: = z) ® = yo (16.7.15) 


obtaining the following two equations from (16.7.13), 


0 

Met! = vi + pa ip 2 (ti + ye) (16.7.16) 
7 . 

Mee? = pe as (v1 + Y2) (16.7.17) 


By using the properties of the Pauli matrices in (2.8.2), and setting 


30 Feshbach and Villars (1958). 
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p= i) (16.7.18) 


we may combine (16.7.16), (16.7.17) into the two-component equation 
now = (eA + Mc’o3) p + (a3 + io2) ia (16.7.19) 
ot = Se EEE ~ 


which looks like a non-relativistic equation. 


By taking the complex conjugate of the (16.7.19) and multiplying it from 
the left by —o1, we get 


* > cA : 2 
ine ( ) es (—eA® + Mc?o3) i + (03 + ig2) (P ve ) te 


(16.7.20) 
thus obtaining the charge conjugate wavefunction 
Yo = te (16.7.21) 
Vi 


That is, given an external field A,,, ¢- satisfies the same equation in (16.7.19) 
with the charge e — —e. 


16.7.2 A Continuity Equation 


We obtain a continuity equation in analogy to the Dirac case in (16.6.4). 
We will see below that the definition of the adjoint operation is to be modified 
here and is to be taken as 


Caaj = 03(C*)' 0 (16.7.22) 


where (C*)!' is the familiar one. 
Multiplying (16.7.19) from the left by (W*)'o3, and by following, in the 
process, a procedure similar to the one in deriving (16.6.4), we obtain 


° [vost] 4. J=0 (16.7.23) 
where 
h 
T= 5 (w*" (03 + ion) Vy — (Vu"") (03 + ion) ¥) 
2 aw (a3 + ig) w (16.7.24) 


The quantity w* | ogy) is given by 
w* osb = |dn|? — ol? (16.7.25) 
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and is the difference between two positive definite densities. Because of this 
property, w*' a3) is interpreted as a charge density when multiplied by e, 
rather than a probability density and is consistent with a theory describing 
particles of both signs of the charge. 

In reference to (16.7.25), we note that if we adopt the normalization 


[ex v*'o30 =4, (16.7.26) 
where q is some constant, then this implies that for the charge conjugate w, 
in (16.7.12) 
[ex wv 03 ue = 49 (16.7.27) 
as expected. 


The emergence of wl os rather than of the more familiar expression 
u* ' w has led to define expectation value of an operator O as*! 


(O) = [ox wv o30W (16.7.28) 


and define the adjoint of O as in (16.7.22). This is important because we may 
infer from the expression of the Hamiltonian 


rr 


H = (eA® + Mc’o3) + (03 + ioe) sar (16.7.29) 


as obtained from (16.7.19), that it is Hermitian with the definition given in 
(16.7.22), where we note that 


03(io2)* | 03 = ioe (16.7.30) 


16.7.3 General Solution of the Free Feshbach-Villars Equation 


Following the procedure developed for the Dirac equation in §16.4, we 
write the solution of (16.7.19) for A, = 0 as 


v(e) = [eure dup [e?*/" u(p)a(p) +e" o(p)b"(p)] (16.731) 


where u(p) is the positive-energy solution satisfying 


2 
0 P P 
[p Mc ie Mc 
| 


2 
ie [p? + Me + Br. 


u(p) =0 (16.7.32) 


31 For additional details on the definition of the expectation value see Feshbach and 
Villars (1958). 
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with p? = +\/p? + M?2c?. The solution of (16.7.32) is elementary and with 


the normalization condition 


aT; pe 
: = — 16.7. 
ur o3u = FF (16.7.33) 
(see (16.7.25)), is given by 
ee Mc+p° 
uP) = Cae (16.7.34) 


Similarly, for v(p) we have from (16.7.32) with p® — —p°, the solution 


ee Mc-— p® 


where p® = +,/p2 + M2c?, with the normalization condition 


0 


«T Pp 
S45 16.7.36 
wv o30 = — FF ( ) 
and 
u*!o3v =0 (16.7.37) 
The normalization condition 
[ex w* (x) 03 W(x) = 4 (16.7.38) 


for a given constant C’, then leads to the constraint 


[eMeae, [la(p)|? — |b) |?] = 4 (16.7.39) 


on the expansion coefficients a, b* in (16.7.31). The integral in (16.7.39) is, in 
general, not positive definite. This has led to interpret e times the integrand 
as a charge density as mentioned before. 


16.7.4 Diagonalization of the Hamiltonian and Definition of 
Position Operators 


The Hamiltonian 
2 


Me a Be 
H = Mc‘o3 + (03 + io) aM (16.7.40) 


for the free system may be diagonalized by the transformation matrix 


1 
G = —— [p®+ Mc + (p° — Mc)o1] (16.7.41) 


2,/Mcp® 
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whose inverse is 


1 
G-t = — [p® + Mc - (p® - Mo)oi] (16.7.42) 


2/Mcp? 
The transformed Hamiltonian works out to be 
WS=CaG. 
=o3p°c (16.7.43) 
and the position variable x becomes 


x’ = GxG"! 


1 


=x+R (16.7.44) 


The first term x in the new representation develops in time as 


x(t) = eitH' /hy Q—itH!’ /h 


=x+tShH' (16.7.45) 
p 
with associated velocity 
dx(t 
JU ee i (16.7.46) 
dt 


and is in close analogy to the classical concept of velocity. 
As in the Dirac case, x’ = x(,,, however, defines a non-local operator, and 
for a given well behaved function w’ (x), 


Xop (x) = | Px! k(x, x’) f(a, x’) (16.7.47) 
where 


k(x, x’) = x 0°(x — x’) 


oh? i dp iP: (x—x’)/h 


2 (Qrh)> [p? + M22] 
—Mc|x—x'|/h 
2558 Gen 505) aig IS — | (16.7.48) 
T |X — xX 


This shows a clear non-locality, associated with the position operator x’, 
with a spread roughly of the order of the Compton wavelength h/Mc of the 
particle. 
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To investigate the time development of the operator x’, we note that the 
time evolution operator in the new representation is given explicitly by 


ae, et 46; 
octtH’ J — og (ue ) eee (52) (16.7.49) 


This leads to 


x! (t) = eitH'/hy! it HH’ /h 
ih 2p° ct 2p° ct 
ajo Poi ae cos ( a ) ~ iogsin ( - )| (16.7.50) 


and for the velocity associated with the deviation R(t) in (16.7.44), we have 


dR(t) — icpo, | . ( 2p°ct ; feed 
rader > oe sin | + io3 cos Z (16.7.51) 


As in the Dirac theory, R(t) has a complex “jittery” motion, the “Zitterbe- 
wegung”, with high frequencies 2p°c/h > 2Mc?/h, where now M is the mass 
of the spin 0 particle. In analogy to the spin 1/2 case, the position operator 
x(t), in (16.7.45), is referred as a “mean” position operator with associated 
velocity which is uniform and is closely analogous to the classical concept of 
velocity, with x’(t) executing a complex motion about it. A similar analysis 
for corresponding position operators may be also carried in the original rep- 
resentation with the Hamiltonian given in (16.7.40) and is left as an exercise 
to the reader (see Problem 16.20). 


16.7.5 The External Field Problem 


We consider the Hamiltonian in (16.7.29) in the presence of an external 
time-independent weak electromagnetic field A,,(x), and rewrite first H as 


H = Mc? 403 ioe ee eel 
2M2c?2 Mc? 2M 2c? 


=Mc?H (16.7.52) 


where U = eA? = —eAp. 

We diagonalize the Hamiltonian H by considering relativistic corrections 
in it up to the order 1/c*, and hence up to 1/c® in H. 

The only non-diagonal term in (16.7.52) is the one involving o2. The 
other Pauli matrix which is non-diagonal is o,. To diagonalize H, we use a 
key commutation relation given below in (16.7.54). 

Consider the non-diagonal 2 x 2 matrix O of the form 
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O = ioza + boy (16.7.53) 


where a and 0 are any numbers, then from the anti-commutativity of 03 with 
01, 02, we have 


(5030, 05] me 5 (16.7.54) 


where note the minus sign on the right-hand side of this equality. 
The commutation relation in (16.7.54) suggests to choose, as a first step 
of the diagonalization process, the transformation 


== 01) (16.7.55) 


where 


= = Dy (16.7.56) 


From the expansion 


a x Ss i 1 
-1 
H' = G,HGy! ~ H+ S[D1,H] + 35 [Ps [Di #]| 
1 
+ 55 |D1. [D:, [D:, #1] (16.7.57) 
valid up to 1/c®, we obtain the expression 
, 7 ‘i 7 U [r, [7’, v]| 
H’~ 14+ 
. | IMP saa t wae | + we + 32M 
1 ~ 
+ 40> (16.7.58) 
where 
‘ee mc ve [n?, U] 
O» = -i | 16.7.59 
ce 102 F= sina | 1 4M! ( ) 


The matrix Op is the only non-diagonal one in (16.7.58) and has the structure 
given in (16.7.53). 

To cancel the O/c! term in (16.7.58), we introduce the transformation 
Gp = exp (D2 / &) such that the commutator of D2/c* with the 03 term with 
coefficient one in (16.7.58) cancels the term in question. This is clearly given 
by (see (16.7.54)) 
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03 1 x 
io ee le 
Go + 5 aO2 
7 ; [n?, U] 76 
=1 O71 3M4c4 t 109 8M3c4 t “1 78Me6 (16.7.60) 
The above transformation gives 
HY" =. Cie. 
- Pe mmf ae [p, [n?,v]] 
~ 931° OR 8M4ct ' 16M8E| M2" 32MeE 
ee 
+ 603 (16.7.61) 


where O3 is not-diagonal in the form given in (16.7.53). Its explicit expression, 
however, is not important since by choosing a transformation G3 ~ 1+ 
7303/2c® we may cancel out the O3/c® term in (16.7.61) by applying to 
(16.7.54) one more time. Here it is important to note that the only term that 
7303/2c° can operate on is the first term involving 03 with coefficient one 
in (16.7.61). All the other terms would, otherwise, lead to corrections of the 
order 1/c® or higher. 

All told, the diagonalized Hamiltonian corresponding to the one in 
(16.7.52), up to the order 1/c*, is given by 


oe oom 72 4 x ae iy 
S03 Ee VOM 8M32° 16Meeé 
[p?, [r?,v]] 
eee oe 16.7.62 
+ Seagal (16.7.62) 


For a Hamiltonian with an electric field only which is weak, (16.7.62) 
simplifies to 


2 4 6 
H" = M 2 p p p U 
val “Tom 8M32 * 16MecA| * 
[P?, [p?,U] | 


Up to the order 1/c?, corrections arise only in the kinetic energy term, and 
the next order correction is 1/c* rather than 1/c?. On the other hand for a 
weak magnetic field a correction of the order 1/c?, linear in A, comes from 
the —n*/8M3c? term within the square brackets multiplying o3 in (16.7.62). 
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This concludes our treatment of the spin 0 case. As mentioned in the 
introduction to this section, an argument based on a filled sea of bosons of 
negative energies to ensure stability breaks down. The quantum field theory 
treatment, however, overcomes such a difficulty. The spin 0 case with equa- 
tion in (16.7.8) will be considered again in §16.9 in discussing the spin and 
statistics connection. 

In the next section, we develop relativistic wave equations for arbitrary 
spins. 


16.8 Relativistic Wave Equations for Any Mass and Any 
Spin 


We generalize the Dirac equation for M > 0 to any spin s = 1, 3/2, 2, 
.., with s = 0 studied in the previous section and spin s = 1/2 studied in 
§16.4, $16.5. We consider in turn the cases M > 0 and M = 0. 


16.8.1 M > 0: 


We introduce a totally symmetric spinor U°%-°*, with k = 2s, where 
each of the indices a; goes over from 1 to 4, satisfying the equations 


mA) M an Pi 
(2M) ne 


(16.8.1) 


M On Br 
(2 +4 =) YO 18k (7) —0 
1 


That is, ’%!-°*(x) satisfies a Dirac equation with respect to each of its k 
indices. From the complete symmetry of U°''%* in the a,’s, an equation 
in (16.8.1) satisfied with respect to one of its indices implies the validity of 
the equations with respect to the other indices by permutations. The set of 
equations in (16.8.1) is called the Bargmann- Wigner set of equations. 

Lorentz covariance of the above equations implies, by invoking, in the 
process, Pauli’s Fundamental Theorem as in 816.1, that there exists a non- 
singular matrix, say, G such that under homogeneous Lorentz transformations 
YA1--Pe (ge) 4 Yeo!) with 


Yi (gy!) = (GEL... (G)%Pe YA1-—-Be (gp) (16.8.2) 


Fourier transforming U°!°*(a), as in (16.4.2), and multiplying the re- 
sulting equation corresponding to the first one in (16.8.1) by (~yp + Mc) ™™, 
or the second one by (—yp + Mc)°?°?, and so on, we get 


948 16 Quantum Description of Relativistic Particles 
(p? + M?c?) U%~** (p) = 0 (16.8.3) 


Therefore repeating the analysis given through (16.4.5)—(16.4.14), we have 


Yr Oe (x) = [ome dw, jevetagtee ee (p) 4 e ipr/hg@ 1. Ae (p) 


(16.8.4) 
with p° = +,/p? + M2c?, 
(yp + Mc) 291. @ 8102 0% (p) Zi 
(16.8.5) 
(yp + Mey, 1--44-0P4(p) = 0 
and 
(—yp + Mc)™"@_f12-%8(p) = 0 
; (16.8.6) 
(ap + Me)" _e--44-2F4(p) = 0 
Due to the symmetry of 6,°%''"°*(p) in the indices a1,...,@%, it may be 


written as a symmetrized sum of the positive energy Dirac spinors u(p, a) in 
(16.4.56) to satisfy the set of equations in (16.8.5). We may then write 


Der pie) = S- O64 443 0KYE) [u™ (Pp, 01) . --u*(p, ox) a(p, c) 
O15. 0KS 1 

=u (py, a)a(p,o) (16.8.7) 
with the €, taken to be the €4n given in (16.4.40). Here C(oj,...,0%30) 
is a symmetrization operation over the indices oj,...,0% thus leading to 
a symmetric ®,°'"°* in (a1,...,a%), and the o; take on the values +1 
corresponding to €4n. In (16.8.7), o denotes the number of +n occurring in 
[wu (p,o1)---u°*(p,o%)], ie., to the number of o1,...,0% taking the value 


+1. The a(p,c) are arbitrary (one component) coefficients. Clearly, o takes 
on (& +1) values. 

An explicit expansion for the symmetrization operation C(01,...,0%;0) 
may be given, in the spirit of the one given in (2.8.18), by 


1 ) 


esere o 7 k-o 16.8.8 
Klol(k—o)! 1OGe,  OGo, (9+1)" (9-1) (16.8.8) 


C(G4, 22530557) = 


where g;, g— are two independent variables. We note that for any given fixed 


value assigned to 0, C(o1,...,0%;0), after carrying out the differentiations 
with respect to go,,---+;Jo,, With the simple rule 
6) 
(941) = 60,,41; (16.8.9) 


09o; 
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is independent of gi, g_. For example, for k = 2, 


C(o1, 0230) = 89,,-1902,-1 (16.8.10) 

1 
C(o1, 02} 1) = Va (Oo1,41909,-1 + bo1,-1909,+1) (16.8.11) 
C(01, 02; 2) = 85, ,41809,41 (16.8.12) 


(see also Problem 16.21). 

The multiplicative nature of the transformation rule via the product 
(G)*141... (G)F* in (16.8.2), implies, from (16.2.29), that the spin operator 
has the additive nature 


GOP 1 OR Be (sy §0282 fr Be 4. 4 Ge B1 | §oew—1Bk-1 (Sows 


(16.8.13) 


h 
where (S)°? = 5 ()°? is defined in (16.2.46). 


With N = p/|p|, in the p-description, we may rewrite (16.4.58), in the 
present notation, as 


+ Nu(p, +1) = +u(p, +1) (16.8.14) 
Accordingly, from (16.8.7), (16.8.13), (16.8.14), we have 


1 P1,---;AnPk Te PE nh A... AK 
(B+ NyPreehs GHP (p, 9) = — [a — (k — 0)] 4°" (p, 0) 


where (& — 7) denotes the number of oj,...,0% taking the value aie 
stage, we write k = 2s, and set 
g=™Ms.t+ 8 (16.8.16) 
Since o = 0,1,2,...,2s, we may infer that 
Ms, = —8,-s4+1,...,5—-1,5 (16.8.17) 


and obtain from (16.8.15), 


(S . N)o fon 025325 @ , 91---Pos (p, me + s) — hm, @ 4.0%? (p, me + s) 
(16.8.18) 
thus establishing the spin s character of ®4°!"°?s(p,m, +s). Needless to 
say, the same conclusion is reached if we go to the rest frame of the particle, 
choose the quantization along, say, the z-axis, and apply the spin component 
S3 to 4°" (O,ms + 8). 
Similarly, we may carry out the expansion 


P_MFk(p a) = C(o1,..- 0%; 0) [v(p,o1)...u(p, o%)] O* (p, o) 


O1,..-0h=t 


(16.8.19) 
in terms of the charge conjugate spinors v(p,o) in (16.4.57), and repeat the 
analysis given above now applied to ®_°'*"°?*(p,o). The relevant details are 
left as an exercise to the reader. 
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16.8.2 M =0: 


For M = 0, we work in the chiral representation and with the measure 
2Mcdw, in (16.8.4) simply now replaced by dw,. The equations (16.8.5), 
(16.8.6) take the form 


(7'p) Deiat (p) -0 


(16.8.20) 


(7'p)*** ee aed ee (p) =0 


where the 0 in the subscript corresponds to mass zero. 
For example, for ®,5'"°?*(p), we may rewrite the above, more conve- 
niently, as 
(7°¥' . N)° B F102 (p) = DO (p) 
; (16.8.21) 


ar a seh a Oy... 
(7°Y' : N) eee Pp) = 8457 "*(p) 


where we have used the fact that p? = +|p|, and set N = p/|p]|. 

As a direct generalization of the analysis carried out in $16.5, we consider 
the set of operators which commute with the operators occurring on the 
left-hand sides of (16.8.21): 

(9/9 y' N) 717! 50282 goede 
5211 (-/9 -y! «NY 972 52989 GK FR 
O ) . (16.8.2) 


FOrPr | FAe-18k-1 (4/0 y! No 


The analysis in $16.5 shows that the operators commuting with the above 
operators are of the form 


(-/'8) 2294 fa2a,._ Ganb 


(16.8.23) 
/ Non nes 608-18 K-1 (3) * 


or ones involving two, or three, or...or k y/* matrices. Such an operator is 
(2) 20% Gerba. §aw—1Bk—1 (yf) nF (16.8.24) 


involving two 7° matrices. Also other operators commuting with the ones in 
(16.8.22) are 

(S . N)°2% 60282 Se Nesta 
(16.8.25) 
Pontes 6% 18-1 (S F N)oFe 
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or ones involving two, or three, or...or k (S- N) matrices, also operators of 
the form in (16.8.22) involving two, or three, or...or k (7/° y’ +N) matrices. 

To find simultaneous eigenstates of these commuting set of matrices is 
straightforward. To this end, we carry out the expansion 


BG (Pe)= DT Oleren-.- 450) 
1 


O1,..-Ok=t 


x [u' (p,a1)...u’*(p, o%)] a(p, o) (16.8.26) 
as in (16.8.7), where in the present notation 
/ 2 €4N ! rea 0 
u(p,+1) = , u(p,—-1)= (16.8.27) 
0 é-N 
(see (16.5.22)). 


From the definition of C(o1,...,0%;0) in (16.8.8), the following equalities 
are readily established (see Problem 16.22) 


C(+1,02,...0%30) = 1 £C(02.---0n50- 1) (16.8.28) 


k-oa 


C(=1,02,...0n30) = i/ C(02,..-0%30) (16.8.29) 


which in turn vanish for o = 0, o = k, respectively. 
The equalities in (16.8.28), (16.8.29) allow us to rewrite (16.8.26) as 


ory (po)= Sd) {+} ul (p,o1)...u!* (p, on-2) ap, 0) 


O1,-.-Ok—-2=+ 


(16.8.30) 
where 
{ - = a ) Tikes (p, +1)u’°?(p, +1)C(o1, . Ok-230 — 2) 
+ a 2) (w’™ (p, +1)ul”? (p 1) 4 yu (p Lu'**(p, +1)) 


: ‘Le —a)(k-o-— )) a (p, 1)u’*? (p, -1)C(o1,... oh-23.0) 
(16.8.31) 


Now consider the application of the first operator in (16.8.23) to 
b, 0k (p, o) given in (16.8.30). This leads from (16.5.24), (16.5.26) to 
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Ge)" B , Brera (p, o) 


= So {-} ules (p,oi)...u'*(p,ox-2)a(p,) (6.8.32) 


O1,;-.-Ok—-2=+1 
where 
O\O 
{-}'= Te d yas (p, +1)u!(p, 4)C(o1, + Ok-250 — 2) 
a(k—oa 
fat le (B,41)u/0*(p, 1) — wl (p, 2a, +1) 


x C(o1,...0n-2;0 — 1) 


(k—o)(kK-o-1) 49, ae . 
i k(k —1) U (p, 1)u (p, —1)C(o1,...o% 230). 
(16.8.33) 


Note, in particular, the minus sign within the square brackets in the second 
term in (16.8.33). 

Clearly, in order on (p,c) be an eigenstate of (7/°)™, either o = 0, 
or o = k, so that the second term in (16.8.33), in particular, vanishes. For 
o = 0, the first term in (16.8.33) also vanishes, while for o = k, the third 
term vanishes. For the latter we have, respectively, 


(8) wht (p,0) = OT (p,0), oo = 0, 
where 
OF? (p, 0) = u'™ (p, -1)... u’*(p, —1)a(p, 0) (16.8.34) 
or 
(-/8) 1! GP102--28 Dp) = 4BUO-% (Hk), G=k (16.8.3) 
where 
BY" (pk) = u/™ (p, +1)... u'*(p, +1)a(p, k) (16.8.36) 


Now it is obvious that to have ®{5°°*(p, a) as a simultaneous eigenstate 
of all the commuting operators enumerated above, we must have either o = 0, 
oro =k, ie, a(p,o) =0, foro =1,...,k-1. 

Upon writing k = 2s, and setting 0 = m, + s, we then obtain 


O45 °7#(p, ms + 8) = d(mMe, s)u'(p, +1)... u's (p, +1)a(p, 2s) 
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a d(ms, —s)u'™ (p, =k) as hoes (p, —l)a(p, 0) 
(16.8.37) 


written in terms of Kronecker deltas. 
For the helicity operator, as obtained from (16.8.13), (16.8.37) gives 


(S- IN) 21 P1022 Bae Dee (p,ms +8) 
= [hsd(ms, 8) — hsd(ms, —s)] 497°? (p, ms + 8) (16.8.38) 


yielding only two possible values for the helicity of a particle. 
For a massless particle of definite helicity, as discussed in $16.5, say of 
helicity chs, with « = —1 (or +1), we may set 


x 1 1 
OCs (p) = 5 +e?) a + er) 22efas H1.-P2e (1 + €)s) 
(16.8.39) 


where we recall that 7/°u'(p,¢) = eu’(p,<), and Cen (p) satisfies the set 
of equations in (16.8.21), and 


(S- N)%1 P11 0928s oe (p) = chs GO as (p) (16.8.40) 


One may treat the charge conjugate solutions in the same manner. 


16.9 Spin & Statistics 


This section is involved with the spin and statistics connection. We con- 
sider only spin 0 and spin 1/2. Familiarity with Lagrangian dynamics is es- 
sential. In the appendix to this section we provide, however, a fairly detailed 
treatment of the action integral including for systems admitting Grassmann 
variables. The method developed there is extended within this section to sys- 
tems with infinite degrees of freedom. Accordingly the reader is advised to 
consult this appendix while reading this section. 

First we examine the solutions of the free Klein-Gordon and Dirac equa- 
tions and interpret their expansion coefficients as operators for creation and 
annihilation of particles and anti-particles thus generating quantum fields. 
We set up field equations for both systems and corresponding Lagrangian 
densities which are of first order in 0,,, and are expressed in terms of Hermi- 
tian fields. We then introduce Schwinger’s ingenious constructive approach?” 
which treats such Lagrangians in a unified manner and leads naturally to 
the spin and statistics connection. Finally we use the results stemming from 
this analysis to establish the spin and statistics connection by examining the 
Lagrangian densities set up earlier. In the present section we do not dwell on 
the Spin and Statistics Theorem in arbitrary dimensions of space. 


32 Schwinger (1953); see also Schwinger (1951b, 1958a,b, 1961b). 
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For obvious reasons there has been much interest in the Spin and Statistics 
Theorem over the years since the early work of Pauli on the so-called exclusion 
principle named after him. I am pleased to see that the activities on this 
problem are still going on and even escalating in the recent literature. 


16.9.1 Quantum Fields 


The solution of the Klein-Gordon equation in (16.7.5) for free spin 0 
particles, i.e., for A“ = 0, may be quite generally written as 


= Vic2 lat> [A@)el”*/* + B*(p)eive/h (16.9.1) 


Qrh FE 


where the numerical factor Vhc? is inserted for convenience, pz = p-x—p°x®, 
p? = +1/p?2 + M2c?, «° = ct, and, of course, the mass M does not have to 
be the same as for the spin 1/2 case. 

Similarly for spin 1/2, the general solution in (16.4.44) of the free Dirac 
equation in (16.4.1) is given by 


= pome ear 0 [e (p, c)a(p, )e?*/" + v(p, a)b*(p, a)e~ iP?!” 


(16.9.2) 

The expansion coefficients A(p), a(p,a) correspond to positive energy 
solutions, while B*(p), b*(p,o) correspond to the negative energy ones. 

To overcome the dilemma of negative energy solutions and associated in- 
stability problems, A(p), a(p,c), in quantum field theory, are interpreted as 
annihilation operators of the corresponding particles of spin 0 and spin 1/2, 
while their adjoints A'(p), a'(p,c) as creation of the particles in question 
acting on the vacuum state. Similarly, B(p), b(p, co) are interpreted as annihi- 
lation operators of the associated anti-particles, and their adjoints Bt (p,c), 
bi(p,o), (B* (p,c) — Bi(p,c), b*(p,c) — di (p, a)) as creation operators of 
the anti-particles, all with positive energies, acting on the vacuum state. 

The operators, A(p), A'(p), B(p), B'(p) may be written in terms the 
field 6, now as an operator, integrated over all space and evaluated at any 
given time t by the method of Problem 16.23 as follows. 

We use the facts that 


ih | 3x @iP2/h By ei'®/h — 9p (2nh3) 53(p — p’) (16.9.3) 


and 
ih | 3x eiP2/h By eip'2/h — 9 (16.9.4) 


where 


Bo= Bo = do (16.9.5) 
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to obtain 
ih 3, g-ipa/h 5 
A(p) = Tie d°xe Oo O(a) (16.9.6) 
t ih 3, pipa/h 5 
B'(p) d’x e'?*/" 09 B(x) (16.9.7) 


- Vv he? 


and similarly for their adjoints. 
The operators a(p,c), b'(p,a) may be also extracted in analogous rela- 
tions as in (16.6.8), (16.6.9), but evaluated at any time t, as follows 


Hoey = Jo’ eniPe/ a (p, W(x) (16.9.8) 


Hoes [ox iP2/h yt (p, oc) U(a) (16.9.9) 


in terms of the operator U(x). Similar relations as in (16.9.8), (16.9.9) may 
be also written for a'(p,c), b(p,c). 

For physical interpretations and applications, commutativity properties 
of the above creation and annihilation operators have to be consistently es- 
tablished. 

To establish commutativity properties of A(p), A'(p’), B(p”), B'(p”), 
we note from (16.9.6), (16.9.7) and the equations for their adjoints, that 
commutativity properties of the quantum field ®(x), as well as of its adjoint 
61 (zx), have to be established at all points in space including at different ones, 
but may be taken all at the same time. A similar statement holds for the 
operators a(p,c), al(p’,o’), b(p”,a”), b'(p’”, 0”) in terms of the quantum 
field U(x), as well as of its adjoint U'(x), taken at different points in space 
also. 

The burden of this section is to establish commutativity properties of 
(x), ®'(x) as well as of U(x), W'(x), and in turn establish the spin and 
statistics connection for particles of spin 0 and of spin 1/2. 

We next set up Lagrangian densities for the above two systems. 


16.9.2 Lagrangian for Spin 0 Particles 


We consider the wave equation in (16.7.8) for spin 0 free particles 


mn 
(Fa.+ =) = =0 (16.9.10) 
1 


where © is defined in (16.7.9), and the 6" matrices are given in (16.7.10), 
(16.7.11). The matrices 6"/i are real, 


(3°)' = 6°, (p)'=-6 (16.9.1) 
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To obtain the equation satisfied by the adjoint of =, we introduce the 
matrix 


10 0 0 0 
' 0-10 0 0 
A=2(6°)"-I=|]00-10 0 (16.9.12) 
00 0-10 
0000 1 
which satisfies the relations 
[A, 6°] =0, {A,B} =0 (16.9.13) 


Accordingly, by taking the adjoint of (16.9.10), multiplying the resulting 
equation from the right by A, and using (16.9.11), (16.9.13) we get 


He 
sta (-§ ie ve) zy (16.9.14) 
1 


We may rewrite (16.9.10) in terms of real components by defining, in the 
process, 


1 : if : 
& = — (6, +182), x*= 75 (xf + ixd) (16.9.15) 


V2 


where the y“ are given in terms of ® in (16.7.6), now for A¥ = 0, ie., 


h 
Hore 16.9.16 
er ( ) 


Introducing the ten-component entity 


Mc/ 9.1.2.3 0.1.2.3 ~ 
@ = (¢a) = i (x3 Xi X1 Xi 1 X2 XQ X2 X93 2) (16.9.17) 


with a=1,2,...,10, and the 10 x 10 matrices 


BY 
pe-[i? pet? (16.9.18) 
0 pe ’ 0 A . . 
i 
we may rewrite (16.9.10), (16.9.14) as 
M 
(Ba, + *) ¢=0 (16.9.19) 


or (-2a, Ee ve) =0 (16.9.20) 


16.9 Spin & Statistics 957 


written as matrix multiplication. The advantage of these equations over the 
earlier ones is that they are written completely in terms of real objects. 

The following two matrices B”, I’ are of central importance in our spin 0 
case. In particular, they have the following properties 


FB B® (rBt)! = ree, ri=T  (16.9.21) 
and 
(rB*)'’=-(rB4), rar. (16.9.22) 


We introduce the Lagrangian density for spin 0 particles in which the 
derivative 0, appears linearly 


= ~~ [OT B"0,6 — 0,60 B"§] — “or (16.9.23) 


and the properties of 'B“, I in (16.9.21) ensure the Hermiticity of the 
Lagrangian density. 

We will see below, while studying Schwinger’s Theorem, how the La- 
grangian density in (16.9.23) leads to the equations (16.9.19), (16.9.20) sat- 
isfied by w and investigate the nature of the spin and statistics connection. 

We next set up the Lagrangian density for free spin 1/2 particles. 


16.9.3 Lagrangian for Spin 1/2 Particles 


We work in the Majorana representation, with the gamma matrices given 
in (16.1.28), (16.1.29). To simplify the notation, we will still denote these 
corresponding gamma matrices by 7", and recall that in this representation 
the matrices y“/i are real. 

In the Majorana representation, defining the 8 x 8 matrices 


bb 
re = c ”) (16.9.24) 
and writing the Dirac spinor W in terms of real and imaginary parts 
1 
Uv = (VY, + iV2) (16.9.25) 


v2 


we may rewrite the Dirac equation in this representation as 


rm. M 
(Ha, ie =) = (16.9.26) 
i h 
where Z 
b = (He) = (uw 0? W3 ww) 2 WB v4) (16.9.27) 


For the adjoint of the equation (16.9.26), we have 
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ve 
yr (-= 8, + ve) =i (16.9.28) 


The matrices of central importance for spin 1/2 here are the matrices I“. 
In particular 


(rereyiarore, (mylar, (PM)? =7 (16.9.29) 


ot 


(Pm) spre, pao (16.9.30) 


The Lagrangian density of spin 1/2 is defined by 


Rit pore Mc 0 
L= i(¥ ; On — Ow : v) meee (16.9.31) 


The Hermiticity of the latter follows from the Hermiticity of P°r’, IP. 

As for the spin 0 case, we will show how the Lagrangian density in 
(16.9.31) leads to the Dirac equations in (16.9.26), (16.9.28) and investi- 
gate the spin and statistics connection after having developed Schwinger’s 
constructive approach. 


16.9.4 Schwinger’s Constructive Approach 


The Lagrangian densities in (16.9.23), (16.9.31) are of the form*? 
1 1 
L(x) = 5 (male) Qe,0uM6(x) — Opna(z)Qasno(@)] + 5 Malt) Qavno(x) 
(16.9.32) 
where the numerical constant matrices QQ“, Q are such that 


(Q*)'=-Q", Qh=Q (16.9.33) 
(see (16.9.21), (16.9.29)), thus guaranteeing that “is Hermitian, and either 


(Q")" ==9", .Q" =Q (16.9.34) 


as in the spin 0 case in (16.9.22), or 


(Q*)' =Q@4, Q'=-Q (16.9.35) 


as in the spin 1/2 case in (16.9.30). 
Here it is worth noting that any square matrix C' may be written as the 

sum of a symmetric matrix S and an anti-symmetric one A, 
1 


Cab = 2 (Cab t Cha) t 


1 
5) (Cap Coa) 


33 See also the appendix to this section. 
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= Sap + Aap (16.9.36) 


Al =—A, S' = S. For a matrix C such that Ci = —C, this means that 
At = —A, S$? = —S which correspond to the cases encountered above. It is 
important to note that in the Lagrangian density in (16.9.32), a symmetric 
matrix Q goes with the anti-symmetric ones Q", while an anti-symmetric ma- 
trix Q goes with the symmetric ones Q“ as shown, respectively, in (16.9.34), 
(16.9.35). 

The Lagrangian density in (16.9.32) is local, i.e., all the terms in it are 
evaluated at the same spacetime point. At a given time t = x°/c, locality 
means that we may add up the Lagrangian densities “(ct,x) evaluated at 
different points to obtain a Lagrangian L(x°) = [d?x Y(ct,x) since relativ- 
ity implies that different regions in space at the same time are dynamically 
independent as no signal can propagate between them. Here the position 
variable x specifies the infinitely uncountable number of degrees of freedom 
in the fields n(x°, x) = nx(ct), with the latter as dynamical variables, in the 
same way that the index i in q;(t) specifies the various degrees of freedom 
in particle mechanics. One may then define the action associated with the 
Lagrangian density “(x) in (16.9.32) by 


A= ; [ox ex Piggy e : | (dx) Y(2) (16.9.37) 


where we have integrated over all time. The action in (16.9.37) with La- 
grangian density in (16.9.32) should be compared with the one in (A-16.9.9). 
In the present case we are, however, dealing with a system of infinite degrees 
of freedom. 

To emphasize the functional dependence of the action on the field 7, we 
write L(x) = L[n(x)]. We consider the variation of the action in response 
to variation of the field n(x) — n(x) — 5n(x) to first order in 5y(x). The 
variation of the Lagrangian density is then given by 


6-2 |n(2)] = 2|n(2)] — Z[n(x) — dn(2)] (16.9.38) 


paying special attention to the order in which 67(x) appears relative to n(x) 
and 0,7(z) in (16.9.38). 

Following Schwinger, we consider variations of the action as arising from 
c-number variations 67(x) of the field. We will then encounter that such 
variations we were set to achieve with 67(x) commuting with the field itself 
is possible only for the system satisfying the conditions in (16.9.34), while 
57(x), as a c-number Grassmann variable, anti-commuting with the field, is 
possible only for the system satisfying the conditions in (16.9.35). 

The variation of the action, multiplied by c, corresponding to the variation 
of the Lagrangian density in (16.9.38) is explicitly given from (16.9.32) to be 


[eae ezine) = flax) (J tons2r Qk duomle) + nae) 2p8m2) 
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—O5na(X) QevMo() — Opna(#) Qandno())] 


+ ; [S12a(x)Qavt (a) + nel) Qus8ma(2)] 
= 5 [aay ae (5 malar Qtysnato) — Snel) Qh 


+5 flax) [ona(2)Q%,2um(e) ~ Apna) Qk8nm(2) 
au d1a(X)Qavne(x) a Na(®)Qavd(x)] 
(16.9.39) 


with the order in which 57(2) appears kept intact. 
To treat both cases in (16.9.34), (16.9.35) simultaneously, we may write 


ab = €Qba,  Qad = —€Qba (16.9.40) 


where € = —1 corresponds to the case in (16.9.34), while « = +1 corresponds 
to the case in (16.9.35). 

To simplify the expressions within the second pair of square brackets on 
the extreme right-hand side of (16.9.39) we use the relation 


Ont (x) = On flowy 5 ) O4(a! — x) ny (2’) 


= dia mo(2')0,6°(2' — a) 


- few) n(x" )01,6*(a" — x) (16.9.41) 
to write 
feo) QM, 5tta(t) Oy moat) = — foxy(ae') Sa) no 2’) Qe ,01,5%(0! — 2) 
(16.9.42) 


Similarly, we have 


feo Q4,Ouna(ae)8no(22) = — foo) (de!) ral”) (0)Q", 0,54 (a! — 2) 
Se foxy(ae') race!) 5m (20) QE, 07, 54( ce! — 2) 


— fox(as') mot" )5rpa (20) Q4,0,54(ce! — 2) 
(16.9.43) 
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Thus the second integral on the extreme right-hand side of (16.9.39) 
(which is not the integral of a total differential) may be rewritten as 


=F flaz)(ae’) [Sna(e)n(x") — ene(x')na(2) 


x [QM ,0,04(a! — 2) — Qard*(a —2)] (6.9.44) 


where we have used (16.9.40). It is remarkable that one is able to combine the 
terms depending on Q", with the terms depending on Qa» in (16.9.44) pre- 
cisely because of the opposite symmetry properties of Q“ and Q in (16.9.40). 

From the expression in the first pair of square brackets in (16.9.44) we 
learn that a c-number variation with d74(2) commuting with the field 7,(2’) is 
possible only for ¢ = —1 satisfying the conditions in (16.9.34), ie., for spin 0, 
while a variation 67,(a) as a c-number Grassmann variable, anti-commuting 
with the field p(x’) is possible only for ¢ = +1 satisfying the conditions in 
(16.9.35), i.e., for spin 1/2, otherwise the expression in the first pair of square 
brackets is identically equal to zero. Therefore 


[5na(x), mo(a")] = 0 (16.9.45) 
for ¢ = —1, while 
{5na (2), m(2’)} =0 (16.9.46) 
for e = +1, ie., 
S1a(x) mo(x") = —em(2') da(2). (16.9.47) 


In both cases, (16.9.44), (16.9.39), (16.9.40), (16.9.47) then give 


[eo dL [n(x)] = feo) On a dn (x) 


+ flac) Sna(2) (Qk2mo(z) + Qusnm(a)]  (16.9.48) 
For 574(a) arbitrary, the principle of stationary action gives the field equation 


OQ d.n(2) + One) =0 (16.9.49) 


As for the system in (A-16.9.22)-(A-16.9.32), the variation of the ac- 
tion, multiplied by c, in (16.9.48) now extended to a system with an infi- 
nite (uncountable) numbers of degrees of freedom introduces the generator 


( = 0/A(ct)), : 
G= ~ [ax MNO" Sn(a) (16.9.50) 


which brings about the change of the dynamical variables (a) by 6n(a)/2 
with 
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bn(x 1 
{ = na) GI (16.9.51) 
In detail, (16.9.51), with 2’? = 2°, reads 
de (x) 1 0 3 ! d(x’) 
= — 8, | d?x! [ne(2), a(x!) 
ee) a's fax! [re(e), tale) 


= Fos fe°x! (nel )nala!) + ene(ayne(o)) "MP (16.9.52) 


or 
[ne(x) a(x") + ena(x")ne(x)] Q24 = ihc due 6°(x — x’). (16.9.53) 
We apply the above results to spin 0 and spin 1/2. 


16.9.5 The Spin and Statistics Connection 
Spin 0 
By comparing (16.9.23) with (16.9.32) we see that 


Qt=-rB", Q= -“*r (16.9.54) 


where I’, BY are given in (16.9.18). From (16.9.49), we obtain 


iy (Bra, a *) ¢=0 (16.9.5) 


which upon multiplying from the left by I’ gives the field equation in (16.9.19) 
and hence finally to the Klein-Gordon equation. 

From (16.9.21), we note Q° = —r'B° = —B°®, with B® given in (16.9.18) 
and 3° in (16.7.10). That is, 


0 = —6d(a, 1)d5(b, 5) + 6(a,5)6(b, 1) 
— 6(a,6)5(b, 10) + 5(a, 10)5(b, 6) (16.9.56) 


written in term of Kronecker deltas. 
From (16.9.53), (16.9.56), (16.9.17), with go = ¢° ande = —1 (see 
(16.9.33), (16.9.40)), we obtain 


— [be(x), 1(x')]6(0, 5) + [be(2), o5(2’)] 5(b, 1) 


— [de(x), 6(x")] 6(b, 10) a [de(x), d10(2")| 6(b, 6) = ihc dbe (x Pt x): 
(16.9.57) 


That is, the only non-vanishing commutators are 
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[1(z), o5(2")] = ihe d°(x — x’) (16.9.58) 

[¢6(a), d10(a’)] = ihe d(x — x’) (16.9.59) 
From (16.9.17), (16.9.16), these give 

[1 (x), Bi (a")| = ihic?d3(x — x’) (16.9.60) 

[Bo (x), Bo(a")| = ific?d3(x — x’) (16.9.61) 


For the Klein-Gordon field ®(a) in (16.9.15), we then have, in particular, 
@?= 2") 


[®(a), 61 (a’)] = ihe?5?(x — x’) (16.9.62) 
[®(x), B(x’)| =0, [®(x), '(2’)] =0 (16.9.63) 
[(a), 64 (a’)] =0 (16.9.64) 


From (16.9.62)—(16.9.64), and the expression for the creation and annihi- 
lation operators At(p), A(p), as obtained from (16.9.6), we have 


[A(p), At(p’)] = * ap (2nh)°6°(p —p’) (16.9.65) 


[A(p), A(p’)] = 0, [AT(p), AT(p’)] =0 (16.9.6) 


If we denote the vacuum state by |vac), that is, in particular, A(p) |vac) = 
0, then a two-particle state involving spin 0 (bosons) of momenta p, p’ may 
be constructed as follows 


Al(p)A'(p’) |vac) = |p, p’) (16.9.67) 
The second commutator in (16.9.66) then implies that 


Ip, Pp’) = |p’, p) (16.9.68) 


which is the spin and statistics connection for bosons in its simplest form. 
Commutation relations involving the anti-particle operators are similarly ob- 
tained. 


Spin 1/2 


Upon comparing (16.9.32) and (16.9.31) we obtain 


Pore Mc 
a = — 
Q —, Q=-— 


where I is defined in (16.9.24). Equation (16.9.49) gives the equation 


re (16.9.69) 


ce Ge + = ~=0 (16.9.70) 
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which after multiplying it from the left by °° gives the field equation in 
(16.9.26). 

The matrix Q°, relevant to the relation in (16.9.53) takes particularly 
the simple form iJ, where J is the identity matrix, as follows from the first 
relation in (16.9.69). Hence (16.9.53) with 2° = 2’° gives 


{e(x), Pa(a’)} = Redeq 5° (x — x’) (16.9.71) 


where we have used the fact that ¢ = +1 (see (16.9.35), (16.9.40)). 
From (16.9.27) and (16.9.25), we conclude that for the Dirac field U (2) 
with 2° = 2”: 


{Wa(a), Uh (a')} = he da 6°(x — x’) (16.9.72) 
{W,(2),W(2’)}=0, {¥i(a), Wf (a’)} =0 (16.9.73) 


where now a,b = 1,2,3,4. 

The expressions for the creation and annihilation operators a‘(p,c), 
a(p’,c) as obtained from (16.9.8), then imply the following anti-commutation 
relations: 


{a(p, a), a'(p’,o’)} = he P bq (2ah)*O"(p —p’) (16.9.74) 
{a(p,),a(p’,o’)} =0, {al(p,c),a!(p’,o’)} =0 (16.9.75) 


Again if we denote the vacuum state by |vac), that is, in particular, 
a(p,a)|vac) = 0, then a two-particle state involving two spin 1/2 (fermi- 
ons) particles of momenta p, p’ and spin indices o, o’ may be constructed as 
follows 

a'(p,o)a'(p’,o’) |vac) = |p, a; p’,o’) (16.9.76) 


The second anti-commutation relation in (16.9.75) then implies that 
IP, o;p’, 0’) = — |p’, 0’; p,c) (16.9.77) 


which is the spin and statistics connection for fermions in its simplest form. 
Anti-commutation relations involving the anti-particle operators are similarly 
obtained. 

Similar methods may be applied to spin 1 particles. There are some com- 
plications, however, associated with higher spins and we will not go into such 
details here. In theoretical descriptions of higher spin particles the latter are 
often considered as composite particles and not as fundamental as the lower 
spin ones. 
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Appendix to §16.9: The Action Integral 


As in Chapters 10 and 11, we use the (q, p) language and to simplify the 
notation, we often suppress the indices in (q,p;) corresponding to various 
degrees of freedom. For example, we may often write }°, pig; simply as pd. 

The action integral is defined by 


To 1 
A= E (pdq + dqp) — H(q,p, t)dt (A-16.9.1) 
Ty 
written in a symmetrized form in the product pdq to ensure Hermiticity, 
where H(q,p,t) is the Hamiltonian and should be Hermitian. 
We consider variations of the action that arise by the variation of the 


dynamical variables only: 
q>G=4- 4 (A-16.9.2) 


p—-p=p-— sp (A-16.9.3) 


Since &(dg) = 5(q¢(t+ dt) — q(t)) = d&g, and so on, the variations in (A- 
16.9.2), (A-16.9.3) lead to 


To 1 
bA= | d E (pdq+ iu) 
Ty 
To 1 
a i: E (Sp dq + dq dp — dp q — dqdp) — 5Hadt (A-16.9.4) 
Ty 
where 
5H(q,p,t) = H(q,p,t) — H(q— 89, p — 5p, t) (A-16.9.5) 


For infinitesimal numerical changes 5q, dp in (A-16.9.2), (A-16.9.3) which 
commute with the dynamical variables, we may write 


T2 T2 
dA = d[p dq] + | [5p dq — dp $q — 5Hdt] (A-16.9.6) 
T Ty 
Here we recognize the pq term in the first integral (see (2.3.13)) associated 
with the boundary terms, as the generator for the infinitesimal transformation 
in (A-16.9.2). To obtain the generator for both transformations in (A-16.9.2), 
(A-16.9.3) in a unified manner, one rewrites the action integral in (A-16.9.1) 
in a more symmetrical way in qg and p as follows by omitting, in the process 


the integral 
se i 
az [., gp + Pa) 
Ty 
of a total differential, to obtain 


T2 


1 
A= E (pdq + dqp — dpq — qdp) — H(q,p,t) dt} . (A-16.9.7) 
Ty 
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It is more convenient to set 
= _ fof + (O-L 


and rewrite A as 
T2 1 
Ti 


with the indices enumerating the various degrees of freedom. 
For infinitesimal variations 


Z272F=2-82 (A-16.9.10) 


with 5H (z,t) = A(z,t)—H(z—&z,t), we have for the corresponding variation 
in A 


T2 O.. 3 gapped 
bA= al; (2 OH by, — ba, 28 “1)| 
aa ae 2 


T2 
Ty 


with the order in which the variations 5z; appear kept intact. 

We now consider numerical variations which commute with the dynamical 
variables, i.e., [zi(t), 52; (’)] = 0. From the property of Q in (A-16.9.8), the 
expression for 5A simplifies to 

ee ore % 
SA= aS 5s) +f [zi Qij de; — 5Hat (A-16.9.12) 


Ty 


The principle of stationary action gives 
SH = 62; (t)Qi52;(t) = —2i(t)Qi52;() (A-16.9.13) 


and the variation induces a generator 


t 
G= a 2 5.14) (A-16.9.14) 
which is Hermitian, and generates a unitary operator U = 1+ iG/h for 


infinitesimal 5z, changing the dynamical variables z by $z/2, i-e., 


bz; 1 
a = 5 zi Gl (A-16.9.15) 
The change brought about in an operator B(z(t),t)) by the change of 
the dynamical variables z(t), as generated by the generator G, is linear in 
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5z(t), and should also coincide with the transformation given in (A-16.9.15) 
for B (z(t),t) = z(t). Accordingly quite generally, 
5B 1 
—=—|B,G A-16.9.16 
2. 53.0 (A16.0.19 
In particular, (A-16.9.15) implies that 


52; (t 1 & 
Pal) = s(t) en | Qe (A-16.9.17) 
or 
[z;(t) 5 tall Qre =ih O50. (A-16.9.18) 
From the expression for Q in (A-16.9.8) this gives 
[pi(t) ,p;(t)] a 0, [ai(t) ’ q;(t)| ne 0, 9; (t) ,Pr(t)] _ ih djn 
(A-16.9.19) 
as expected. On the other hand, from (A-16.9.13), (A-16.9.16) we have 
6H 1 5z0(t) &z¢(t) 
oe = ih [H, ze(t)| Qne 2 = —2;(E)Qie 2 (A-16.9.20) 
leading to 
ihd;(t) = [z;(t), 0] (A-16.9.21) 


again as expected, verifying the consistency of the analysis. 

We note that Q in (A-16.9.8) is anti-Hermitian (Q' = —Q) which en- 
sures the Hermiticity of the Lagrangian (the integrand) in (A-16.9.9). It also 
satisfies the restrictive relation Q' = —Q as it is real. 

Now any square matrix may be written as the sum of a symmetric ma- 
trix and an anti-symmetric one (see (16.9.36)). If the square matrix is anti- 
Hermitian then so are its symmetric and anti-symmetric parts. This allows 
us to consider, more generally, two classes of Lagrangians: 


i — 
L= a (MQ; — MQiinj) — H(n,t) (A-16.9.22) 

with 7 real, as before, where (Q=[Qi; |) 

Qi =-@Q (A-16.9.23) 
and either 

Q'=Q (A-16.9.24) 
or 

OT S26 (A-16.9.25) 


The system satisfying condition (A-16.9.25) was already considered above 
for a specific matrix Q. The same analysis as above then gives for an arbitrary 
matrix Q satisfying (A-16.9.23), (A-16.9.25), 
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aS ont) (A-16.9.26) 


and, in particular, 
[nj(t), me(t)] Que = ih de, = =Q'=-Q. (A-16.9.27) 


We will see that c-number variations 6n;(t) commuting with the variables 
n,(t’) is possible for Q satisfying (A-16.9.25) only, while c-number variations 
anti-commuting with the variables, so called Grassmann variables already 
encountered in Chapters 6 and 10, are possible only for Q in (A-16.9.23) 
satisfying (A-16.9.24). 

To reach the above conclusion we may refer to (A-16.9.11) where now 
Q (Qt = —Q) is arbitrary but satisfies either (A-16.9.24), (A-16.9.25). For 
Q satisfying (A-16.9.25), 5z; cannot anti-commute with z; otherwise both 
expressions within the round brackets in the two integrals in (A-16.9.11) will 
vanish. On the other hand, for Q satisfying (A-16.9.24), 5z; cannot commute 
with z; otherwise both expressions in the round brackets just mentioned will 
vanish again. Therefore, we may state that 


[5ni(t),nj(t’)] =0, QT =-Q (A-16.9.28) 
{dn(t),n(t’)} =0, Q'=Q (A-16.9.29) 


are admissible for the corresponding matriz Q but not vice versa. 
In both cases, we have, by referring to (A-16.9.11), (A-16.9.14), that 


5 
G=nQ s (A-16.9.30) 


and, in particular, the commutator in (A-16.9.15), together with (A-16.9.28), 
(A-16.9.29) give (Q=[Qyy]) 


[nj (t),me(t)] Que =ihdje, Q'=-Q (A-16.9.31) 


and 
{nj(t),me(t)} Qe =ihdje, Q'=Q. (A-16.9.32) 
The above results are extended to systems with an infinite degrees of 
freedom for actual physical systems which exist in the just given section in 
studying the spin and statistics connection. Here the matrix Q corresponds 
to Q°/e in (16.9.32). Further generalizations as arising, for example, from 
more complicated variations are possible but we will not go into these here. 


Problems 


16.1. Find the expressions of the gamma matrices as obtained from the 
Dirac-Pauli representation via the transformation G given in (16.1.30) 
for fixed n and 0. 


16.2. 


16.3. 


16.4. 


16.5. 


16.6. 


16.7. 
16.8. 


16.9. 


16.10. 


16.11. 


16.12. 


16.13. 


16.14. 


16.15. 


16.16. 


16.17. 
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Establish the transformation of the matrix y°(y +a +1) given in 
(16.1.35) via the transformation G defined in (16.1.30). 

Verify the expressions (16.2.25)—(16.2.27) as obtained from (16.2.20)— 
(16.2.23) to obtained the explicit expression of the transformation Ug 
of a spinor for pure boost in (16.2.28). Show that Ug is not unitary 
but it satisfies the identity in (16.2.24). Show, as expected, that the 
transformation Up for pure rotations in (16.2.29) is unitary. 

In analogy to the analysis carried out through (16.2.20)—(16.2.28) for 
pure boosts, carry out the corresponding one leading to the expression 
for the transformation of pure rotations Up given in (16.2.29). 
Establish the Lorentz vector character of U(x)y"W(zx) in (16.2.64). 
Derive the expression for the transformed spin S’ in (16.3.6) from 
(16.3.2), and prove the second equality in (16.3.9). 

Obtain the PCT product transformation rule of a Dirac spinor. 
Derive the normalization conditions in (16.4.23), (16.4.37), (16.4.39), 
for u(p,c), v(p,a’). 

Show that for the Dirac Hamiltonian in (16.4.51), |[H ,S] 4 0, [H,L] 4 
O but for the total angular momentum J = L +S, [H,J] = 0, i-e., 
total angular momentum is conserved. 

Show that the time evolution operator exp(—itH/h) via the Dirac 
Hamiltonian in (16.4.51) takes the simple form given in (16.4.75). 
Show that the only matrices in the complete set of 16 matrices in the 
ordered set {I%,} (see (A-16.1.1), (A-16.1.7)) in the chiral representa- 
tion that commute with H’ in (16.5.15), apart from the identity are 
7/° and linear combinations of the matrices in (16.5.17). 

Carry out the details in the application of GxG! leading to the 
expression in (16.6.18) for the Foldy-Wouthuysen-Tani representation 
x’ of x. 

Derive the time-dependent expression for the position operator x’(t) 
in (16.6.30) by using the time evolution operator given in (16.6.28). 
Derive the Pryce-Newton-Wigner position operator X in (16.6.38) as 
obtained from the transformation given (16.6.37), verify the commu- 
tation rules in (16.6.39) and obtain the expression for corresponding 
velocity given in (16.6.40). Finally derive the time-dependence of the 
Dirac position operator in (16.6.41). 

Carry out the transformation eP'/e Hye "/¢ to obtain, as a first step 
of the diagonalization process of the Hamiltonian Hp in (16.6.56), and 
obtain the expression given in (16.6.62). What is the explicit expres- 
sion for O” in the latter equation?. 

Carry out an expansion of F in (A-16.6.35) in powers of Z?a? and 
compare with the results obtained in 87.5. 

(The Klein-Paradox) Solve the Dirac equation in a potential which 
varies only along the direction x given by V(x) = 0 for x < 0, V(x) = 
Vo for x > 0. For a given energy E = p°c, with 0 < E < Vo determine 
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16.18. 


16.19. 


16.20. 


16.21. 


16.22. 


16.23. 


16.24. 
16.25. 


16 Quantum Description of Relativistic Particles 


the reflected and transmitted current through this potential step. The 
paradoxical result encountered in your physical interpretation is called 
the Klein-paradox. 

Verify that with the definitions in (16.7.6), (16.7.9), the Klein-Gordon 
equation may be rewritten in the form in (16.7.8) which is referred 
to as the Duffin-Kemmer-Petiau equation with the @" matrices as 
given in (16.7.10), (16.7.11). Establish also the validity of (16.7.12) as 
satisfied by the G’ matrices. 

Show that the Hamiltonian of spin 0 particles in the Feshbach- Villars 
form in (16.7.40) is diagonalized to the form in (16.7.43) via the trans- 
formation matrix G in (16.7.41) and obtain the transformed position 
operator in (16.7.44). Derive also the time-dependence of the latter as 
given in (16.7.50). 

Find the position operator of a spin 0 particle in the original Feshbach- 
Villars representation whose Foldy-Wouthuysen-Tani representation is 
x. Interpret your result. 

In analogy to the symmetrization operation C(o1,02;0), for o = 
0,1, 2, in (16.8.10)—(16.8.12), find the explicit expressions for the sym- 
metrization operation C(01,02,03,04;0) for o = 0,1,2,3,4. 

Prove the properties of the symmetrization operation C(1,...,0%;0) 
given in (16.8.28), (16.8.29). 

Prove the equations in (16.9.3), (16.9.4), and show that A(p), B'(p) 
are given by (16.9.6), (16.9.7) by integrating ®(x) over all space and 
evaluated at any given time. Write the integral expressions for A'(p), 
B(p) as well. 

Show that the second equality in (16.9.39) follows from the first one. 
Verify all the commutators involving the Klein-Gordon field ®(x) as 
obtained from the system in (16.9.57). Work out also all the commu- 
tators involving the operators A, At, B, Bt as obtained from (16.9.6), 
(16.9.7). 


Mathematical Appendices 


I 


Variations of the Baker-Campbell-Hausdorff 
Formula 


The purpose of this appendix is to derive expressions for the product of 
two exponentials exp(A) exp(B) of two non-commuting operators A and B, 
as well as of taking the derivative of exp(A(r)), with respect to a variable 
7, of operator-valued functions A(r) which are not necessarily linear in T. 
Several formulae will be given. 


1. Integral Expression for the Product of the 
Exponentials of Operators 


By carrying out the derivative with respect to the parameter A in the 


following 
d 


dX 
gives upon integration over A from 0 to 1, an integral expression for a product 


[e*4 e*2 | = e*4(A— B)e 4? (1.1.1) 


1 
eet =1+ far e4A(A-— Bye. (1.1.2) 
0 


2. Derivative of the Exponential of Operator- Valued 
Functions 


To take the derivative of exp(A(r)), with respect to 7, where A(r) is an 
operator valued function, it is wrong to say that it is given by A’(r) exp(A(r)) 
or by exp(A(r)) A’(r). This is not true even for 


A(r) => Aj + 7A» (1.2.1) 


where [A1, Ao] ~ 0. It is true, however, for A(r) = 7A. More generally, 
(1.1.2) gives 
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1 
eAtrt ane AUT) 1 4 | dd PAC+AD [A(r + Ar) — A(r)] eA. (12.2) 
0 


If we multiply the latter from the right by exp(A(r)), this gives for Ar > 0 


1 
jeAir+ar) = eA] ae ff dd AM Al (7) ae) Alt) (1.2.3) 
0 


or 


1 
dar) = / dd A) Al(7) eae A(t). (1.2.4) 
dr (0) 
Similarly, one may derive that 
1 
G CAlr) = pA) | | dre 4M) A’(r) oe] (1.2.5) 
dr (0) 


One may, of course, carry out a formal Taylor expansion in 4 and integrate 
term by term over 4 in (1.2.4) to obtain 


[A@), ... [A(r), A'(7)] 0] 
(n+1)! 


A A(r) 
ao = e (1.2.6) 


n=0 


where n denotes the number of A(r)’s that appear in each term in the sum- 
mand. 
As an example, consider the seemingly simple expression of the form in 
(1.2.1) 
A(r) = aip + bar (1.2.7) 


where a, 6 are some real constants, and p is the momentum operator 


p = —ihd/dz. Then A’(r) = ba, [A(r),A’(7)] = aiblp,z] = abh, 
Aw), [A(r) , ’(7)] = 0. Accordingly, (1.2.6) gives 


d bh 
qy ex (aip + barr) = (0 + ) exp (aip + bar) (1.2.8) 
= 


and is neither equal to A’(r) exp (A(r)) nor to exp (A(r)) A’(r). Note that 
(1.2.5) also gives 


[[--- [4 AM]. 


an (1.2.9) 


d air) _ AC) 
ae =e dX 


From which we also obtain 


d bh 
qo (aip + bar) = exp (aip + bar) (0 - *) . (1.2.10) 
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3. The Classic Baker-Campbell-Hausdorff Formula 


To obtain the formula in question, let G(r) = 7(A+ B)+C(r), such that 
C(0) = 0, C’(0) = 0. Then (1.2.6) or (1.2.9) lead to 


72 


e@) =14(A+B)rt le") ahah B)’| oy 


3 
+ leno + : [C’”(0) (A+ B) + (A+ B)C"(0)] +(4+ By | — 
See (1.3.1) 
On the other hand 
2 
e478 —14(A+B)7 + (B?+2AB + A’) 5 
3 
(BP SAR? 4 OB EA) en, (1.3.2) 


3! 


Upon equating (1.3.1) and (1.3.2), and comparing the coefficients of 7? 
and 7° we obtain 


C"(0) = [A, B] (1.3.3) 


CO’ (0) = = |A,[A, B]] + = [B,[B, A]]. (1.3.4) 


1 NO ed 


We may also write 


72 


3 
By tO" Olay +e (1.3.5) 


exp G(r) = exp |(A+ B)r + C” (0) 


which from (1.3.3), (1.3.4), and the fact that we have equated the left-hand 
sides of (1.3.2), (1.3.5) give 


eAteP = e(AtB+C) (1.3.6) 
where 
C= =[A,B)+-—[A,[4,B]] + — [B,[B,A]] +... (1.3.7) 
as Ie? Lae es 


4. A Modification of the Baker-Campbell-Hausdorff 
Formula 


A modification of the formula in (1.3.6) may be obtained as follows. We 
may use the latter to write 
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exp (A+B) exp(~A) exp (~B) 
= exp(A+ B) exp (-(4+5)+ ; [A, B] 


-= [A,[A,B]] - 5 [B,[B, A]] +.) 


= exp (5 [A, B] + - [A,[A, B]] - : [B,[B, Al] +...) (1.4.1) 


where in writing the last equality we have used (1.3.6)/(1.3.7) all over again 
to the product of the two exponentials occurring on its immediate left-hand 
side. This gives 

elAtB) eA eB — QP (1.4.2) 


where 


D= 5 14,B]+5[4.14,8]] - 5 [BBA] +... (1.4.3) 


II 


Convexity and Basic Inequalities 


In this appendix, we study the concept of convexity and some of the in- 
equalities which follow from such a property. The inequalities derived, include 
the Minkowski, the Hélder and Young’s inequalities of integrals. 


1. General Convexity Theorem 


Let h(x) be a real function of a real variable x such that its derivative 
h'(a) is non-decreasing, i.e., h(x1) < h(a) for 21 < xg. Then for any real 
numbers qa; > 0, a2 > 0 such that a, + a2 =1 


h(aya + agb) < ayh(a) + agh(d). (II.1.1) 


For a = 0, this trivially holds true with an equality sign. Therefore, without 
loss of generality, we may take b > a. 

To prove this, we note that by the Mean-Value Theorem, we may find 
numbers c and d (b> a): 


such that 


no Meta He Howie —H9) ay 


1,» _. R(b)—h(aya+azb) _ h(b) — h(aya+ agb) 
ba) b—(aya+agb) a;(b—a) ue?) 
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But, by the property of h’(x), h’(c) < h’(d), hence 


h(aya + agb) — h(a) e h(b) — h(aya + ab) 
a2(b — a) = ay1(b — a) 


(11.1.4) 


which by rearrangement of terms and recalling the property a; + ag = l, 
leads to the inequality in (II.1.1). 

As an example, consider the function h(x) = x”, for x > 0, p > 1. Then 
h'(x) = px?! and is non-decreasing in x, and 


(aya + ab)? < ay(a)? + a2(b)? (II.1.5) 
for a, b> 0. 
As another example, consider the function h(x) = —Inz, x > 0. Then 
h'(a) = —1/z, and is increasing in x. That is, 
—In(aya + a2b) < —a; Ina— agInb (11.1.6) 
or 
In (a“b°?) < In(aya + a2b) (II.1.7) 
and 
aetb™ < aya+ ab (11.1.8) 
a,b > 0. 


2. Minkowski’s Inequality for Integrals 
For 1 < p, let 
1/p 
isn = (far iseor) — >0 (1.2.1) 


where f(x) may, in general, be complex. Then the inequality in question 
reads 


If +olle < llflle + lla: (11.2.2) 
To prove this, we note that 
ftal if ll if ll 
_~ t =a I Ze 
[fle tlle < [fle tlole * Wip+loly  “Iflp Tal “2 
Mee ll lal 
= P = __!9lp 12: 
“= Tlptlale’ ~~ [lp + llolp oe) 


and with a = |f|/||fllp, 6 = lgl/Ilgllp, (11.1.5), dealing with real numbers, 


gives 
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If +l < (Illy + llallp)” (« ae aia ity] (11.25) 


where we recall that 


(Wl)” = fax icor. (11.26) 


Therefore upon integrating (II.2.5) over x, and taking the 1/p root leads to 
the inequality in (II.2.2) finally by using the fact that aj + a2 = 1. 


3. Hodlder’s Inequality for Integrals 


Let 
(Fla) = fax #°(%) a(x) (11.3.1) 
and ||f||, as defined in (II.2.1), then inequality in question reads 
Kf la)I < If llp gle (II.3.2) 
where p, q > 1, such that 
1 1 
-+-=1. (11.3.3) 
P sq 


To prove the inequality in (II.3.2), we use (II.1.8) to write 


up 1/q 
IPs)? loc l4 1 IFC 1 I9x)It 
(fe (get) < P (IIfllp)? +5 (lgll,)° (II.3.4) 


which upon integration over x gives 


[2° #0019001 < If lla (- x: ~) =[lfllpllgll:  (L3.5) 
RY Pp q 
But 

Flo < fa’ [4] la) (11.3.6) 


from which the inequality in (II.3.2) follows. 
For p = q = 2, (11.3.2), gives the Cauchy-Schwarz inequality 


< (fax reer) ([arx eo) (11.3.7) 


[ax reat 
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4. Young’s Inequality for Integrals 


We provide one of Young’s inequalities and it is given not in the most 
general cases but in a form which is sufficient for the applications considered 
in the text as stated: 


a's f ary (x) Gc—¥) o(9)| <I IG lle UL) 


where p, q > 1 and satisfy (11.3.3). 
To prove (11.4.1), we note that its left-hand side of (II.4.1) is bounded 
above by 


[arx [ery (Itealleee—y)1"") (ln@I1@@—y i") 2) 


which from Hélder’s inequality (II.3.2) in 21 dimensional space, is bounded 
above by 


( fa’ [ary inca tace—y1) "(fats [ary aie aes)” 


= |flly Gl gla (II.4.3) 


where we have used (11.3.3) in writing ||G||1, 


iG]: = fe ax |G(*)| (11.4.4) 


III 


The Poisson Equation in 4D 


In four dimensional Euclidean space, a vector 
r= (2’, 2, 2°, 2*) (III.1) 
in spherical coordinates may be written with components as 
1 


x =rsin@cos ¢sin x 


x? = rsin @sin dsin y 


(III.2) 
x® = rcos@sin x 
x’ = rcosx 
when r = |r|, and 
O<r<ow, 0<d0<7, 0<G<2n, 0X yK<T. (III.3) 


For the Jacobian of the transformation (x!,2?,2?,24) — (r,0,, x), we 
have 


O(a ea) 


and the volume element is given by 
d‘r = r°dr sin dé d¢ sin” y dx. (III.5) 


The volume V(R) of a sphere of radius R in 4D is then equal to 


R T Qn wT a? R4 
V(R) = | rdr i sin 6 d0 i dd | sin? xdy = (III.6) 
0 0 0 0 2 


To obtain the surface area S(R) of the sphere of radius R in 4D, we replace, 
in the process, r?dr by R® in (III.5), giving 
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S(R) = 277 R? (III.7) 
with the solid angle element 
dQ = sin 6 dé d¢ sin? x dx. (III.8) 
The Dirac delta distributions are given by 
i(p — ry = =) 10-8) (6 ~ 9) Hx -W) 


III.9 
i sin @ sin? x ( ) 
oy 5(0 — 0") 5(6 - 4") d(x — x! 
sin # sin” x 
1. The Poisson Equation 
The Poisson equation in 4D is the content of the following result: 
1 
“Pore = —4n? 54(r — 1’). (III.1.1) 
To establish this, let 
VW’?G(r,r') = —O4(r — rr) (IIT.1.2) 
and upon writing 
i(r—r’) = | dK kee’) (IIT.1.3) 
r-r)= (On) e€ wl. 
we obtain , eee 
d k el ear 
+= —_——. TII.1.4 
G(r,r ) le k? ( ) 


The expression on the right-hand side may be rewritten as: (p = |r — r’|) 
27) 2 f* 2 dre 
( ™) - | ak [ax (pewrers) sin x 
(27)* p Jo 0 dx 
ey. a dy cos x ei*P cos x 
An p Jo 0 


1 Ook Gy. fT. 
=——_— | dk — | dy elke osx, IID.1.5 
me aL ee ( ) 


Using the properties, 


i dx el SSX — a Jg(kp),  Jo(0)=1,  Jo(oo) =0 (III.1.6) 
0 


where Jo(z) is the Bessel function of order zero, in (III.1.5) establishes the 
result stated in (III.1.1). 
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2. Generating Function 


In 3D, |r—r’|~! may be expanded in terms of Legendre polynomials with 
well known orthogonality properties. We extend such a procedure to 4D. 

To the above end, we introduce the special functions referred to as Cheby- 
shev’s polynomials U,,(x) of type II. The generating function of these special 
functions is given by 


——_—— t?U, 1 t} <1. T1.2.1 
TSE -oru ye Wel aay ie (111.2.1) 


The following properties of the U,,(x) are to be noted: 
Uo(x) a 1, U;(z) = 2a, Up(z) = 477-1, 


U3(x) = 4a(22? —1), U4(x) = 162* — 1227 +1,... (III.2.2) 
Un(—2) = (-1)"Up(z) (III.2.3) 
Un4i(a) = 22U,, (x) — Un-1(2). (III.2.4) 


They satisfy the differential equation 
(1 — 2?)U" (a) — 32U} (x) + n(n + 2)U,(x) = 0, (III.2.5) 


the orthogonality relation 


[ow /1 — «2 U,(2) U, 


and the following integral 


[erm cy + V1—22 /1—¥2 Z\=2 og Un(a) Uni) (III.2.7) 


In particular, since Up(x) = 1, (III.2.6) gives 


1 
i. dz V/1 — 22 Up (x) = + bn0- (III.2.8) 
-1 


The following explicit expression is also useful 


ieee (III.2.6) 


sin ((n + 1) arccos x) 


Un(z) = sin (arccos x) Se 
For x — 1, the latter gives 
U,(1) = (n+1). (III.2.10) 
For z = 0, we also have 
U2y(0) = (—1)” (III.2.11) 


Urn+1(0) = 0. (III.2.12) 
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3. Expansion Theorem 


Let 


ree min(r,?’), ro =max(rr'), «= —,, (III.3.1) 
rr 


then the expansion theorem (Theorem III.1) directly follows from (III.2.1). 


Theorem III.1 


1 3 (ee Uy (==) (11.3.2) 
pore Ga Mae -_ 

To establish a result given below, we note that the above expression may 
be conveniently rewritten as 


1 


|r — r’|? 


= ys {or =r") (r')” [r-2-"U (2) 


+O(r’ — r) (r')7?2 [PwC \ (III.3.3) 


where « is defined in (III.3.1). 
By using the properties 


0 ae x ; 
Aa O(r-1r') = ; o(r—r') (IIT.3.4) 
3x 
Yc= eo (III.3.5) 
2 3 
Ver= (IIT.3.6) 


y (2) a (II1.3.7) 


y &) ae =e (III.3.8) 


4 
Or Ox 
(=) (5 =0 (II1.3.9) 


where z is defined in (III.3.1), and the differential equation (III.2.5), we obtain 
from (III.3.3) that 
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1 d(r—r) 
= = S 2(n + 1) Up (2). (I11.3.10) 


n=0 


2 


|r — r’|? 


Upon comparison of (III.3.10) with (III.1.1) and using (III.9), (III.10), we 
may infer the following result: 


> case, Un (x) = (2 — 0) (II1.3.11) 


where 


v=N-N’ (III.3.12) 


and N, N’ are unit vectors in 4D. 
Now we prove the following indispensable orthogonality /completeness re- 
lation over angles. 


4, Generalized Orthogonality Relation 


Theorem IV.1 


[oo U,(N-N”) Up(N” +N’) = OnmUn(N+N’) — (III.4.1) 


where N, N’, N” are unit vectors in 4D. 
Without loss of generality we may choose the «* axis along N’, and set 
N -N’ = cosy, N”.N’ =cosy”. (II.4.2) 
We may then write 
N-N” =cosx cosy" +sin x sin x” cosa (IIT.4.3) 


where 
cos a = cos @ cos 6” + sin @sin 6” cos (¢ — ¢"). (III.4.4) 


We may expand U,, (N : N”) in terms of Legendre polynomials in the 
angle a, i.e., 


Un(N+N") = So Ank(x, x”) Pe (cos a) (11.4.5) 


aT ee S> Ym’ (9,4) Yibns(8" 6") (11.4.6) 
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Upon integration of (III.4.5) over (cosa): —1 < cosa < 1, using the 
integral (III.2.7), with the argument of U,, in the integrand given in (III.4.3), 
and the fact that : 

i d(cos a) P;, (cos a) = 25x0 (III.4.7) 
-1 


we obtain for the coefficient Ano: 


U,(cos x) Un (cos x") 


ray (III.4.8) 


Ano(x; x") _ 


For the integral on the left-hand side of (III.4.1), the latter is explicitly 
given by 


n ok 
4 
y > Rea Yim (89) Joo Ank(x3 x") Vien (0", 6") Um (cos x") . 


k=0 m’/=—k ) 
(IIT.4.9) 
From the definition of the solid angle element dQ” (see (III.8)), and the 


integral 
20 wT 
| dg” ? de” sin g” Yim (0", ¢") =v An Ono Om'0 (III.4.10) 
0 0 
(III.4.9) reduces to the expression 
1 w 
in Vie fa ” Anol(x, x”) Um(cos x”) sin? y” 
are ox o(X, x") Um (cos x") sin” x 
An 


= U;,(cos x) | dy” Un (cos x") Um (cos x”) sin? x” 
(n+ 1) 0 
(111.411) 


where in writing the last step, we have used (III.4.8). The integral in (III.4.11) 
satisfies the orthogonality relation in (III.2.6). Hence by using the fact that 
cos xX = N-N’, the result stated in Theorem IV.1 immediately follows. 
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discontinuity across a cut, 697 

discrete eigenvalue, 43 

discrete set, 65 

discrete spectrum, 183, 200 

dispersion relation, 695 

distinguishable states of a physical 
system, 67 

disturbing transition of a system by a 
measurement, 71 

divergent series, 695 

domain of A, 41 

double-slit experiment, 74, 78, 551, 552, 
599 

double-valuedness of spinors under 
rotation, 129, 896 

dual vector space, 23 

Duffin-Kemmer-Petiau equation, 939, 
970 

dynamical phase factor, 515 


effective charge “renormalization”, 856 

effective range, 878 

effective range expansion, 878 

eigenvalue equation, 43 

eigenvalue problem, 24 

eigenvalue problem and supersymmetry, 
224 

eigenvalues, 53, 183 

eigenvectors, 43 

eikonal approximation, 842 

eikonal phase function, 844 

elastic scattering, 800 

elastic scattering amplitude, 866 

electric dipole moment, 404 

electromagnetic potential, 371 

energy “width”, 170 

energy functional, 741 

energy loss per unit path length of 
charged particle, 868 

energy shell, 804 

energy-time uncertainty principle, 170 

entangled ancillary pair of particles, 
501 

entangled states, 72, 499, 500, 503 

entangled swapping, 503 

EPR paradox, 487 

essential spectrum, 47, 48, 53 


Euclidean space, 4D, 592, 981 

Euler angles, 255, 257 

even permutation, 83 

event, 892 

exchange term, 756 

existence of bound-states for v = 1, 192 

existence of bound-states for v = 2, 194 

existence of bound-states for v = 3, 195 

expansion theorem, 984 

expectation values, 10, 92, 246, 675 

explicit expression for the Clebsch- 
Gordan coefficients, 284 

exponential decay, 172, 427 

exponential law, 432, 727, 728, 733 

exponential representation of rotation 
matrix, 84 

expression for g-factor, 440 

expression for the ground-state energy 
of atoms, 762 

expressions for transformations 
functions, 656-661, 664 

external field problem, 926, 930, 933, 
944 

external force, 335 

external sources, 335, 674 


Faddeev equations, 864 
“fall” of a particle into another, 143 
Fermi excitations, 640 
Fermi oscillator, 343 
annihilation, creation operators, 343 
ground-state persistence amplitude, 
344 
Fermi-Dirac factor, 345 
fermions, 36 
Feshbach-Villars equation, 939 
field equation, 531 
filter, 19 
fine-structure, 689 
fine-structure constant, 372, 375, 393 
fine-structure correction, 382, 383 
fine-structure of the H-atom, 377, 379 
£=0 states, 381 
£#0 states, 382 
finite degeneracy, 183 
first class constraints, 618 
fixing a gauge, 621 
fluctuations of the position of the 
electron, 375 


flux, 572, 573 

Foldy-Wouthuysen-Tani representation, 
919, 927, 928, 969, 970 

forced harmonic oscillator, 661 

Fourier series, 573 

Fourier transform, 99 

Fourier transform in the complex 
domain, 174 

“free” time evolution operator for the 
Coulomb case, 832 

functional, 654 

functional derivative, 653, 654 


g-factor, 375 
g-factor for the electron, 930 
g-factor of the proton, 376 
Galilean frame, 81 
Galilean invariant theory, 101 
Galilean scalar, 89 
Galilean scalar (field), 117 
Galilean space-time coordinate 
transformations, 83 
Galilean transformation of operator, 92 
Galilean transformations, 81, 82, 86 
Galilean vector (field), 117 
gamma matrices, 885-888 

chiral representation, 888 

Dirac-Pauli representation, 885 

Majorana representation, 888 
gauge fixing parameter, 621 
gauge freedom, 621 
gauge transformation, 529 
Gaussian distribution, 467 
Gaussian integrals, 543, 613, 659, 702 
general solution of Dirac equation, 908 
generalized orthogonality relation, 985 
generalized oscillator strength, 869 
generating function, 570, 735, 983 
generation of arbitrary spins, 121 
generation of states, 15 

(a| and |b) symbols, 16 

two-stage process, 15 

annihilation, 16 
production, 16 
generators of the Galilean transforma- 
tions, 91 

geodesic, 529-531 
geodesic equation, 529 
geometric phases, 513, 515, 519, 520 
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Golden rule, 720 
Grassmann variables, 137, 344, 633, 
635, 674 
Green function near the energy shell, 
833 
Green functions, 547, 580, 740 
matrix notation, 580 
group properties, 55, 87, 88 
associativity, 87 
inverse, 87 
grouping of particles into clusters, 107 


Ho6lder’s inequality, 148, 222, 782, 979 
Hamilton-Jacobi-like equation, 738 
Hamiltonian operator, 55, 94 
harmonic oscillator, 329 
annihilation and creation operators, 
335 
finite temperature, 340 
Gaussian probability density, 355 
ground-state persistence amplitude, 
336 
persistence probability at high 
temperatures, 342 
spectrum, 331 
stimulated excitations, 340 
time-dependent disturbance, 335 
transition to and between excited 
states, 335 
wavefunctions in the x-description, 
333 
harmonic oscillator wavefunctions, 238 
h (Planck’s constant divided by 27), 
90, 91 
Heisenberg representation, 652 
Heisenberg’s equations of motion, 97 
Heisenberg-like picture, 92 
helicity, 313, 322, 900, 915, 953 
helicity states, 313, 322 
helicity versus standard spin states, 315 
two particle systems, 324 
Hermitian operator, 24, 41 
Hermiticity, 40 
high-density limit for matter, 780 
high-order perturbations, 695 
Hilbert space, 33 
“hole” theory, 932 
hydrogen atom, 370 
eigenstates, 366 
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eigenvalue problem, 363 
orthogonality relation, 368, 369 
stability of, 363 

hydrogen molecule, 692 

hyperfine-structure of the H-atom, 377, 


384 
£=0, 387 
£40, 391 


ideal apparatus, 71, 73, 465, 477, 480 

identity operation, 4, 12 

identity operator, 66 

impact parameter, 830 

impact parameter representation of 
scattering amplitude, 844 

indistinguishable particles, 36, 819 

indistinguishable spin 1/2 particles, 820 

inelastic processes, 800, 867 

inertial frame, 881, 892 

infimum, 51 

infinitely many degrees of freedom, 486 

infinitesimal Lorentz transformations, 
896 

infinitesimal rotation, 83, 894 

infinitesimal transformations, 83, 91, 
92, 894 

inflation of matter, 781 

initial value problem for Dirac equation, 
917 

initially prepared state, 16 

inner product, 23 

inner product space, 23, 34 

intensity distribution, 535 

intensity of light, 33 

inter-clusteral interactions, 108, 109, 
847 

interacting states, 800, 801, 851 

interaction picture, 429 

interaction time, 471 

interaction with a harmonic oscillator, 
467 

interaction with an apparatus, 463 

interactions of short range, 800 

interference effect, 524 

interference terms, 74-77, 469, 477 

internal angular momentum, 119, 120 

internal energy, 104 

intra-clusteral interaction potential, 
108, 847 


intrinsic parities, 496, 909, 931 
intrinsic parity of a photon, 496 
invariance, 89, 97 

invariant, 8 

inverse operator, 45 
irreducibility of constraints, 618 
isolated point, 43 


Jacobian, 118 

Jacobian of transformation for 
Grassmann variables, 638 

Jensen’s formula, 177 

joint probability of spins correlations, 
494 


ket, 23 

key (cryptography), 504, 506 

kittens, 419 

Klein-Gordon equation, 937, 938, 970 
Kramers degeneracy, 132 

Kummer’s equation, 936 


L-H polarizations, 496 

labelling of space-time points, 82 

ladder operators, 250, 254 

Lagrange multipliers, 617, 621 

Lagrangian for spin 0 particles, 955 

Lagrangian for spin 1/2 particles, 957 

Lamb shift, 391 

computation of, 394-401 

Lamb shift and renormalization, 398 

Landé g-factor, 294 

Landau levels, 436 

Landau-Larmor energy, 437 

Laplacian in parabolic coordinate 
system, 417 

law of reflection, 565, 570 

left-hand derivative, 634 

left-handed, 916 

left-handed (L-H) polarization, 495 

Lieb-Thirring bound, 217, 776 

Lieb-Thirring kinetic energy inequality, 
221, 224 

lifetime, 170 

limit of potential well, 190 

line of singularities, 528 

linear and quadratic potentials, 555 

linear combination, 17 


linear combination of selective 
measurement symbols, 11 
linear operator, 57 
linear superposition, 17 
“linearity” condition, 37 
Lippmann-Schwinger equations, 802 
Local Hidden Variables (LHV) theories, 
488 
locally square-integrable, 182 
Lommel functions, 812 
long wavelength approximation, 406 
longitudinal component of magnetic 
field, 534, 537 
Lorentz covariance, 892, 894 
Lorentz force, 531, 538 
Lorentz invariant scalar product, 899 
Lorentz scalar, 900 
Lorentz transformations, 892, 893 
Lorentz vector, 900 
lower and upper bounds for the 
ground-state energy for matter, 
774 
lower bound for the expectation value 
of the separation distance between 
two particles in negative energy, 
148 
lower bounds to the expectation value 
of the kinetic energy, 220 
multi-particle states: bosons, 224 
multi-particle states: fermions, 222 
one-particle systems, 220 


Moller wave operators, 807, 879 

machine, 14-16 

machine operating in reverse, 15 

machine represented by 2 x 2 matrix, 
20 

magnetic dipole moment, 372 

magnetic dipole moment of the proton, 
373 

magnetic field gradient, 537, 538 

magnetic monopole, 528 

Malus formula, 31 

Markov approximation, 431, 731, 732 

mass renormalization, 401 

massless Dirac particles, 912 

master equation, 727, 728, 731 

matrix elements, 12 

matrix elements of exp (ipn-S/h), 127 
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matrix elements of finite rotations, 254 
Do (a, 8, -y) functions, 257, 258 

de, (8) functions, 258 
orthogonality properties, 284 
matrix multiplication, 12 
matter without the exclusion principle, 
765 
mean lifetime, 404, 444 
mean lifetime of the state 2P,/2, 404 
“mean” orbital angular momentum, 
923, 925 
“mean” position, 926 
“mean” position operator, 923, 925 
“mean” spin, 923 
“mean” spin operator, 925 
mean square radius of proton, 417 
mesoscopic states, 484 
metric, 524, 525 
metric on the unit sphere, 527 
minimal electromagnetic coupling, 378 
Minkowski metric, 886 
Minkowski’s inequality, 978 
mixed-description, 12 
mixtures, 25 
mixture, 26, 31 
random mixture, 27, 29 
modified Bessel functions, 220, 570 
momentum operator, 96 
monitoring observables, 79 
monitoring spin, 478 
motion restricted to the surface of unit 
sphere, 527 
Mott scatterings, 837 
multi-channel scattering, 846, 849 
multi-component object, 89 
multi-electron atoms, 105, 739 
multi-particle Hamiltonians, 104, 167, 
739, 766 
multi-particle systems, 104, 106, 148, 
163-168, 739, 766, 850 
multi-particle systems with Coulomb 
interactions, 167, 739, 766 
muon decay, 932 
Meller wave operators for Coulomb 
potential, 879 


N°®/3 law for bosons, 784 
natural units, 90 
“needle” registering-value, 65 
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neutral state, 71 
neutron interferometer, 867, 871, 875 
9-7 symbol, 304, 306 
orthogonality property, 307 
relation to 6-7 symbols, 306 
No-Binding Theorems, 197, 774 
non-abelian gauge theories of funda- 
mental interactions, 519 
non-abelian transformations, 519 
non-commutativity of measurements, 
31, 423 
non-degenerate perturbation theory, 
684 
non-empty subspace, 43 
non-flip of spin, 75, 476 
non-holonomic system, 524 
non-local density of states, 204, 583 
non-relativistic electron, 403 
non-zero spins, 89 
normalization condition, 9, 18, 24, 100 
normalization condition for momenta, 
100 
normalization factor, 70 
normalization of probability, 492 
number of eigenvalues, 210, 583, 668 
Bargmann inequality for number of 
eigenvalues, 215 
number of eigenvalues in one- 
dimensional case, 213 
number of eigenvalues in three- 
dimensional case, 211 
number of eigenvalues in two- 
dimensional case, 212 
Schwinger inequality for number of 
eigenvalues, 211 
numerical factor (b| a), 4 
numerical factors, 7, 8, 10 


observable, 2 
compatible observables, 2 
complete set of compatible observ- 
ables, 2 
continuous set of values, 2 
finite number of discrete values, 2 
incompatible observables, 2 
infinite number of discrete values, 2 
odd permutation, 83 
old quantum theory, 712 
one-time pad scheme, 504 


optical theorem, 822, 853, 854 

optical theorem and its interpretation, 
821 

orbital angular momentum, 117, 258 

half-odd integral values?, 259 

order of measurements, 505 

orthogonal, 51 

orthogonal entangled states, 502 

orthogonal projection operations, 910 

orthogonality conditions, 66, 79 

orthogonality relation, 45 

orthogonality relation of photon states, 
416 

orthonormal basis, 58 

orthonormality property, 122 

oscillator strength, 869 

oscillatory magnetic field, 13 


p-description, 100 
Padé approximant, 738 
Padé approximants method, 702 
pair annihilation, 931 
pair production, 931 
Paley-Wiener Theorem, 172, 174 
Pancharatnam definition, 524, 546 
parabolic coordinate system, 417 
parallel transport, 524 
parallel transport of a vector, 527 
parameter space, 517 
parity, 112, 114 
parity transformation, 130, 902 
particle detection, 807 
Paschen-Back effect, 408 
passage of charged particles through 
hydrogen, 867 
passage of particles through media, 867 
path integrals 
and velocity dependent potentials, 
608 
completeness relation, 601 
constrained dynamics, 614 
for a given potential, 604 
particle in external electromagnetic 
fields, 608 
Pauli equation, 433 
Pauli exclusion principle, 765 
Pauli Hamiltonian, 229, 432 
Pauli matrices, 421, 885 
Pauli’s fundamental theorem, 886, 889 


P,C, T transformations, 902 
PCT transformation, 969 
permutations, 123 
phase factor, 61, 522 
phase shifts analysis, 825, 865 
photon, 391, 442, 447 
physical system transitions, 69 
physically observed mass, 398, 403 
plane rotator, 737 
Poincaré sphere, 526 
Poisson equation, 415 
Poisson equation in 4D, 982 
Poisson probability mass function, 352 
Poisson sum formula, 571 
polarization of light, 29 
0-polarizer, 32 
x-polarized state, 30 
x-polarizer, 30 
y-polarized state, 30 
unpolarized light, 32 
polarization vector, 391 
polarized along the z-axis, 457 
polarized beam, 457 
position operators, 93, 96, 919, 942 
positivity, 536 
positivity constraint, 537 
positron, 109, 393, 494, 882, 932 
possible outcomes of spin measure- 
ments, 492 
potential well, 187 
power of test of detection of the 
intruder, 507 
preparatory channels, 851 
presence of radiation, 403 
primary constraints, 617 
probabilities of counts, 488 
probability, 9, 18, 68 
probability current, 917 
probability density, 29, 536 
probability distribution of the kinetic 
energy, 414 
probability for the “fall” of one particle 
into the other, 148 
probability mass function, 488 
probability of decay, 480, 481 
product of the exponentials of 
operators, 973 
projection operators, 42, 80 
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Pryce-Newton-Wigner operator, 924, 
969 

pseudo-scalar, 902 

pseudo-vector, 902 

pure ensembles, 25 


quadrupole moment 
reduced matrix element, 327 

quantization of the electric charge, 528 

quantized flux, 575, 576 

quantum anti-Zeno effect, 482 

quantum correction, 759 

quantum cryptography, 501, 503 

quantum decoherence, 79, 463, 469, 
472, 482, 728 

quantum dynamical principle, 649, 650, 
652, 663 

quantum dynamical principle to path 
integrals, 669 

quantum dynamics, 100 

quantum electrodynamics, 375, 490, 
881 

quantum field theory, 881 

quantum fields, 954 

quantum mechanical counterpart of 
Lorentz-force, 534 

quantum mechanical phase depending 
on the gravitational constant and 
the Planck constant, 877 

quantum superpositions, 79 

quantum teleportation, 501 

quantum Zeno effect, 482 


R-H polarizations, 496 

radial functions, 404 

radiation field, 391 

Ramsauer effect, 878 

Ramsey apparatus, 13, 14, 473, 478 

Ramsey oscillatory fields method, 473 

Ramsey zones, 477, 478, 509 

N Ramsey zones, 480 

Ramsey-like method, 508 

Rayleigh-Schrédinger series, 688, 700, 
703, 736 

rays, 37 

re-summation methods, 702 

reactive cross section, 866 

real Grassmann variables, 633 

reduced density operator, 468, 472, 728 


1008 Index 
reduced mass effect, 377 
reduced masses, 849 
reduced matrix element, 293, 296 
reflection coefficient, 561 
reflection probability, 711, 874 
registering apparatus, 66 
relative momentum, 848 
relative phases, 56 
relativistic correction to the kinetic 
energy, 375 
relativistic quantum field theory, 881 
relativistic wave equations for any mass 
and any spin, 947 
relativistically invariant, 392 
relativity, 392 
renormalization 
contact term, 449 
counter term, 398, 402 
effective charge, 855, 932 
of mass, 402, 403 
physical significance and stability of 
the electron, 398, 402, 403, 449 
Z3: wavefunction renormalization 
constant, 393 
renormalized mass, 398, 402, 403 
representation of simple machines, 14 
representations of the generators P, X, 
96 
repulsive Coulomb potential energy 
lower bound for multi-particles, 767 
“resistance” of Fermi particles to 
increase in density, 149 
resolution of the identity, 42, 353 
resolution of the identity in the 
x-description, 100 
resolvent, 46, 152 
resolvent set, 47 
resonance, 481 
resonance condition, 509 
rest energy, 371 
retarded Green function, 549 
retarted/advanced Green functions, 801 
reversal of phase, 513 
Riemann-Lebesgue Lemma, 732 
Rigged Hilbert space, 38, 39 
right-hand derivative, 634 
right-handed, 916 
right-handed (R-H) polarization, 495 


role of the environment, 469 

rotation matrix, 82 

rotation of a coordinate system, 84 

rotation of a spinor, 508, 511, 512 

rotation of a spinor by 27 radians, 129, 
511, 512 

Rydberg, 372, 400 


6-3 symbol, 304 
orthogonality relation, 305 
relation to 3-7 symbols, 304 
sum rule, 305 
Samuel-Bhandari (SB) phase, 529 
scale transformations, 8, 146 
scattering S operator, 879 
scattering amplitudes, 804, 815, 826, 
834 
scattering at small deflection angles at 
high energies, 842 
scattering length, 878 
scattering of indistinguishable charged 
particles, 837 
scattering of spin 1/2 particle off a 
spin 0 target, 454 
scattering of spin 1/2 particles off a 
spin 1/2 target, 459 
scattering theory 
functional treatment, 838 
Schrédinger equation, 101 
Schrédinger’s cat, 482 
Schwinger’s constructive approach, 958 
scrambled message, 504 
second class constraints, 618 
second rank anti-symmetric spinor, 129 
secondary constraints, 618 
secular equation, 689 
selection rule, 266 
selective measurement symbols, 2, 14, 
30 
selective measurements, 2, 3, 18, 486 
polarizer, 3 
polarized light, 3, 30, 495 
unpolarized light, 3, 32 
prism, 3 
Stern-Gerlach apparatus, 3, 4, 13 
self-adjoint, 41 
self-adjoint operators, 39 
separability, 34, 80 
separated oscillatory fields zones, 473 


shape invariant partner potentials, 233 
silicon, 873 
smooth function, 722 
solenoid, 572 
solid angle, 517, 982 
solution of the free Feshbach- Villars 
equation, 941 
solutions of the fully interacting 
systems, 802 
space reflection, 112 
space translations, 91 
spatial extension of bound-state 
systems, 148, 150, 151 
spatial extension of matter, 782, 783 
spatial extension of the fermionic 
systems, 150 
spatial rotations, 892, 894 
spectra of self-adjoint operators, 41, 45 
spectral decomposition, 42 
speed of light, 370, 399, 892 
spherical Bessel functions, 827 
spherical harmonics, 262 
addition theorem, 267 
integral involving three spherical 
harmonics, 301 
reduced matrix element, 303 
special values, 265 
tensor operator Y;7, 299 
spherically symmetric potential, 214 
spin, 269, 898, 900, 947 
arbitrary spins, 274, 947 
density operators, 453 
spin s out of 2s spin 1/2’s, 129 
spin and relativistic corrections, 370 
spin and statistics connection, 962 
spin component in the +z direction, 14 
spin of the proton, 385 
spin precession, 441 
spin versus helicity states, 313 
spin-flip, 73, 74, 76, 131, 469, 475 
spin-orbit (SL) coupling, 376, 928 
spinors, 121 
spin 0, 117, 783, 819, 820, 937, 955, 963 
spin 1, 6, 118, 129, 272 
spin 1/2, 5, 7, 74, 77, 270 
general aspects, 420 
spin 1/2 in external magnetic fields, 
423, 425 
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splitting of beam along quantization 
axis, 538 
spontaneously broken theories, 226 
square-integrable, 172 
square-integrable functions, 36, 44, 228, 
229, 233, 242, 243 
stability, 145 
stability and multi-particle systems, 
148 
stability of matter, 774 
stability of the hydrogen atom, 360 
stable system, 150 
Stark effect, 412 
state, 16 
state of combined (correlated) system, 
66, 468 
state of the apparatus, 67 
state vector, 18 
states of a physical system, 37 
statistical (density) operator, 26, 31, 
468, 472, 534, 728, 730 
step-function, 41, 42 
Stern-Gerlach, 2, 11 
filtering machine, 2 
filtering processes, 2, 6 
Stern-Gerlach effect, 531 
Stokes’s theorem, 516 
strong limits, 354, 806 
strong magnetic field, 408, 411 
subsidiary constraint, 622 
successive Galilean transformations, 86, 
91 
successive Lorentz transformations, 376 
successive operations, 3 
successive operations of two machines, 
15 
successive selective measurements, 31 
sudden approximation, 717, 720, 722 
sum of the negative eigenvalues, 216 
one-dimensional case, 218 
three-dimensional case, 216 
two-dimensional case, 217 
superposition principle, 17, 419, 482 
superpotential, 231 
superselection rules, 37, 484 
supersymmetric Hamiltonians in higher 
dimensions, 247 
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supersymmetric partner Hamiltonians, 
231 

supersymmetric partner potentials, 231 

supersymmetric transformations, 138 

supersymmetry, 136, 224, 434 

supersymmetry generators, 136, 225, 
226, 346, 348 

supremum, 51 

survival probability, 169, 432 

“switching on” of apparatus, 466 

symbol A(a) = |a)(al, 2 

symmetric spinors of rank k, 122 

symmetrization operation, 948, 970 

symmetry operation, 132 

symmetry transformation, 61 

systems of n particles, 104, 106, 118, 
148, 163, 166, 167, 739, 766, 850 


21.1 cm wavelength, 388 
3-3 symbol 
orthogonality property, 289 
relations to Clebsch-Gordan 
coefficients, 289 
T-scattering matrix, 865 
tensor operators, 296-303 
TF electron density and the effective 
potential, 740, 742 
thermal average, 340, 680 
thermal mixture, 340, 342, 345, 680 
Thomas factor, 376 
Thomas-Fermi (TF) approximation, 
587, 588, 740 
Thomas-Fermi atom, 739, 740 
computation of ground-state energy, 
745, 746 
Thomas-Fermi-like energy functional, 
769 
time reversal, 112, 130, 903 
time-dependent forced dynamics, 561, 
564, 631, 643, 656, 660 
time-dependent Hamiltonians, 111 
time-evolution, 101, 109, 111, 665 
trace, 12, 86 
trace functionals, 665 
trace operation, 10 
transformation function (x|p), 98, 99 
transformation function from the B- to 
A-descriptions, (a|b), 19 


transformation functions, 22, 649, 652, 

653 

transformation law under coordinate 

rotation, 95, 120, 121, 249, 258 

transformation law under time reversal, 

131, 903 

transformation rule of a spinor of rank 

one under a coordinate rotation, 

133 

transition amplitude, 337 

transition probabilities, 719, 722, 853 

translational independent contribution 
to J, 117 

transmission coefficient, 561 

transmission probability, 711, 874 

transpose, 86 

trial wavefunctions, 690, 777, 788 

triangular inequality, 24, 35 

“tuning” condition, 468 

turning point, 705, 707 

two-dimensional Green function, 570 

two-level atom, 428, 484 

two-level systems, 427, 737 

two-particle scattering, 818-821, 837 

two-particle states of arbitrary spins 
versus helicity states, 322 

two-particle systems and relative 
motion, 104 


ultraviolet cut-off, 396 

uncertainties, 144, 145, 362 

uniform time-independent magnetic 
field, 13 

unitarity, 67 

unitary operator, 55, 57, 78, 89, 112 

unpolarized beam, 455, 536, 538 

upper bound cut-off, 399 

upper bound for the expectation value 
of the kinetic energy operator for 
negative energies, 147, 150, 781 


vacuum fluctuations, 932 

variational form, 652 

variational methods, 690 

vector operators, 290, 298 

vector space, 16, 34 

vectors, 16, 33 

velocity dependent potentials, 608, 612 
Vernam cipher, 503 
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virial theorem, 245 barrier penetration, 709 
volume of sphere in 4D, 981 quantization rules, 712, 714, 715 
radial equation, 715 
WKB procedure in the one-dimensional 
radial equation, 716 


wavefunction in different descriptions, 
18 

Z3: wavefunction renormalization 
constant, 393 

wavefunctions of hydrogen atom, 368 

wavepacket, 533, 809, 919 


x-description, 96, 100 


weak limit, 354, 805 Young’s inequality, 211, 213, 980 
weak magnetic field, 411 Yukawa potential, 157, 198, 244 
weak perturbations, 717, 718 Yukawa term, 347 
Wigner’s Theorem on symmetry 
transformations, 57, 81, 89 Zeeman effect, 406, 412 
Wigner-Eckart theorem, 293, 297, 326, zero point energy, 334, 343, 350 
389 O vector, 17 
winding number, 571 Zitterbewegung, 375, 918, 923, 925, 


WKB approximation, 703, 705 926, 928, 931, 944 


